When web scraping, we currently use RegEx and Text to Columns to parse raw HTML, read in as text, into the appropriate format. A tool that could at least parse tables would be hugely beneficial.
This functionality exists within Qlik so it would be nice to have this replicated in Alteryx.
Obviously, we need to retain the ability to scrape raw HTML, but automatically parsing data using the <td>, <th> and <tr> tags would be nice.
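To make the request concrete, here is a minimal sketch of the kind of tag-based table parsing described above, written in Python with only the standard library. It is not how Qlik or Alteryx implement anything; the names `TableExtractor` and `extract_tables` and the sample HTML are hypothetical, and the sketch assumes well-formed markup with explicit closing tags and no nested tables.

```python
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect every <table> as a list of rows, each row a list of cell strings."""

    def __init__(self):
        super().__init__()
        self.tables = []     # finished tables
        self._table = None   # rows of the table currently open
        self._row = None     # cells of the row currently open
        self._cell = None    # text fragments of the cell currently open

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self._table = []
        elif tag == "tr" and self._table is not None:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that sits inside an open <td> or <th>.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None and self._row is not None:
            # Join the fragments and collapse runs of whitespace.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self._table.append(self._row)
            self._row = None
            self._cell = None
        elif tag == "table" and self._table is not None:
            self.tables.append(self._table)
            self._table = None
            self._row = None
            self._cell = None


def extract_tables(html):
    """Return all tables found in an HTML string (hypothetical helper)."""
    parser = TableExtractor()
    parser.feed(html)
    parser.close()
    return parser.tables


# Illustrative input only; a real page would be downloaded first.
sample = """
<table>
  <tr><th>State</th><th>Capital</th></tr>
  <tr><td>Alabama</td><td>Montgomery</td></tr>
  <tr><td>Alaska</td><td>Juneau</td></tr>
</table>
"""
tables = extract_tables(sample)
for row in tables[0]:
    print(row)
```

Each table comes back as rows of cells, with the <th> row first, so the first row could be promoted to field names. Real-world pages often have merged cells, nested tables or missing closing tags, which this sketch does not handle.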
In the following page there is a table showing the states and territories of the US:
With Qlik, you can input the URL and it will return the available tables in tabular format:
As this functionality exists elsewhere it would be nice to incorporate this into Alteryx.
+1 - PowerQuery already has this functionality too. I can point it at a Wikipedia page or similar and it just automatically scrapes the data. It even handles 'table-like' data.
Any updates on this?
By the way, the number one 3rd party data source seems to be web pages, static or dynamic.
Especially in competitive price comparisons etc.
Any chance Alteryx could achieve a capability like
Excel also has this!
Thanks Mceleavey!! That's awesome. Sure helps a lot!