Now that it's live, don't forget to accept your certification badge on Credly today! Learn more here.

Alteryx Designer Discussions

Find answers, ask questions, and share expertise about Alteryx Designer.

Extract a table from Wikipedia

7 - Meteor



I tried to use the download tool to extract a table from a wikipedia page:

I'm trying to get the big table with country code and country names.



I've used the download tool to get the data, used Text to column tool with "Split to Rows" and \n as delimiter.

It basically gives me the html code of the page.


What's next to isolate that specific table?

This would take seconds in powerquery and easily refresh, it'd be great to have a user friendly tool to grab data from web...





11 - Bolide

Hey Tibo,


In situations like this, I like to build a custom API via and use it with the download tool in Alteryx. This is free, and can capture data from very complex webpages with little effort. 




12 - Quasar
12 - Quasar

This isn't perfect but it should get you started.


Using XML tool, dynamic select, and transpose you can pull the XML data for each specific row of the table you are after.

15 - Aurora

The attached solution utilizes observed specifics in the HTML to extract the relevant rows generate the Abbr/CountryName list in straight-forward fashion.  Granted, if they tweak the table, results may go bad, but that may well be the case with any solution.


(But as noted above, is probably the way to go here: it captures every column in the table).

16 - Nebula
16 - Nebula

Try using to set up a parse process.


This then presents a JSON API endpoint you can use in Alteryx much more easily


For this page I tried the and it worked staright away.


You need to create an account and save the API. After this go to Export and Simple API integration.


For this page my generated API was:


I set up a download tool to grab the JSON and then parse with standard tools.


Attached a sample

16 - Nebula
16 - Nebula

Double post 😞


Hi Tibo,


There's a few different ways of Parsing the html table listed here, so I thought that I would include mine. The base of it is around using REGEX to parse each element to a new line (I used to replace each element with a pipe (eg Replace([DownloadData],"<table","|") and then parse on the pipe before I got used to REGEX).


I'll also jump on the love for, as once the page gets a little more complicated than a simple table, handles those elements better.



7 - Meteor

Thanks Guys,


This is helpful, I'm familiar with so will use that as it is the easiest here.

I just can't wait for such functionality to be embedded in a proper Alteryx tool because all the parsing methodologies outlined here are way too long for me, I'd be quicker copying and pasting the table in a text input and refresh it manually 🙂


Anyway this gives me ideas for other stuff so thanks for spending the time




5 - Atom

Is it true that the API functionality of is now a paid feature?