Alteryx Designer Discussions

Find answers, ask questions, and share expertise about Alteryx Designer.

Html data table extraction

Technom
5 - Atom

Hi,

I am trying to extract a data table called "Gold Mineral Reserves" from the following website. Screenshot shows location of the table.

https://www.sec.gov/Archives/edgar/data/756894/000119312519083744/d695547dex991.htm

 

My workflow that I have taken from the community does not format the data correctly to be able to split to rows and columns.

 

Any suggestions how to solve this would be great.

 

thanks!

4 REPLIES 4
BrandonB
Alteryx
Alteryx

This could be done in a more elegant manner using regular expressions, but this pulls the table as desired. 

 

parsing.png

Technom
5 - Atom

Great many thanks. Is there any way to extract the column headers (proven, probable total) and the sub headers (tonnes (000s), grade (g/mt), contained oz (000s) etc. ) as well?

BrandonB
Alteryx
Alteryx

Probably easiest to just use a select tool to manually change them as shown below. The HTML for those headers is on different lines and doesn't really provide much value in parsing out via workflow given that they are consistent. If there were a bunch more it might make sense, but this only took a minute or so. 

 

parsing v2.png

Technom
5 - Atom

Thanks!

Labels