Alteryx Server Discussions

SOLVED

Downloading HTML tables, Alteryx not finding tables?

5 - Atom

I'm trying to download and parse the following tables from this site: Corporate tax rates table. I've managed to understand where the data is coming from: Data source.

 

When I download the data, Alteryx returns only the "Location" and "Footnotes" column headers, with nothing in between and no countries.

 

Could somebody help me with parsing the tables? I tried a lot, to no avail.

 

This is what Alteryx spits out, by the way. As you can see, the table contains only the two column headers:

<!doctype html>

<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<title>KPMG</title>
<script language="JavaScript">

var selectedTax = "Corporate"

</script>
<script language="JavaScript" src="js/jquery-1.6.2.min.js"></script>
<script language="JavaScript" src="js/js-GM-PRGM-009.js"></script>
<link href="css/style-GM-PRGM-009.css" rel="stylesheet" type="text/css">
</head>

<body>
<div class="GMPRGM009ResponsiveTable">
  <table width="100%" >
    <tbody>
      <tr>
        <th class="GMPRGM009xslTHLocation">Location</th>
        <th class="GMPRGM009xslTHFootnotes">Footnotes</th>
      </tr>
    </tbody>
  </table>
</div>
</body>
</html>

Alteryx

Have a look at the attached workflow as an example. It should give you some insight.

16 - Nebula

The actual data is coming from https://s3.amazonaws.com/kpmg-global/tax-rates-tool/js/taxRateTool-data.js

(I found this URL by looking at the Network tab in Chrome's developer tools.)

 

This provides the data in JSON format.

 

You can download this and then parse.

 

Something like: [image: clipboard_image_0.png]

 

Sample attached
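
For anyone who wants to try the same download-and-parse idea outside Alteryx, here is a minimal Python sketch. The URL is the one quoted above; everything else is an assumption. In particular, the thread does not show how the .js file is laid out, so the hypothetical helper extract_json() assumes it is either plain JSON or a single assignment such as "var name = {...};". The function names fetch_text and extract_json are made up for illustration.

```python
import json
import re
import urllib.request

# URL quoted in the reply above.
DATA_URL = "https://s3.amazonaws.com/kpmg-global/tax-rates-tool/js/taxRateTool-data.js"


def fetch_text(url: str = DATA_URL, timeout: float = 30.0) -> str:
    """Download the data file and return it as text (UTF-8 is assumed)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")


def extract_json(js_text: str):
    """Parse the JSON payload out of a .js file.

    Assumption: the file is either plain JSON, or one JavaScript
    assignment wrapping a JSON value (e.g. `var name = {...};`).
    The real layout of the file is not shown in the thread.
    """
    text = js_text.strip()
    try:
        # Case 1: the whole file is already valid JSON.
        return json.loads(text)
    except json.JSONDecodeError:
        pass

    # Case 2: skip ahead to the first '{' or '[' and decode one JSON value,
    # ignoring whatever follows it (such as a trailing semicolon).
    match = re.search(r"[\[{]", text)
    if match is None:
        raise ValueError("no JSON object or array found in the input")
    value, _end = json.JSONDecoder().raw_decode(text[match.start():])
    return value


# Typical use (needs network access, so it is left commented out):
# data = extract_json(fetch_text())
```

If the file turns out to use JavaScript syntax that is not valid JSON (single quotes, unquoted keys, comments), json will raise an error and a different parsing approach would be needed.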

 

Alteryx

@JDFS404 

 

The website uses a script to render the content, i.e. the tables are not built from static HTML that Alteryx can download and parse. You can use the method described by @DavidM here to scrape websites with dynamically generated content.
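
A quick way to confirm this for a given page is to count the table cells in the raw download and list the scripts it references. The sketch below does that in Python with the standard-library html.parser, using a trimmed copy of the HTML from the question. The class name TableProbe and the function probe are hypothetical names chosen for this example.

```python
from html.parser import HTMLParser


class TableProbe(HTMLParser):
    """Count header cells, data cells, and external scripts in an HTML page."""

    def __init__(self):
        super().__init__()
        self.header_cells = 0
        self.data_cells = 0
        self.script_sources = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling this.
        if tag == "th":
            self.header_cells += 1
        elif tag == "td":
            self.data_cells += 1
        elif tag == "script":
            src = dict(attrs).get("src")
            if src:  # inline scripts have no src attribute and are skipped
                self.script_sources.append(src)


def probe(html: str) -> TableProbe:
    """Feed the HTML through the parser and return the collected counts."""
    parser = TableProbe()
    parser.feed(html)
    parser.close()
    return parser


# Trimmed copy of the HTML posted in the question.
PAGE = """
<head>
<script language="JavaScript" src="js/jquery-1.6.2.min.js"></script>
<script language="JavaScript" src="js/js-GM-PRGM-009.js"></script>
</head>
<body>
<table width="100%">
  <tr>
    <th class="GMPRGM009xslTHLocation">Location</th>
    <th class="GMPRGM009xslTHFootnotes">Footnotes</th>
  </tr>
</table>
</body>
"""

result = probe(PAGE)
print(result.header_cells, result.data_cells, result.script_sources)
# prints: 2 0 ['js/jquery-1.6.2.min.js', 'js/js-GM-PRGM-009.js']
```

Two header cells, zero data cells, and external scripts on the page are a strong hint that the rows are filled in by JavaScript after the page loads, so a plain download will never contain them.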

5 - Atom

Exactly what I was looking for, @jdunkerley79! Thanks very much :).