Alteryx Designer Desktop Discussions

Download *ALL* HTML

calebhowellks
6 - Meteoroid

I am working on shipment data analysis for my company. The shipment data we have lacks detail, so I am looking to batch download the raw HTML from pages like this one (https://tools.usps.com/go/TrackConfirmAction?qtc_tLabels1=92748903029148543477719247) to get the detailed tracking information.

 

This is the information I am looking for:

calebhowellks_0-1660834086231.png

 

In the past, I have simply run the URL through the Download tool and then parsed the results. That approach comes up short here, because the data I am seeking does not show up in the response.

calebhowellks_1-1660834213431.png

 

This is really weird, because when I "Save As" in Google Chrome, the document saved contains all information I could need. 

 

I then explored using Python to grab this data, but I came up short (I do not know Python well). I followed this tutorial: https://community.alteryx.com/t5/Engine-Works/Web-Scraping-will-it-Alteryx/ba-p/429475 

 

Any help would be appreciated! Goal is to:

  1. Have a list of USPS Tracking URLs
  2. Download/extract all information
  3. Pass this data on in the workflow to be analyzed (I am very good at parsing with Regex, so I am confident that I can succeed if I can download the data)

 

Thanks!!!

 

Caleb

 

1 REPLY
Luke_C
17 - Castor

Hi @calebhowellks 

 

I'd suggest using the USPS APIs for this - they're free.

https://www.usps.com/business/web-tools-apis/track-and-confirm-api.htm
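
As a rough illustration of how the API route could replace scraping, here is a minimal Python sketch. It pulls the tracking number out of each URL (the `qtc_tLabels1` parameter from the example link in the question), builds a request, and flattens an XML response into rows that can be passed on in a workflow. The endpoint, the `TrackV2` API name, and the `TrackRequest`/`TrackInfo`/`TrackSummary`/`TrackDetail` element names are assumptions based on the legacy USPS Web Tools format and should be checked against the documentation linked above, since USPS may have changed or superseded that API. The `DEMO_USER_ID` value and the sample response text are purely hypothetical, and no network call is made.

```python
from urllib.parse import urlparse, parse_qs, quote
import xml.etree.ElementTree as ET

# ASSUMPTION: legacy USPS Web Tools endpoint; verify against the current USPS docs.
API_ENDPOINT = "https://secure.shippingapis.com/ShippingAPI.dll"


def tracking_number_from_url(url):
    """Return the value of the qtc_tLabels1 query parameter, or None if absent."""
    values = parse_qs(urlparse(url).query).get("qtc_tLabels1", [])
    return values[0] if values else None


def build_track_request_url(user_id, tracking_numbers):
    """Build a GET URL carrying an XML TrackRequest (assumed request format)."""
    root = ET.Element("TrackRequest", USERID=user_id)
    for number in tracking_numbers:
        ET.SubElement(root, "TrackID", ID=number)
    xml_body = ET.tostring(root, encoding="unicode")
    return f"{API_ENDPOINT}?API=TrackV2&XML={quote(xml_body)}"


def parse_track_response(xml_text):
    """Flatten an (assumed) TrackResponse document into one row per tracking event."""
    rows = []
    root = ET.fromstring(xml_text)
    for info in root.iter("TrackInfo"):
        tracking_id = info.get("ID")
        for event in info:
            if event.tag in ("TrackSummary", "TrackDetail"):
                rows.append({
                    "tracking_number": tracking_id,
                    "event_type": event.tag,
                    "text": (event.text or "").strip(),
                })
    return rows


# HYPOTHETICAL response, written by hand to show the assumed shape only.
SAMPLE_RESPONSE = """
<TrackResponse>
  <TrackInfo ID="92748903029148543477719247">
    <TrackSummary>Example summary line for the latest event</TrackSummary>
    <TrackDetail>Example detail line for an earlier event</TrackDetail>
  </TrackInfo>
</TrackResponse>
"""

if __name__ == "__main__":
    urls = [
        "https://tools.usps.com/go/TrackConfirmAction?qtc_tLabels1=92748903029148543477719247",
    ]
    numbers = [n for n in map(tracking_number_from_url, urls) if n]
    # DEMO_USER_ID is a placeholder; a real user ID comes from registering with USPS.
    print(build_track_request_url("DEMO_USER_ID", numbers))
    for row in parse_track_response(SAMPLE_RESPONSE):
        print(row)
```

In Alteryx, the same idea should work without Python: build the request URL in a Formula tool, send it through the Download tool, and parse the returned XML. Because the response is structured XML, the tracking events arrive directly in the response and do not depend on anything the browser renders after the page loads.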

 
