
Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Download *ALL* HTML

calebhowellks
6 - Meteoroid

I am working on shipment data analysis for my company. The data we have is incomplete, so I want to batch download the raw HTML from pages like this one (https://tools.usps.com/go/TrackConfirmAction?qtc_tLabels1=92748903029148543477719247) to get the detailed tracking information.

 

This is the information I am looking for:

calebhowellks_0-1660834086231.png

 

In the past, I have simply run the URL through the Download tool and then parsed the results. That approach fails here, because the data I am seeking does not show up in the response.

calebhowellks_1-1660834213431.png

 

This is strange, because when I use "Save As" in Google Chrome, the saved document contains all the information I need.

 

I then explored using Python to grab this data, but I could not get it working (I do not know Python well). I followed this tutorial: https://community.alteryx.com/t5/Engine-Works/Web-Scraping-will-it-Alteryx/ba-p/429475

 

Any help would be appreciated! The goal is to:

  1. Have a list of USPS Tracking URLs
  2. Download/extract all information
  3. Pass this data on in the workflow to be analyzed (I am comfortable parsing with Regex, so I am confident that if I can download the data, I can handle the rest)
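
For step 1, a minimal Python sketch of pulling the tracking number out of each URL might look like the following. It assumes every URL carries the number in the `qtc_tLabels1` query parameter, as the example URL above does; the function name `tracking_numbers` is hypothetical and only for illustration.

```python
from urllib.parse import urlparse, parse_qs


def tracking_numbers(urls, param="qtc_tLabels1"):
    """Collect the tracking numbers found in the query string of each URL."""
    numbers = []
    for url in urls:
        # parse_qs maps each query parameter to a list of its values.
        query = parse_qs(urlparse(url).query)
        # URLs without the parameter contribute nothing.
        numbers.extend(query.get(param, []))
    return numbers


urls = [
    "https://tools.usps.com/go/TrackConfirmAction?qtc_tLabels1=92748903029148543477719247",
]
print(tracking_numbers(urls))
```

Having the bare tracking numbers, rather than full page URLs, keeps the list usable with whichever download method ends up working.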

 

Thanks!!!

 

Caleb

 

1 REPLY
Luke_C
17 - Castor

Hi @calebhowellks 

 

I'd suggest using the USPS APIs for this; they're free.

https://www.usps.com/business/web-tools-apis/track-and-confirm-api.htm
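
As a rough sketch of what a request could look like, the Python below builds a tracking request URL that could then be fed to the Download tool. The endpoint, the `TrackV2` API name, and the XML layout are assumptions based on the legacy USPS Web Tools format, and `MY_USER_ID` is a placeholder for the ID issued at registration; check all of them against the documentation linked above before relying on this.

```python
from urllib.parse import urlencode
from xml.sax.saxutils import quoteattr

# Assumed endpoint for the legacy Web Tools API; verify against the USPS docs.
API_BASE = "https://secure.shippingapis.com/ShippingAPI.dll"


def build_track_request(user_id, tracking_ids):
    """Build a request URL for one or more tracking numbers (assumed format)."""
    # quoteattr escapes the value and wraps it in quotes for use as an XML attribute.
    track_ids = "".join(
        f"<TrackID ID={quoteattr(t)}></TrackID>" for t in tracking_ids
    )
    xml = f"<TrackRequest USERID={quoteattr(user_id)}>{track_ids}</TrackRequest>"
    # urlencode percent-encodes the XML so it is safe inside a query string.
    return f"{API_BASE}?{urlencode({'API': 'TrackV2', 'XML': xml})}"


# MY_USER_ID is a placeholder, not a real credential.
print(build_track_request("MY_USER_ID", ["92748903029148543477719247"]))
```

The response would be XML rather than HTML, so it should parse cleanly with the XML Parse tool or Regex.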

 
