Bring your best ideas to the AI Use Case Contest! Enter to win 40 hours of expert engineering support and bring your vision to life using the powerful combination of Alteryx + AI. Learn more now, or go straight to the submission form.
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Download *ALL* HTML

calebhowellks
6 - Meteoroid

I am working on some shipment data analysis for my company. The data on this is lacking, so I am looking to batch download the raw HTML from websites like this (https://tools.usps.com/go/TrackConfirmAction?qtc_tLabels1=92748903029148543477719247) to see the detailed tracking information. 

 

This is the information I am looking for:

calebhowellks_0-1660834086231.png

 

In the past, I have simply run the URL through the download tool and then parsed the results. I am coming up short there, as the data I am seeking does not show up in the response. 

calebhowellks_1-1660834213431.png

 

This is really weird, because when I "Save As" in Google Chrome, the document saved contains all information I could need. 

 

I then explored using Python to grab this data, but I came up short (I do not know Python well). I followed this tutorial: https://community.alteryx.com/t5/Engine-Works/Web-Scraping-will-it-Alteryx/ba-p/429475 

 

Any help would be appreciated! Goal is to:

  1. Have a list of USPS Tracking URLs
  2. Download/extract all information
  3. Pass this data on in the workflow to be analyzed (I am very good with parsing through Regex, so I am confident that if I can download the data, that I can succeed)

 

Thanks!!!

 

Caleb

 

1 REPLY 1
Luke_C
17 - Castor
17 - Castor

Hi @calebhowellks 

 

I'd suggest using the USPS APIs for this -  they're free.

https://www.usps.com/business/web-tools-apis/track-and-confirm-api.htm

 

Labels
Top Solution Authors