Hello!
Looking for some help on downloading a csv file that is in a zip file from the DOL 5500 website. Right now we manually download the file to a shared folder on our network and use that as the input in a workflow, however I'm looking to sort of dynamically get this data. Some of the threads I've tried looking at where archived so I couldn't use those as examples.
The below site is were the zip links are and the other two are direct links to the zip file.
https://www.dol.gov/agencies/ebsa/about-ebsa/our-activities/public-disclosure/foia/form-5500-datasets
https://askebsa.dol.gov/FOIA%20Files/2022/Latest/F_5500_2022_Latest.zip?_ga=2.168214697.982608220.1696943063-258589663.1695410842
https://askebsa.dol.gov/FOIA%20Files/2021/Latest/F_5500_2021_Latest.zip?_ga=2.168214697.982608220.1696943063-258589663.1695410842
Any help on where to get started would be great! Thanks!
Hi @tbuenaflor
The go-to post for this topic was always "https://community.alteryx.com/t5/contentarchivals/contentarchivedpage/message-uid/444413" provided by someone from Alteryx. The post has gone, but I snagged a copy of the workflow since it was so useful.
Alteryx Input tools can read a file from within a zip archive, but you have to know the name of the file that you want to extract. The attached workflow uses a python tool to read the directory of the zip file, which is then passed to a batch file to extract the individual files. You can use this as the basis for your workflow by replacing the initial Directory browse tool with a download tool that writes to a specific location.
Dan
@danilang you're awesome for saving the workflow! Going to give it a shot. Thank you so much!