Automatically Download Excel File from website
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi all,
How would I go about automatically downloading the excel file in the following website, then select a specific tab within it? I have created a view on a static file and would like to be able to update this monthly as new government data comes in.
Cheers!
- Labels:
- Download
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi @ben7
You would need Alteryx Server to schedule a workflow that uses the download tool to download the information.
The URL looks to contain a date in the file:
https://download.asic.gov.au/media/5293710/asic-insolvency-statistics-series-1a-published-october-20...
You could use the download tool to get the full page HTML, then parse the URL for the file download. If the date has changed to the current month then download it. Otherwise wait until it's schedule to run again (next day, next hour, whatever is appropriate).
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi Joe,
I'm quite new to the software, would you be able to break that down into simple steps please?
Thanks!
Ben
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
That's awesome!
How did you know you had to use the text to columns function after the download tool? I could only see 3 cells within the download tool
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
@ben7 wrote:
How did you know you had to use the text to columns function after the download tool? I could only see 3 cells within the download tool
The download tool will always return two fields (on top of the input ones); DownloadData and DownloadHeaders.
The DownloadData field will contain what the web request (URL) returns, much the same it would do if you entered it into the bar in Chrome.
I used the Text To Columns tool to then split the information out from one row in Alteryx, to the multiple rows of HTML that are within the field.
If you double click the cell, this will load the cell viewer where you can see all the data inside.
The Text To Columns is almost always the first step after using a download tool, or at least it is for me.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi @joes, I have a similar problem but it is with a website that requires a login. Can you private message me so I can have you take a look at it?
Kevin
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
I have the same issue. Were you able to fix it?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi Joe,
I am looking to download from a website that provides 15 minute updates, clean it and then provide this file to Tableau. I am trying to figure out a way to do this by CSV if possible. I just can't figure out how to query the HTML site on a constant basis within 15/30 minute increments. Any suggestions on how to set up this workflow? Sorry, new user, and for being potentially repetitive.
Thanks,
Brendan
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
@bpd44 wrote:Hi Joe,
I am looking to download from a website that provides 15 minute updates, clean it and then provide this file to Tableau. I am trying to figure out a way to do this by CSV if possible. I just can't figure out how to query the HTML site on a constant basis within 15/30 minute increments. Any suggestions on how to set up this workflow? Sorry, new user, and for being potentially repetitive.
Thanks,
Brendan
Hi Brendan,
I believe you would want to have Alteryx Server and schedule the workflow to run every 15 minutes.
That way it becomes and automated process
