Here is an image of the link I want to download in Alteryx (circled in red):
The URL appears to resolve to blob:https://www.hesa.ac.uk/9bb84ca8-539b-498e-8b79-44e68cdc0382, so I'm not sure how that gets turned into a CSV link in the browser. Also, each time the page refreshes, the blob URL changes. Finally, if you copy and paste the blob URL into an incognito window, you get an error. It's all very strange and I can't figure it out.
Thank you for trying @danilang but if it were that simple I think I might have got there myself :-)
I can see you're looking at the Inspect Element view in Chrome, but in my initial question I already pointed out that the link in question is not contained in the HTML source code. It is this HTML source, after all, that the Download tool in Alteryx would download. If you right-click on the page in Chrome and choose View page source, you'll see what I mean:
I have highlighted the part of the html where the link should appear, but it appears to be obfuscated within the manual-scrollbar div class. Am I missing something? Does it appear when you click on view source?
And even if the link were present in the source code, how would you get Alteryx to turn the blob: link into a CSV as the browser does? How does Chrome know what to do with that link? Thanks
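For context on the blob: question: a blob: URL refers to an object held in the memory of the browser tab that created it (typically by page JavaScript calling URL.createObjectURL), which would explain why it changes on every refresh and fails in an incognito window. There is nothing on the server to request, so a plain HTTP client cannot follow it. A minimal sketch of that in Python, where the helper name try_fetch is made up for illustration:

```python
import urllib.error
import urllib.request

# The blob URL quoted in the question; it only ever existed inside one browser tab.
BLOB_URL = "blob:https://www.hesa.ac.uk/9bb84ca8-539b-498e-8b79-44e68cdc0382"


def try_fetch(url):
    """Return the response body, or a short description of why the fetch failed.

    Hypothetical helper for illustration only.
    """
    try:
        with urllib.request.urlopen(url, timeout=10) as resp:
            return resp.read()
    except urllib.error.URLError as exc:
        # urllib has no handler for the "blob" scheme, so it gives up
        # before making any network request.
        return f"cannot fetch: {exc.reason}"


if __name__ == "__main__":
    print(try_fetch(BLOB_URL))
```

The same limitation should apply to any tool that only issues HTTP requests, so the data has to come from whatever the page itself loads rather than from the blob link.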
I did some research on the blob: prefix and found this
Here's a workflow that does just that.
It's a hack and is highly dependent on the response, but it gives these results with the code in Country1 and the country name/description in Country2. I'll leave the exercise of cleaning these up to you.
Ah yes, of course. In this case it's not so much that there's a CSV sitting on a server. Rather, the CSV is created by the browser from the information in the webpage, in which case, as you suggest, the route to go down is to parse the HTML in the usual fashion. For some reason I had become fixated on finding the CSV file rather than viewing it as the same thing as the table on the page. Thank you for the insight, and for the helpful example. Every time I dabble with regex I get a little bit better, and then I forget it all!
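For anyone wanting to try the same idea outside Alteryx, here is a rough sketch in Python of turning the table in the page markup into CSV text, which is roughly what the browser does when it builds the download. The sample markup, the class name TableToRows and the function html_table_to_csv are all made up for illustration; the real HESA page will be structured differently and would need its own handling.

```python
import csv
import io
from html.parser import HTMLParser


class TableToRows(HTMLParser):
    """Collect the text of every <th>/<td> cell, grouped by <tr>."""

    def __init__(self):
        super().__init__()
        self.rows = []      # completed rows
        self._row = None    # cells of the row currently being read
        self._cell = None   # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that sits inside a cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def html_table_to_csv(html):
    """Return the table cells found in `html` as CSV text."""
    parser = TableToRows()
    parser.feed(html)
    parser.close()
    out = io.StringIO()
    csv.writer(out, lineterminator="\n").writerows(parser.rows)
    return out.getvalue()


# Hypothetical sample markup, not the real HESA page.
SAMPLE = """
<table>
  <tr><th>Code</th><th>Country</th></tr>
  <tr><td>AA</td><td>Example Land</td></tr>
  <tr><td>BB</td><td>Sample, Republic of</td></tr>
</table>
"""

if __name__ == "__main__":
    print(html_table_to_csv(SAMPLE))
```

Using a parser rather than regex is just one option; it tends to cope better with nested tags and odd whitespace, while the csv module takes care of quoting values that contain commas.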