
Alteryx Designer Desktop Discussions

SOLVED

1 PDF not downloading, webscraping

Roche

Hi everyone, 

 

I have built a workflow to download the PDFs from table 1 on this site: End of Sale and End of Service Life - Extreme Networks

All of the PDFs download except for number 80, for which the hyperlink is https://kapost-files-prod.s3.amazonaws.com/uploads/asset/file/5b4f5a2af9e4e600b3000147/Immediate%20E...

Does anyone know why this PDF would not download successfully? I was wondering whether it has something to do with the amazonaws.com/uploads part of the link, but I am not sure. The file does show up as a PDF in the folder where I keep the PDFs, but it will not open in Adobe Reader.


 

Can someone perhaps help me with this?  

 

I have also attached a sample workflow, which uses an example path on my machine: \\wg0041\Alteryx-Shared

 

Thank you for helping!

 

Rouche

 

 

2 REPLIES
TheOC

Hey @Roche,
Thank you for attaching the sample workflow; it made this much easier to investigate, and I do appreciate that.

 

So the file downloaded contains:

TheOC_0-1650616523629.png
(Opened in Notepad++, which is usually pretty good for these things.)
There are a few possible reasons for access being denied, but in this case the cause is the 'encode URL text' option in your Download tool:

TheOC_1-1650616742519.png



The encode URL option converts special characters in the URL into a percent-encoded, 'request friendly' form. However, the URL you are trying to query is already encoded:
https://kapost-files-prod.s3.amazonaws.com/uploads/asset/file/5b4f5a2af9e4e600b3000147/Immediate%20E...

Note the '%20' in the URL, which represents a space. Encoding the URL a second time would also encode the '%' itself (typically as '%25'), so the request would no longer point at the same file.
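
To illustrate why encoding an already-encoded URL breaks it, here is a minimal Python sketch. It uses the standard library's urllib.parse rather than Alteryx itself, so it only approximates what the Download tool's option might be doing, and the example.com URL is a made-up stand-in for the real link. The encode_once helper is a hypothetical name introduced just for this example.

```python
from urllib.parse import quote, unquote

# Hypothetical, already percent-encoded URL (a stand-in, not the real S3 link).
already_encoded = "https://example.com/uploads/My%20File.pdf"

# Encoding a second time escapes the '%' itself, so '%20' becomes '%2520'.
double_encoded = quote(already_encoded, safe=":/")


def encode_once(url: str) -> str:
    """Decode first, then encode, so an already-encoded URL is left unchanged."""
    return quote(unquote(url), safe=":/")


print(double_encoded)
print(encode_once(already_encoded))
```

The decode-then-encode helper is only a rough normalization: it would, for instance, turn an encoded '%2F' into a plain '/'. In the workflow itself, unchecking the option is the simpler fix.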

 

By unchecking that option, I managed to download the file to my drive without any problem.
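
As a quick way to confirm that a download really is a PDF and not an error response saved under a .pdf name, you can look at the first bytes of the file, since a PDF normally starts with '%PDF-'. The sketch below is in Python and is not part of the workflow; the function names and the sample bytes are invented for illustration.

```python
def looks_like_pdf(data: bytes) -> bool:
    """Return True if the bytes begin with the usual PDF header marker."""
    return data.startswith(b"%PDF-")


def file_looks_like_pdf(path: str) -> bool:
    """Read only the first five bytes of a file and check them."""
    with open(path, "rb") as handle:
        return looks_like_pdf(handle.read(5))


# Invented sample contents: a PDF header and an error-style text body.
sample_pdf = b"%PDF-1.7\n"
sample_error = b"<Error>Access Denied</Error>"

print(looks_like_pdf(sample_pdf))
print(looks_like_pdf(sample_error))
```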


You can read more about that here:
https://help.alteryx.com/20214/designer/download-tool


Hope this helps,

TheOC

 

Roche

Good morning, 

 

Thank you. I downloaded the file successfully using your advice.

 

Rouche
