Alteryx Designer Desktop Discussions

SOLVED

1 PDF not downloading, webscraping

Roche
8 - Asteroid

Hi everyone, 

 

I have built a workflow to download the PDFs from table 1 on this site: End of Sale and End of Service Life - Extreme Networks

All the PDFs download except for number 80, whose hyperlink is https://kapost-files-prod.s3.amazonaws.com/uploads/asset/file/5b4f5a2af9e4e600b3000147/Immediate%20E...

Does anyone know why this PDF would not download successfully? I was wondering whether it has something to do with the amazonaws.com/uploads part of the link, but I am not sure. The file does appear as a PDF in the folder where I keep the PDFs, but it does not open in Adobe Reader.

[Screenshot: Roche_0-1650615789224.png]

 

Can someone perhaps help me with this?  

 

I have also attached a sample workflow that uses an example path on my machine: \\wg0041\Alteryx-Shared

 

Thank you for helping!

 

Rouche

 

 

2 REPLIES
TheOC
16 - Nebula

Hey @Roche,
Thank you for attaching the sample workflow; it made this much easier to investigate. I do appreciate that.

 

The downloaded file contains:

[Screenshot: TheOC_0-1650616523629.png]
(Opened in Notepad++, which is usually pretty good for these things.)
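The same check can be done in a script instead of a text editor. Below is a minimal Python sketch, assuming that a well-formed PDF starts with the bytes `%PDF-` and that a failed download leaves a readable error body in the file. The function names `looks_like_pdf` and `preview` are made up for illustration and are not part of Alteryx.

```python
PDF_MAGIC = b"%PDF-"  # signature that well-formed PDF files normally start with


def looks_like_pdf(path):
    """Return True if the file at `path` begins with the PDF signature."""
    with open(path, "rb") as fh:
        return fh.read(len(PDF_MAGIC)) == PDF_MAGIC


def preview(path, limit=300):
    """Return the first `limit` bytes as text, e.g. to read an error message
    that was saved under a .pdf file name."""
    with open(path, "rb") as fh:
        return fh.read(limit).decode("utf-8", errors="replace")
```

If `looks_like_pdf` returns False for a file with a .pdf extension, `preview` will usually show what the server actually sent back.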
There are a few reasons why access might be denied, but in this case the cause is the 'encode URL text' option in your Download tool:

[Screenshot: TheOC_1-1650616742519.png]



The encode URL option effectively converts symbols within the URL to 'request friendly formats'. However, the URL you are trying to query is already encoded, so encoding it a second time changes it:
https://kapost-files-prod.s3.amazonaws.com/uploads/asset/file/5b4f5a2af9e4e600b3000147/Immediate%20E...

Note the '%20' within the URL, which represents a space.
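To illustrate why encoding an already-encoded URL breaks it, here is a small Python sketch using the standard library's `urllib.parse`. It is only an analogy: the assumption is that the Download tool's option behaves like ordinary percent-encoding, which this thread does not confirm in detail. The example.com URL and the helper names are hypothetical.

```python
from urllib.parse import quote, unquote

SAFE = ":/"  # left untouched so the scheme and path separators survive


def encode_once(url):
    """Percent-encode a URL, including any '%' characters it already has."""
    return quote(url, safe=SAFE)


def has_percent_escapes(url):
    """Heuristic: True if decoding changes the URL, i.e. it already
    contains percent escapes such as %20."""
    return unquote(url) != url


raw = "https://example.com/files/Immediate End.pdf"  # hypothetical URL
encoded = encode_once(raw)            # the space becomes %20
double_encoded = encode_once(encoded)  # the '%' itself becomes %25

print(encoded)
print(double_encoded)
```

The double-encoded form asks the server for a file whose name literally contains "%20", which is a different file name from the one with a space, so the request can fail even though the link looks almost identical.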

 

By unchecking that option, I managed to download the file to my drive with no problem.


You can read more about that here:
https://help.alteryx.com/20214/designer/download-tool


Hope this helps,

TheOC

 

Roche
8 - Asteroid

Good morning, 

 

Thank you. I downloaded the file using your advice.

 

Rouche
