Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Download PDF from website

Ray_Pospisil
8 - Asteroid

Hi all,

 

I am trying to download PDF which is stored at a particular website.

 

I already have the HREF link but somehow cannot use the download to blob correctly.

 

I tried to follow the information in the following link:

 

https://community.alteryx.com/t5/Alteryx-Designer-Discussions/Web-scraping-including-embedded-pdf-do...

 

but failed.

 

I am probably missing some small thing which somebody with fresh pair of eyes sees instantly when looking at my workflows.

 

Can I kindly ask anybody for help?

 

First step is to learn how to download the PDF and second step will follow when I will want to actually even name that downloaded PDF based on field which will be coming in from a different tool.

 

I attach my current WIP.

 

All the best,

 

Radek

 

 

5 REPLIES 5
BrandonB
Alteryx
Alteryx

Here you go. I helped out a bit with your href parsing using some regex. The final workflow is attached!

 

PDF to Blob.png

 

Ray_Pospisil
8 - Asteroid

Hi BrandonB,

 

Thank you very much for your reply.

 

It seems that your text in column "PDF" looks to be in the same format as "URLdownload1" column in my workflow.

 

How do I render a PDF out of it?

 

I was checking the Blob Convert and addition of Render but somehow do not know how to download the PDF.

 

What is wrong with the setting these two tools in my initial workflow please?

 

All the best,

 

Radek

BrandonB
Alteryx
Alteryx

Is your objective to download the PDFs to a folder? In the Download tool, you can use the To a File option with "Filename from a Field". You will need a path specified in another column that names the file and specifies where it will be written. This will download all of the files specified to a folder. 

Ray_Pospisil
8 - Asteroid

Hi BrandonB,

 

Thank you very much for your support.

 

I have just finished the whole Alteryx App for my end users.

 

However, it does not work on 100% of cases.

 

Sometimes, I download (let's say all 5 PDFs) I want but only one of them can be opened and the rest gives me an error that file is corrupted.

 

What could it be?

 

WF attached

 

Radek

jarrod_r
7 - Meteor

Were you able to get this resolved?

Labels
Top Solution Authors