Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Using a reverse proxy for webscraping

Roche
8 - Asteroid

Hi everyone, 

 

I need to webscrape the data from this site: https://www.checkpoint.com/support-services/support-life-cycle-policy/#appliances-support

 

I have started the process but it looks like I will need a reverse proxy to do this task: (used the data clean only to see the outcome)

Roche_0-1654848505397.png

I had a look at this article: https://community.alteryx.com/t5/Alteryx-Server-Knowledge-Base/Requirements-for-Configuring-Alteryx-... and a few other articles re reverse proxies.  However I will need to do this task within Alteryx.  I am not sure how to go forward on this since I am newly introduced to reverse proxies and my understanding of what they describe in this article is limited.

 

Can anyone perhaps help me on what I need to do in this case?  (I have previously webscraped data from dynamic and static webpages using alteryx.)  I do not find documentation / examples from the community that can help me set up this flow and am also not sure how apache or nginx might come into play here.

 

Also, initially I will run this flow locally and thereafter would like to also run it on the server - not sure if it would be different if I need to run it on Alteryx server as well.

 

Would appreciate your help.

 

Thanks, 

Rouche

2 REPLIES 2
Matt_D
10 - Fireball

Hi @Roche 

 

Untick the option in the Download Tool - Encode URL Text

 

This allows me to return the data to be parsed.

Roche
8 - Asteroid

Oh goodness!  Thank you for that.  Thought I will need to do reverse proxies!

Labels
Top Solution Authors