Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Using a reverse proxy for webscraping

Roche
8 - Asteroid

Hi everyone, 

 

I need to webscrape the data from this site: https://www.checkpoint.com/support-services/support-life-cycle-policy/#appliances-support

 

I have started the process but it looks like I will need a reverse proxy to do this task: (used the data clean only to see the outcome)

Roche_0-1654848505397.png

I had a look at this article: https://community.alteryx.com/t5/Alteryx-Server-Knowledge-Base/Requirements-for-Configuring-Alteryx-... and a few other articles re reverse proxies.  However I will need to do this task within Alteryx.  I am not sure how to go forward on this since I am newly introduced to reverse proxies and my understanding of what they describe in this article is limited.

 

Can anyone perhaps help me on what I need to do in this case?  (I have previously webscraped data from dynamic and static webpages using alteryx.)  I do not find documentation / examples from the community that can help me set up this flow and am also not sure how apache or nginx might come into play here.

 

Also, initially I will run this flow locally and thereafter would like to also run it on the server - not sure if it would be different if I need to run it on Alteryx server as well.

 

Would appreciate your help.

 

Thanks, 

Rouche

2 REPLIES 2
Matt_D
10 - Fireball

Hi @Roche 

 

Untick the option in the Download Tool - Encode URL Text

 

This allows me to return the data to be parsed.

Roche
8 - Asteroid

Oh goodness!  Thank you for that.  Thought I will need to do reverse proxies!

Polls
We’re dying to get your help in determining what the new profile picture frame should be this Halloween. Cast your vote and help us haunt the Community with the best spooky character.
Don’t ghost us—pick your favorite now!
Labels