
Alteryx Designer Desktop Discussions

SOLVED

Web Scraping from a Password-Protected Site

RogerWyllie
6 - Meteoroid

Hi Alteryx Community, 

 

I am attempting to design a workflow that can extract credit ratings from a website such as Standard & Poor's. These websites, however, require the user to log in before they display any data.

 

I have been able to execute a POST request to log in to the S&P website by following the instructions at https://www.thedataschool.co.uk/joe-carr/webscraping-through-alteryx-as-if-you-are-logged-in/. However, I cannot figure out how to remain logged in when I then execute a GET request to start my web scraping, even though I am retaining the cookie returned by the POST request.
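
For readers following along, here is a minimal sketch of the mechanism in question: a POST that logs in, followed by a GET that only succeeds if the cookie from the login response is sent back. It does not talk to S&P. `DemoSite`, the `/login` and `/ratings` paths, the `session=abc123` cookie, and the credentials are all made-up stand-ins running on a local test server, and it uses only the Python standard library.

```python
import http.cookiejar
import threading
import urllib.error
import urllib.parse
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer


class DemoSite(BaseHTTPRequestHandler):
    """Hypothetical stand-in for a password-protected site (not S&P)."""

    def do_POST(self):
        # Read the form body and issue a session cookie on a valid /login.
        length = int(self.headers.get("Content-Length", 0))
        form = urllib.parse.parse_qs(self.rfile.read(length).decode())
        ok = self.path == "/login" and form.get("password") == ["secret"]
        self.send_response(200 if ok else 401)
        if ok:
            self.send_header("Set-Cookie", "session=abc123; Path=/")
        self.send_header("Content-Length", "0")
        self.end_headers()

    def do_GET(self):
        # Serve the data only when the session cookie is sent back.
        logged_in = "session=abc123" in self.headers.get("Cookie", "")
        body = b"rating: AA+" if logged_in else b"please log in"
        self.send_response(200 if logged_in else 403)
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):
        pass  # keep the demo quiet


def make_opener(jar=None):
    """Build an opener that ignores system proxies and optionally keeps cookies."""
    handlers = [urllib.request.ProxyHandler({})]
    if jar is not None:
        handlers.append(urllib.request.HTTPCookieProcessor(jar))
    return urllib.request.build_opener(*handlers)


def fetch(opener, url, form=None):
    """Return (status, body). Passing form data turns the request into a POST."""
    data = urllib.parse.urlencode(form).encode() if form else None
    try:
        with opener.open(url, data=data, timeout=5) as resp:
            return resp.status, resp.read().decode()
    except urllib.error.HTTPError as err:
        return err.code, err.read().decode()


def run_demo():
    server = HTTPServer(("127.0.0.1", 0), DemoSite)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    base = f"http://127.0.0.1:{server.server_port}"

    # One opener with a cookie jar plays the role of a logged-in browser:
    # the cookie set by the POST is replayed automatically on the GET.
    session = make_opener(http.cookiejar.CookieJar())
    login = fetch(session, base + "/login",
                  {"username": "demo", "password": "secret"})
    with_cookie = fetch(session, base + "/ratings")

    # An opener without a jar sends no cookie, so the same GET is rejected.
    without_cookie = fetch(make_opener(), base + "/ratings")

    server.shutdown()
    server.server_close()
    return login, with_cookie, without_cookie


if __name__ == "__main__":
    for label, result in zip(("login", "with cookie", "without cookie"), run_demo()):
        print(label, result)
```

The point of the sketch is that the GET must carry the cookie back in a `Cookie` request header. Capturing the cookie from the POST response is not enough if the follow-up request does not send it. Real sites may also expect further cookies, hidden form tokens, or JavaScript-driven steps that this toy server does not model.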

 

I have already discussed this with an Alteryx Solutions Engineer, but we could not resolve it. I am hoping somebody in the community can point me in the right direction. I am not very savvy with web programming, so I may also be structuring my requests incorrectly.

4 REPLIES
BrandonB
Alteryx

Do you have access to their API? This would be a much more direct route: https://www.spglobal.com/marketintelligence/en/documents/spciq_api_v2.pdf

 

This will likely depend on the licensing that your company has arranged with theirs. 

RogerWyllie
6 - Meteoroid

Hi Brandon, 

 

I am trying to avoid using any APIs, as this would complicate the approval process for the tool I am trying to create. If I cannot find another solution, I will be sure to use the link you provided. Thank you!

PhilippK
Alteryx Alumni (Retired)

Hi @RogerWyllie ,

 

I solved a similar challenge with the Alteryx Python tool and the Selenium package, which lets you automate (remote-control) a real browser.

 

@DavidM wrote a great article about that:

https://community.alteryx.com/t5/Alteryx-Designer-Discussions/Python-Code-Tool-Web-Scraping-Dynamic-...
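
As a rough illustration of the Selenium approach (a sketch under stated assumptions, not the code from the linked article): the browser is driven through the login form and then pointed at the protected page, so it keeps its own cookies in between. The field names `username` and `password`, the submit-button selector, and the example.com URLs are hypothetical placeholders. A real login page needs its own locators and usually an explicit wait after the click.

```python
def scrape_after_login(driver, login_url, data_url, username, password):
    """Log in through the browser, then return the HTML of a protected page.

    The locators below are hypothetical; inspect the real login form
    to find the right ones.
    """
    driver.get(login_url)
    # "name" and "css selector" are the string values behind Selenium's
    # By.NAME and By.CSS_SELECTOR constants.
    driver.find_element("name", "username").send_keys(username)
    driver.find_element("name", "password").send_keys(password)
    driver.find_element("css selector", "button[type='submit']").click()
    # The browser session holds the login cookies, so this page load
    # is made as the logged-in user.
    driver.get(data_url)
    return driver.page_source


def main():
    # Imported here so scrape_after_login can be tried without a browser.
    # Assumes the selenium package and Chrome are installed.
    from selenium import webdriver

    driver = webdriver.Chrome()
    try:
        html = scrape_after_login(
            driver,
            "https://example.com/login",    # hypothetical login page
            "https://example.com/ratings",  # hypothetical protected page
            "my_user",
            "my_password",
        )
        print(len(html))
    finally:
        driver.quit()  # always close the browser
```

Calling `main()` opens a Chrome window and runs the steps. The returned HTML could then be parsed in Python or passed on to the rest of the workflow.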

 

Best regards

Phil

RogerWyllie
6 - Meteoroid

Hi Philipp,

 

Thank you for this, I will be sure to give it a go! 
