Alteryx Designer Desktop Discussions

RDF25087 · ‎07-10-2023

Hi all -

I'd like to scrape branch details from the ukps.com/stores.html website.

Down the right is a table of branches with city, phone number, website, email and address. I'd like to be able to scrape that info into a table for each of those elements.

Is this possible? I fear this is way beyond my understanding!

Thanks for any assistance you might be able to provide.

RDF

geraldo · ‎07-10-2023

@RDF25087

Yes, it is possible.
The addresses you want are in the generated html.
Then use the download tool and make a regex to scrape

RDF25087 · ‎07-11-2023

Thanks for the response, but as mentioned I think this is a bit beyond my current skill levels - a bit more guidance on how to build the flow would be appreciated.

RDF

RDF25087 · ‎07-11-2023

Looking at the HTML I don't think the data is in a table?

RDF25087 · ‎07-11-2023

Any help?

geraldo · ‎07-12-2023

@RDF25087

An workflow example

I hope it helps in understanding

acarter881 · ‎07-12-2023

Hey, @RDF25087.

To simplify this process, I would use Python. In my opinion, it will be easier to solve and easier to understand.

I suggest looking into requests (to send the HTTP GET request) and Beautiful Soup (to parse the HTML).

I believe these are the main libraries you need. Seems like the data you want are in the HTML and not loaded dynamically with JavaScript, so something like Selenium won't be necessary.

RDF25087 · ‎07-13-2023

@geraldo @acarter881 - thank you both for your support - it's much appreciated.

@geraldo - when I try to run your workflow above I get the attached error on the download.

Thanks

RDF

geraldo · ‎07-13-2023

@RDF25087

This message has something related to your internet access.
For me it's running perfectly. I don't have a proxy or firewall activated.
Can you open a url through the browser?

https://www.ukps.com/stores.html

RDF25087 · ‎07-13-2023

@geraldo

Thanks for the reply. I work for a large business that do everything they can to make downloading/accessing the internet as difficult as possible. When I get home I'll try running it on my home network.

Alteryx Designer Desktop Discussions

Web Scrape Branch Details

Re: Min and Max positive Negative Number Highlight...

Re: Outlook 365 Input tool issue

Re: parsing text to date

Re: If formula to determined a value based on date...

Re: Running multiple alteryx workflows within alte...