Alteryx Designer Desktop Discussions

SOLVED

Data Scraping from a Website

Khanzali007
8 - Asteroid

Dear Team,

 

I need your support with my workflow to extract information from the webpage, such as Name, Type, etc. I have attached a screenshot of the web page, with the relevant information highlighted in yellow.

 

Thanks in advance.

markcurry
12 - Quasar

Hi @Khanzali007 

 

See the attached workflow. I hope it gives you what you're looking for.

 

 

Khanzali007
8 - Asteroid

Hi @markcurry 

 

Thanks for the reply; your solution works perfectly. I need a bit more help in the same context.

 

I need to extract more information from the same website, such as the filing history. For reference, I have attached a screenshot.

The filing history can be seen at this link: https://irisgst.com/gstin-filing-detail/?gstinno=27AADCD0086F1ZW

 

mceleavey
17 - Castor

Hi @Khanzali007 ,

 

I've attached a workflow and a macro I built to scrape HTML tables (save this to your macros folder before you open the workflow).
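
The macro itself is only available as an attachment, so its internals are not shown here. As a rough sketch of the same general idea (walk the HTML, collect every table row by row, and write each table out as CSV), the following Python uses only the standard library. The class name `TableParser`, the function `tables_to_csv`, and the sample markup are all hypothetical illustrations, not part of the macro, and nested tables are not handled.

```python
import csv
import io
from html.parser import HTMLParser


class TableParser(HTMLParser):
    """Collects every <table> in a document as a list of rows of cell text."""

    def __init__(self):
        super().__init__()
        self.tables = []   # one list of rows per <table> seen so far
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that sits inside a table cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Join the fragments and collapse runs of whitespace.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.tables[-1].append(self._row)
            self._row = None


def tables_to_csv(html_text):
    """Return one CSV string per table found in html_text."""
    parser = TableParser()
    parser.feed(html_text)
    results = []
    for table in parser.tables:
        buffer = io.StringIO()
        csv.writer(buffer, lineterminator="\n").writerows(table)
        results.append(buffer.getvalue())
    return results


if __name__ == "__main__":
    # Made-up sample markup; a real page would be downloaded first.
    sample = (
        "<table><tr><th>Return Type</th><th>Status</th></tr>"
        "<tr><td>GSTR1</td><td> Filed </td></tr></table>"
    )
    for csv_text in tables_to_csv(sample):
        print(csv_text)
```

Using a parser rather than a regular expression for tables keeps the row and cell boundaries reliable even when cells contain inline markup.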

 


 

Hope this helps,

 

M.

 

 




atcodedog05
22 - Nova

Neat macro @mceleavey 🙂

mceleavey
17 - Castor

Thanks, @atcodedog05 

 

I was doing a lot of web scraping, so building this to scrape tables into .csv format saved a LOT of time.

 

M.




pujaguptars
6 - Meteoroid

Hi @mceleavey , 

 

I want to download the addresses of the office locations from here: https://www.adanirealty.com/contact-us

But when I use this URL in the Download tool, it gives me an error. Could you please help?

 

Much thanks!

mceleavey
17 - Castor

Hi @pujaguptars ,

 

The tool won't work in this case because the required information is not in an HTML table; it is simply placed in DIV tags in the HTML. This means it's a manual scraping operation.

I've built this out for you using a bit of Regex magic.

 

 

 

I've attached the workflow.
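
The attached workflow is not reproduced here. For readers who want to see the DIV-based approach in code, here is a minimal Python sketch of the general technique: match the DIV blocks with a regular expression, strip the inner tags, and tidy the whitespace. It assumes the addresses sit in DIVs whose class contains the word `address`; that class name, the function `extract_addresses`, and the sample markup are hypothetical and would need to be adjusted to the real page source. The non-greedy match also stops at the first closing DIV tag, so nested DIVs would need a proper HTML parser.

```python
import html
import re

# Assumption: each address lives in a <div> whose class list includes "address".
DIV_PATTERN = re.compile(
    r'<div[^>]*class="[^"]*\baddress\b[^"]*"[^>]*>(.*?)</div>',
    re.IGNORECASE | re.DOTALL,
)
# Matches any remaining tag inside the captured block, e.g. <p> or <br>.
TAG_PATTERN = re.compile(r"<[^>]+>")


def extract_addresses(page_source):
    """Return the cleaned text of every matching <div> block."""
    addresses = []
    for inner in DIV_PATTERN.findall(page_source):
        text = TAG_PATTERN.sub(" ", inner)     # drop inner tags
        text = html.unescape(text)             # decode entities such as &amp;
        addresses.append(" ".join(text.split()))  # collapse whitespace
    return addresses


if __name__ == "__main__":
    # Made-up sample markup standing in for the downloaded page.
    sample = (
        '<div class="office address"><p>12 Example Road,<br>'
        "Sample City &amp; District</p></div>"
        '<div class="phone">000</div>'
    )
    print(extract_addresses(sample))
```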

Hope this helps,

 

M.

 




Anudeep_Yalamuru
8 - Asteroid

@mceleavey Good morning, could you please assist with my query below?

 

https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/Web-scrapping-Multiple-Pages/t...

 

The details of the post are:

 

I need to get the details from this webpage - "https://www.gov.ie/en/directory/category/495b8a-schools/?school_roll_number="

 

I need to extract all the school details, as shown on this page:

 

"https://www.gov.ie/en/directory/page/5pcuno-2aoze9-/"

 

Could you please assist me in getting the details of all 4,000-odd schools?

 

Thanks,

Anudeep
