Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Extract Data from Website

Jeff13
6 - Meteoroid

I'm trying to get practically everything (data related) from this website. If someone can get me started, I'm sure I can figure the rest out.

 

https://www.groupexpro.com/schedule/988/?view=new

 

In case this could be helpful, below is the script that can embedded on a website.

 

<script>
var acct = '988'; var loc = ''; var cat = ''; var stylesheet=''; var hideLastnames = true;
var jsHost = (("https:" == document.location.protocol) ? "https://" : "http://");
document.write("<scr"+"ipt src='"+jsHost+"ajax.googleapis.com/ajax/libs/jquery/1.7/jquery.min.js' type='text/javascript'></scr"+"ipt>");
document.write("<scr"+"ipt>var jQuery = jQuery.noConflict(true);</scr"+"ipt>");
document.write("<scr"+"ipt src='https://www.groupexpro.com/schedule/embed/schedule_embed_new_responsive.js.php?a="+acct+"' type='text/javascript'></scr"+"ipt>");
</script>

 

Thanks in advance.

3 REPLIES 3
TheOC
15 - Aurora
15 - Aurora

hey @Jeff13 

when loading the page, i can see 5 get requests in the network tab of chrome:

TheOC_0-1647354026019.png



This is loading the data for the page.

These seem to be requests for:
Schedule,
Categories,
Studios,
Classes,
Instructors.

You can use a download tool with a get request to retrieve this data. I have built a workflow that does this for the first request, 'schedules'. I have also imported the other requests - if you do so need them. They should work with the same logic.

Its worth me reminding you that not all sites are happy with people scraping their data, and I would first recommend getting permission from them, before scraping data from their pages. The potential consequence is being restricted access to their site.

Please find my workflow attached,
TheOC


Bulien
Jeff13
6 - Meteoroid

Worked like a charm!!! Thank you so much.

TheOC
15 - Aurora
15 - Aurora

hey @Jeff13 

No problem! Glad I could help!

TheOC


Bulien
Labels