Alteryx Designer Desktop Discussions

Downloading data from website

jozsefm
5 - Atom

Hi all,

 

I would like to download data from the following website: https://www.submarinecablemap.com/#/

In the picture, you can see what I would like to get as text at the end of the process - the submarine cables in the list on the right side:

Capture.PNG

Unfortunately, when I use the Download tool, the result is just the text pasted at the end of this message, which does not contain what I want. I don't know why this happens. Maybe it is because the list items are hyperlinks rather than plain text? If so, how can I solve this?

The main goal is to get the text behind each hyperlink (shown in the next picture), but I am not sure this is even possible, because the URL changes slightly when I click on each hyperlink. What do you think? Thank you in advance!

Capture1.PNG

 

The text I got from the Download tool:

<!DOCTYPE html><html><head><meta charset="utf-8"><meta http-equiv="X-UA-Compatible" content="IE=edge"><title>Submarine Cable Map</title><meta name="viewport" content="width=device-width,initial-scale=1"><meta name="twitter:card" value="summary_large_image"><meta name="twitter:image" content="http://twentyfiveyears-assets.telegeography.com/submarinecablemap.png"><meta property="og:image" content="http://twentyfiveyears-assets.telegeography.com/submarinecablemap.png"><meta name="twitter:title" content="Submarine Cable Map"><meta property="og:title" content="Submarine Cable Map"><meta name="twitter:description" content="A comprehensive and regularly updated interactive map of the world's major submarine cable systems and landing stations."><meta property="og:description" content="TeleGeography’s free interactive Submarine Cable Map is based on our authoritative Global Bandwidth research, and depicts active and planned submarine cable systems and their landing stations. Selecting a cable on the map projection or from the submarine cable list provides access to the cable’s profile, including the cable’s name, ready-for-service (RFS) date, length, owners, website, and landing points."><meta property="og:url" content="https://www.submarinecablemap.com/"><meta property="og:type" content="article"><meta name="twitter:site" content="@telegeography"><meta name="twitter:creator" content="@telegeography"><meta property="og:site_name" content="https://www.submarinecablemap.com/"><meta name="keywords" content="undersea,submarine,cable,network,map,landing,station,international,bandwidth,fiber,optic,capacity,telecom,telecommunications,statistics,data"><meta name="description" content="TeleGeography's comprehensive and regularly updated interactive map of the world's major submarine cable systems and landing stations."><link rel="shortcut icon" href="/assets/images/favicon.ico"><meta name="submarinecablemap-dot-com/config/environment" 
content="%7B%22modulePrefix%22%3A%22submarinecablemap-dot-com%22%2C%22environment%22%3A%22production%22%2C%22rootURL%22%3A%22/%22%2C%22locationType%22%3A%22hash%22%2C%22EmberENV%22%3A%7B%22FEATURES%22%3A%7B%7D%2C%22EXTEND_PROTOTYPES%22%3A%7B%22Date%22%3Afalse%7D%7D%2C%22APP%22%3A%7B%22name%22%3A%22submarinecablemap-dot-com%22%2C%22version%22%3A%220.0.0+42662be6%22%7D%2C%22exportApplicationGlobal%22%3Afalse%7D"><link integrity="" rel="stylesheet" href="/assets/vendor-79d734a4a7138cc96a1a1c551acc2bf3.css"><link integrity="" rel="stylesheet" href="/assets/submarinecablemap-dot-com-fa141a2bbcb123c00a2f1f24bff56b31.css"></head><body><script src="https://maps.googleapis.com/maps/api/js?key=AIzaSyC1PHXojVLLhjdUILGHUThAtGL51HIA7Ps" type="text/javascript"></script><script src="/assets/vendor-f6e906f9d90eed41e67400e162d4e09f.js" integrity="sha256-qewvZbg5LL4IGwHQRAPmHOssxakunzO7smue5jEwEoI= sha512-SwRNjgQYQcgY8443TP0DMzlYYUrQvQZgeJ8GLzUvas9LGDFAvcECSN9965rWNZmGOqplLi4IXoi5N13NtqbC6g=="></script><script src="/assets/submarinecablemap-dot-com-9dbe7efe32fafc7d5c5dea1adee43e8e.js" integrity="sha256-On3cVfXZC6whnVbSRsyfdpKVs6fJ7pa5GFK3PGwA0EU= sha512-06csQfVKdRPGCmMLmsAGN62B1coLt5BKYlKvHwmpmHgfjt3XV2ztTFzaG0W/ce2A/aqpb9vJBAvYOY4/a5gILA=="></script><script type="text/javascript">var _gaq=_gaq||[];_gaq.push(["_setAccount","UA-397578-4"]),_gaq.push(["_trackPageview"]),function(){var t=document.createElement("script");t.type="text/javascript",t.async=!0,t.src=("https:"==document.location.protocol?"https://ssl":"http://www")+".google-analytics.com/ga.js";var e=document.getElementsByTagName("script")[0];e.parentNode.insertBefore(t,e)}()</script><script type="text/javascript">!function(e,t,a,n){if(!e.getElementById(a)){var s=e.createElement(t),c=e.getElementsByTagName(t)[0];s.id=a,s.src="//js.hs-analytics.net/analytics/"+3e5*Math.ceil(new Date/3e5)+"/594292.js",c.parentNode.insertBefore(s,c)}}(document,"script","hs-analytics")</script></body></html>

DavidThorpe
Alteryx

Hi @jozsefm

A neat little trick with websites like this one is to use the 'Inspect element' option in your browser to identify the data sources.

 

If I right-click on '2Africa' and choose 'Inspect element', a new panel opens at the bottom or right of the screen (depending on the browser) showing the webpage HTML:

 

DavidThorpe_1-1611921503867.png
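The HTML in that panel differs from what the Download tool returned because the page builds its content in the browser with JavaScript, and the Download tool only fetches the initial document. As a rough check outside Alteryx, here is a small Python sketch that counts script tags and collects any visible body text from an HTML string. The `sample` string is a shortened stand-in for the output you posted, not the full text, and `summarize` is just an illustrative helper name.

```python
from html.parser import HTMLParser


class VisibleText(HTMLParser):
    """Collects text inside <body>, skipping <script> and <style> content."""

    def __init__(self):
        super().__init__()
        self.in_body = False
        self.skip_depth = 0   # > 0 while inside <script> or <style>
        self.scripts = 0      # number of <script> tags seen anywhere
        self.chunks = []      # visible text fragments found in <body>

    def handle_starttag(self, tag, attrs):
        if tag == "body":
            self.in_body = True
        elif tag in ("script", "style"):
            self.skip_depth += 1
            if tag == "script":
                self.scripts += 1

    def handle_endtag(self, tag):
        if tag == "body":
            self.in_body = False
        elif tag in ("script", "style") and self.skip_depth:
            self.skip_depth -= 1

    def handle_data(self, data):
        if self.in_body and not self.skip_depth and data.strip():
            self.chunks.append(data.strip())


def summarize(html):
    """Return (script_tag_count, visible_body_text_fragments)."""
    parser = VisibleText()
    parser.feed(html)
    parser.close()
    return parser.scripts, parser.chunks


# Shortened stand-in for the Download tool output (assumption: same shape,
# a <body> that holds only <script> tags).
sample = (
    '<!DOCTYPE html><html><head><title>Submarine Cable Map</title></head>'
    '<body><script src="/assets/vendor.js"></script>'
    '<script type="text/javascript">var _gaq=_gaq||[];</script></body></html>'
)

scripts, text = summarize(sample)
print(scripts, text)
```

An empty list of visible text alongside a non-zero script count is a good hint that the data arrives through a separate request, which is what the Network tab helps you find.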

 

Clicking the 'Network' tab in that panel and then performing an action on the page (e.g. selecting 'Submarine Cable List' or '2Africa') shows that an API request is made behind the scenes to fetch the information:

 

DavidThorpe_2-1611921675139.png

 

This reveals the URL of a .json API request. Running that URL through the Alteryx Download tool returns a master list of the information you're looking for!

 

DavidThorpe_3-1611921815960.png
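If it helps to see the idea outside Alteryx, here is a minimal Python sketch of turning such a master list into ID/name pairs. It assumes the response is a JSON array of objects with `id` and `name` keys; that shape and the sample records are assumptions for illustration, so check them against the real response in the Network tab. In the workflow, the JSON Parse tool plays this role.

```python
import json


def parse_cable_index(json_text):
    """Return (id, name) pairs from a JSON array of objects.

    Assumes each object carries 'id' and 'name' keys (hypothetical field
    names); records missing either key are skipped.
    """
    records = json.loads(json_text)
    return [(r["id"], r["name"]) for r in records if "id" in r and "name" in r]


# Made-up sample in the assumed shape, not real API output.
sample_index = (
    '[{"id": "2africa", "name": "2Africa"},'
    ' {"id": "example-cable", "name": "Example Cable"},'
    ' {"note": "record without id or name"}]'
)

pairs = parse_cable_index(sample_index)
print(pairs)
```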

 

Taking each of these IDs and inserting it into the API path to build new URLs dynamically gives further API calls, which return all the relevant base data for each cable name in the list:

DavidThorpe_4-1611922115680.png
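The URL-building step can be sketched in Python as below. The base URL and the `.json` suffix are placeholders, not the site's real path; copy the actual request URL from the Network tab. In the workflow, this is a Formula tool concatenating strings ahead of a second Download tool.

```python
from urllib.parse import quote

# Placeholder only (assumption): replace with the path seen in the Network tab.
BASE_URL = "https://example.invalid/api/cable/"


def build_detail_urls(ids, base_url=BASE_URL, suffix=".json"):
    """Build one detail URL per ID, percent-encoding each ID for safe use in a path."""
    return [f"{base_url}{quote(str(cable_id), safe='')}{suffix}" for cable_id in ids]


urls = build_detail_urls(["2africa", "example cable"])
for url in urls:
    print(url)
```

Percent-encoding the ID is a precaution in case an ID ever contains a space or slash; for simple lowercase IDs it leaves the value unchanged.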

 

I've attached the workflow for you to use.

 

Thanks & enjoy!
