I need help scraping data from https://www.ffiec.gov/npw/Institution/TopHoldings. Each top holding company has a set of subsidiaries that I’ll like to download. Please help as I’m new to this, I get a 403 message when I try…thank you
So do you see the section which says:
https://www.ffiec.gov/npw/Institution/BuildTier
If you click on response you can see that that's the data you are looking for.
Now that you've identified the API you need to dive into the request. Take a look at the payload tab. You can see that it's a) using POST - not GET and b) that it's sending in a URL encoded form. If you expand that you can see the values are:
RssdID=1039502
ProfileDtStart=5/19/2019 12:00:00 AM
ProfileDtEnd=12/31/9999 12:00:00 AM
X-Requested-With=XMLHttpRequest
You will need to set up these fields in Alteryx in the body/payload section as a URL encoded form. Only the RssdID will change.
Next make sure that you have your content-type set up for Alteryx in your headers ---it needs to say:
Content-Type: application/x-www-form-urlencoded
Try that.
I might be doing something wrong because I get the 403 forbidden message
Can you share your screenshots of your configuration of the Download tools' payload/body --- and what you are sending in?
@apathetichell I have attached a copy of the workflow. thank you.