Good morning
We are collecting HTML data from this this web page using a Download tool. When we run the workflow the Download tool tells us the data we are downloaded is a huge 1.1TB - which is obviously not good 😧 - but when we look at the raw data there are only about 50,000 records.
Has anyone had to overcome this kind of thing before? Perhaps there are some tricks to avoid creating such a data volume in the download or possibly a way to avoid it happening in the first place.
Here's hoping 🤞
Thanks
ianjonna