I have a use case whereby I want to scrape a website via its sitemap and download each of the html and then convert those into markdown for processing into Azure AI.
Currently we have a python exe that does both these tasks individually, but the 15k html files end up becoming 126k md files.
I was wondering if I could leverage Alteryx to do this as 1 process, both scrap and convert on a 1-1 bases.
Has anyone used Alteryx to convert HTML to Markdown, which I assume would be done via the Python tool?
Thanks
Mark
This is an interesting use case. I'd like to learn too - bumping this thread for you.