Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

HTML to Markdown conversion

markashman
9 - Comet

I have a use case whereby I want to scrape a website via its sitemap and download each of the html and then convert those into markdown for processing into Azure AI.

 

Currently we have a python exe that does both these tasks individually, but the 15k html files end up becoming 126k md files.

 

I was wondering if I could leverage Alteryx to do this as 1 process, both scrap and convert on a 1-1 bases.

 

Has anyone used Alteryx to convert HTML to Markdown, which I assume would be done via the Python tool?

 

Thanks

Mark

1 REPLY 1
caltang
17 - Castor
17 - Castor

This is an interesting use case. I'd like to learn too - bumping this thread for you.

Calvin Tang
Alteryx ACE
https://www.linkedin.com/in/calvintangkw/
Labels
Top Solution Authors