In case you missed the announcement: Alteryx One is here, and so is the Spring Release! Learn more about these new and exciting releases here!

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

HTML to Markdown conversion

markashman
9 - Comet

I have a use case whereby I want to scrape a website via its sitemap and download each of the html and then convert those into markdown for processing into Azure AI.

 

Currently we have a python exe that does both these tasks individually, but the 15k html files end up becoming 126k md files.

 

I was wondering if I could leverage Alteryx to do this as 1 process, both scrap and convert on a 1-1 bases.

 

Has anyone used Alteryx to convert HTML to Markdown, which I assume would be done via the Python tool?

 

Thanks

Mark

1 REPLY 1
caltang
17 - Castor
17 - Castor

This is an interesting use case. I'd like to learn too - bumping this thread for you.

Calvin Tang
Alteryx ACE
https://www.linkedin.com/in/calvintangkw/
Labels
Top Solution Authors