ACT NOW: The Alteryx team will be retiring support for Community account recovery and Community email-change requests Early 2026. Make sure to check your account preferences in my.alteryx.com to make sure you have filled out your security questions. Learn more here
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

HTML to Markdown conversion

markashman
9 - Comet

I have a use case whereby I want to scrape a website via its sitemap and download each of the html and then convert those into markdown for processing into Azure AI.

 

Currently we have a python exe that does both these tasks individually, but the 15k html files end up becoming 126k md files.

 

I was wondering if I could leverage Alteryx to do this as 1 process, both scrap and convert on a 1-1 bases.

 

Has anyone used Alteryx to convert HTML to Markdown, which I assume would be done via the Python tool?

 

Thanks

Mark

1 REPLY 1
caltang
17 - Castor
17 - Castor

This is an interesting use case. I'd like to learn too - bumping this thread for you.

Calvin Tang
Alteryx ACE
https://www.linkedin.com/in/calvintangkw/
Labels
Top Solution Authors