HTML to Markdown conversion
Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
markashman
9 - Comet
‎05-17-2024
06:17 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
I have a use case whereby I want to scrape a website via its sitemap and download each of the html and then convert those into markdown for processing into Azure AI.
Currently we have a python exe that does both these tasks individually, but the 15k html files end up becoming 126k md files.
I was wondering if I could leverage Alteryx to do this as 1 process, both scrap and convert on a 1-1 bases.
Has anyone used Alteryx to convert HTML to Markdown, which I assume would be done via the Python tool?
Thanks
Mark
Labels:
- Labels:
- Transformation
1 REPLY 1
17 - Castor
‎05-19-2024
11:45 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
This is an interesting use case. I'd like to learn too - bumping this thread for you.
Calvin Tang
Alteryx ACE
https://www.linkedin.com/in/calvintangkw/
Alteryx ACE
https://www.linkedin.com/in/calvintangkw/
