How to extract table information from a 15MB HTML file
I'm trying to query a table in an HTML file that is 15MB in size. The file is saved locally.
The HTML to text tool only allows just over 5MB.
Have any of you come across this before?
Thanks,
J
- Labels:
- Help
- Input
- Parse
- Text Mining
@JayTray
This can be done using Python, or by reading the file as CSV and then transforming it.
See these references:
HTML format as Alteryx input - Alteryx Community
Convert HTML to text - Alteryx Community
Hope this helps.
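For the Python route, here is a minimal sketch using only the standard library. It reads the file in chunks, so the 15MB size should not be a problem. The names `TableExtractor` and `extract_tables` are made up for illustration, and the sketch assumes the file is UTF-8, that cells and rows have explicit closing tags, and that tables are not nested.

```python
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collects every <table> as a list of rows; each row is a list of cell strings."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.tables = []     # finished tables
        self._table = None   # rows of the table currently being read
        self._row = None     # cells of the row currently being read
        self._cell = None    # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self._table = []
        elif tag == "tr" and self._table is not None:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Join the fragments and collapse runs of whitespace.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self._table.append(self._row)
            self._row = None
            self._cell = None
        elif tag == "table" and self._table is not None:
            self.tables.append(self._table)
            self._table = None
            self._row = None
            self._cell = None

    def handle_data(self, data):
        # Text outside a table cell is ignored.
        if self._cell is not None:
            self._cell.append(data)


def extract_tables(path, chunk_size=1024 * 1024, encoding="utf-8"):
    """Feed the file to the parser chunk by chunk and return all tables found."""
    parser = TableExtractor()
    with open(path, encoding=encoding, errors="replace") as f:
        for chunk in iter(lambda: f.read(chunk_size), ""):
            parser.feed(chunk)
    parser.close()
    return parser.tables
```

Calling `extract_tables("report.html")` (the file name is a placeholder) returns one list of rows per table, which could then be written out with the `csv` module and read into the workflow as a normal CSV input. If pandas is available, `pandas.read_html` is another option, though it needs an extra parser package such as lxml installed.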
Ask The Information Lab... @OllieClarke?
@apathetichell this is one of our French team’s tools.
The Wu-Tang Clan of Alteryx community.
I'll check this internally with my colleague who created the macro!
@JayTray Are you getting an error message? If so, could you share a screenshot of it? Any information about the structure of the HTML file you are trying to parse, or about how you know the macro only handles up to 5MB of data, would also be great. A sample workflow would be ideal!