In case you missed the announcement: The Alteryx One Fall Release is here! Learn more about the new features and capabilities here
ACT NOW: The Alteryx team will be retiring support for Community account recovery and Community email-change requests Early 2026. Make sure to check your account preferences in my.alteryx.com to make sure you have filled out your security questions. Learn more here
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Parsing HTML with Python Tool

Nosal25
8 - Asteroid

After reading many articles about HTML parsing and NOT to use REGEX, which is how I am doing it, with a high level, but not 100% accuracy.  Has anyone used a Python HTML parsing package within the Python Tool? I am parsing many fields of HTML/CLOB with REGEX but I am looking for a better way.  Thank you

 

11 REPLIES 11
rd916
6 - Meteoroid
---------------------------------------------------------------------------
ModuleNotFoundError                       Traceback (most recent call last)
<ipython-input-13-f33cf0cb8fd3> in <module>
----> 1 from bs4 import BeautifulSoup

ModuleNotFoundError: No module named 'bs4'

I am getting the following error- is there anything that I can do? or is there something I am doing wrong? 

AndrewKramer
Alteryx Alumni (Retired)

You need to install the Beautiful Soup Package into the Python Virtual Environment

 

Run Alteryx as Administrator and enter the following code in a cell in the Python Tool

from ayx import Package
Package.installPackages(['bs4'])
import bs4

 

You'll only have to install this package once, and then you should be good to go. 

Labels
Top Solution Authors