Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Parsing HTML with Python Tool

Nosal25
8 - Asteroid

After reading many articles about HTML parsing and NOT to use REGEX, which is how I am doing it, with a high level, but not 100% accuracy.  Has anyone used a Python HTML parsing package within the Python Tool? I am parsing many fields of HTML/CLOB with REGEX but I am looking for a better way.  Thank you

 

11 REPLIES 11
rd916
6 - Meteoroid
---------------------------------------------------------------------------
ModuleNotFoundError                       Traceback (most recent call last)
<ipython-input-13-f33cf0cb8fd3> in <module>
----> 1 from bs4 import BeautifulSoup

ModuleNotFoundError: No module named 'bs4'

I am getting the following error- is there anything that I can do? or is there something I am doing wrong? 

AndrewKramer
Alteryx Alumni (Retired)

You need to install the Beautiful Soup Package into the Python Virtual Environment

 

Run Alteryx as Administrator and enter the following code in a cell in the Python Tool

from ayx import Package
Package.installPackages(['bs4'])
import bs4

 

You'll only have to install this package once, and then you should be good to go. 

Labels