Alteryx Designer Desktop Discussions

Nosal25 · ‎01-14-2019

After reading many articles about HTML parsing and NOT to use REGEX, which is how I am doing it, with a high level, but not 100% accuracy. Has anyone used a Python HTML parsing package within the Python Tool? I am parsing many fields of HTML/CLOB with REGEX but I am looking for a better way. Thank you

rd916 · ‎08-07-2019

---------------------------------------------------------------------------
ModuleNotFoundError                       Traceback (most recent call last)
<ipython-input-13-f33cf0cb8fd3> in <module>
----> 1 from bs4 import BeautifulSoup

ModuleNotFoundError: No module named 'bs4'

I am getting the following error- is there anything that I can do? or is there something I am doing wrong?

AndrewKramer · ‎08-07-2019

You need to install the Beautiful Soup Package into the Python Virtual Environment

Run Alteryx as Administrator and enter the following code in a cell in the Python Tool

from ayx import Package
Package.installPackages(['bs4'])
import bs4

You'll only have to install this package once, and then you should be good to go.

Alteryx Designer Desktop Discussions

Parsing HTML with Python Tool

Re: Unable to get an output

Re: Extracting the list of sheet names across mult...

Re: Chaining Apps

Re: Firm names parse

Re: Help with Multi-Row formula