Hi,
I'd like to use Alteryx Designer to extract specific text from a field and write it to a new column.
As I'm very new here, I would appreciate any suggestions on which Alteryx tools to use for this task. Here is an example of my text data:
Raw Text (Before Extraction) | Manufacturer (After Extraction) | Manufacturer (Duplicates Replaced) |
METHOD : CERTIFICATE_REQUIREMENT : (IF ANY)_ | - | - |
METHOD : CERTIFICATE_REQUIREMENT : (IF ANY)_ | - | - |
METHOD : CERTIFICATE_REQUIREMENT : (IF ANY)_ | - | - |
.... Description:_Manufacturer:_Manufacturer Part No:_Vendor/Supplier:… | - | - |
Modifiers: Parts_HEADER CHARACTERISTICS_Manufacturer:_Classification/service:_... | - | - |
...Modifiers: Parts_HEADER CHARACTERISTICS_Manufacturer:_Classification/service:_... | - | - |
Manufacturer: AB_Manufacturer Part Number: … | AB | AB |
...Modifiers: Parts_HEADER CHARACTERISTICS_Manufacturer:EB_Classification/service: … | EB | EB |
Noun: Compressors_Modifiers: Parts_HEADER CHARACTERISTICS_Manufacturer: NANI_Classification/service: … | NANI | NANI |
Noun: Compressors_Modifiers: Parts_HEADER CHARACTERISTICS_Manufacturer:_Classification/service: … | - | - |
Classification/service: OFF GAS COMPRESSOR_Part Name : GASKET OILCOOLER:A... | - | - |
Manufacturer: EQ_Part Name: GASKET NON-ASBESTOS_... | EQ | EQ |
Manufacturer :MIK_Classification/service:OFF GAS COMPRESSOR_... | MIK | MIK |
Manufacturer :HOR_Classification/service:OFF GAS COMPRESSOR_... | HOR | HOR |
Manufacturer :MJU_Classification/service:OFF GAS COMPRESSOR_Vendor/Supplier :... | MJU | MJU |
Noun: Compressors_Modifiers: Parts_HEADER CHARACTERISTICS_Manufacturer: ATL_Classification/service: ... | ATL | ATL |
Manufacturer :ATL LIMITED_Manufacturer Part Number : 1613126813_Classification/service:DIAPHRAGM_... | ATL LIMITED | ATL |
Noun: Compressors_Modifiers: Parts_HEADER CHARACTERISTICS_Manufacturer: TBW_Classification/service: REFRIGERATOR_Model:_... | TBW | TBW |
Manufacturer :FACE_Classification/service:OIL FILTERS_Part Name : FILTER CARTRIDGE 15 MICRON_Part Number/Position Number :PP-614-D10B-NC_... | FACE | FACE |
Manufacturer: GARL | GARL | GARL |
Noun: Compressors_Modifiers: Parts_HEADER CHARACTERISTICS_Manufacturer: C-5400_Classification/service: … | C-5400 | C-5400 |
Noun: Compressors_Modifiers: Parts_HEADER CHARACTERISTICS_Manufacturer: GARL 3000_Classification/service: … | GARL 3000 | GARL |
Manufacturer: VIK_Manufacturer Part Number: … | VIK | VIK |
I'd highly appreciate any suggestions,
Bordin W.
The first output column (the manufacturer after extraction) can be produced with a regular expression in the RegEx tool.
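To illustrate the kind of pattern the extraction needs, here is a minimal Python sketch. It is not the attached workflow; the pattern and the function name `extract_manufacturer` are assumptions derived only from the sample rows above: the label is `Manufacturer`, optionally followed by a space, then a colon, and the value runs up to the next underscore. Rows with no value, or with no `Manufacturer:` label at all, return `-`.

```python
import re

# Assumed pattern: "Manufacturer", optional whitespace, ":", optional whitespace,
# then everything up to the next "_" (or the end of the text).
# "Manufacturer Part No:" is not matched because the colon must directly
# follow "Manufacturer" (apart from whitespace).
MANUFACTURER_PATTERN = re.compile(r"Manufacturer\s*:\s*([^_]*)")


def extract_manufacturer(raw_text: str) -> str:
    """Return the manufacturer value from raw_text, or "-" if none is found."""
    match = MANUFACTURER_PATTERN.search(raw_text)
    if match is None:
        return "-"
    value = match.group(1).strip()
    return value if value else "-"


if __name__ == "__main__":
    samples = [
        "METHOD : CERTIFICATE_REQUIREMENT : (IF ANY)_",
        "Manufacturer :ATL LIMITED_Manufacturer Part Number : 1613126813_",
        "Noun: Compressors_Modifiers: Parts_HEADER CHARACTERISTICS_Manufacturer:_Classification/service:",
        "Manufacturer: GARL",
    ]
    for text in samples:
        print(extract_manufacturer(text))
```

The same expression should be usable in the RegEx tool with the Parse output method, since it relies only on a single capture group, though in that case unmatched rows would likely come back empty and need a Formula step to turn them into `-`.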
The second output column (duplicates replaced) might need the Fuzzy Match tool. I recommend taking a look at this article to learn how to use it:
https://community.alteryx.com/t5/Alteryx-Designer-Knowledge-Base/Tool-Mastery-Fuzzy-Match/ta-p/45485
Fuzzy Match usually takes a few attempts to get working; some people even consider it an art.
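For the two duplicates in the sample data (`ATL LIMITED` and `GARL 3000`), a much simpler rule than fuzzy matching would already work: collapse a name onto a shorter name from the same column when the shorter one is its first word or words. The Python sketch below shows that rule only. It is a stand-in for, not a description of, the Fuzzy Match tool, and the function name `collapse_to_shortest_prefix` is hypothetical.

```python
from typing import List


def collapse_to_shortest_prefix(names: List[str]) -> List[str]:
    """Replace each name with the shortest other name that is a whole-word prefix of it.

    Assumption: a variant such as "ATL LIMITED" should map to "ATL" only when
    "ATL" itself also appears somewhere in the column. "-" marks a missing value
    and is left untouched.
    """
    known = {name for name in names if name != "-"}
    result = []
    for name in names:
        canonical = name
        for candidate in known:
            # Require a space after the prefix so "GARL" does not swallow "GARLAND".
            if name.startswith(candidate + " ") and len(candidate) < len(canonical):
                canonical = candidate
        result.append(canonical)
    return result


if __name__ == "__main__":
    extracted = ["ATL", "ATL LIMITED", "GARL", "C-5400", "GARL 3000", "-"]
    print(collapse_to_shortest_prefix(extracted))
```

This rule will miss misspellings and reordered words, which is where an actual fuzzy match becomes necessary.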
I've attached a workflow with the RegEx step.
Cheers,
Thank you so much 🙂