Hi Team,
I am working on a project with a large dataset and am trying to match the company name against the business name.
For example:
Company Name | Business Name | Match | Desired Result |
Va run | Varun Shashidhar | FALSE | TRUE |
VarunShashidhar | Var un INC | FALSE | TRUE |
V a RUN | Varun Shashidhar | FALSE | TRUE |
VarunShashidhar | Varun Sh ashidhar | FALSE | TRUE |
I am hoping someone can help, because this is the best I can do and I would like to see how this gets resolved.
I added a Formula tool and created two new columns capturing the company name and the business name.
Then I added a Data Cleansing tool, selected only the two newly created fields, and checked Replace with Blanks, Replace with 0, Leading and Trailing Whitespace, tabs etc., and All Whitespace, and set Modify Case to lower.
Then I added a Formula tool with this formula (the field names are the newly created fields):
IF REGEX_Match([adjusted company name], ".*" + [adjusted business name] + ".*") OR REGEX_Match([adjusted business name], ".*" + [adjusted company name] + ".*") THEN "TRUE" ELSE "FALSE" ENDIF
My output was TRUE, FALSE, TRUE, TRUE.
The FALSE was for the row whose business name contains INC.
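For anyone who wants to check the logic outside Alteryx, here is a rough Python sketch of the same idea: strip all whitespace, lowercase both names, then test whether either cleansed name contains the other. The function names `cleanse` and `names_match` are made up for illustration and are not Alteryx functions, and the plain substring check stands in for the `REGEX_Match` call with `.*` on both sides.

```python
import re

def cleanse(name: str) -> str:
    """Approximate the cleansing step: remove all whitespace, then lowercase."""
    return re.sub(r"\s+", "", name).lower()

def names_match(company: str, business: str) -> bool:
    """Return True when either cleansed name contains the other."""
    a, b = cleanse(company), cleanse(business)
    if not a or not b:
        # An empty string is a substring of everything, so treat it as no match.
        return False
    return a in b or b in a

# The four sample rows from the question.
rows = [
    ("Va run", "Varun Shashidhar"),
    ("VarunShashidhar", "Var un INC"),
    ("V a RUN", "Varun Shashidhar"),
    ("VarunShashidhar", "Varun Sh ashidhar"),
]

results = [names_match(company, business) for company, business in rows]
print(results)
```

On the sample rows this gives the same TRUE, FALSE, TRUE, TRUE pattern: after cleansing, the second row compares "varunshashidhar" with "varuninc", and neither contains the other. A plain substring test also avoids trouble when a name contains regex metacharacters such as "." or "(", which would be interpreted as pattern syntax if the field were concatenated into a regular expression unescaped.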
@VarunShashidhar, another option is the Fuzzy Match tool:
https://community.alteryx.com/t5/Tool-Mastery/Tool-Mastery-Fuzzy-Match/ta-p/45485
Thank you, Gibson.
I looked at the Fuzzy Match tool and could not get four TRUEs. Can you provide an example with this dataset? I am curious how to do this with that tool.
I did try the Fuzzy Match tool, but I could not get the expected result. I think the RegEx approach is the better option than Fuzzy Match for this situation.