community
cancel
Showing results for 
Search instead for 
Did you mean: 

Alteryx designer Discussions

Find answers, ask questions, and share expertise about Alteryx Designer.
Upgrade Alteryx Designer in 10 Steps

Debating whether or not to upgrade to the latest version of Alteryx Designer?

LEARN MORE

Fuzzy Match to Match on Letters and Numbers, Ignore Periods and Commas

Meteoroid

Hi Community,

 

I'm working with a dataset that includes entities, their utilization amounts and corresponding date. 

Some of the entity names are formatted differently.

 

For example:

Entity           Utilization Amount      Date

A, INC.                  300                   5/13/2007

A INC.                   500                   6/20/2009

A  INC                   700                   12/23/2019

 

I want to keep all of the entries and am curious if there's a way to use Fuzzy Match to ensure that the workflow knows that all three of those entities listed are equivalent.

 

For instance, I might want to pull all of the utilization amounts for the given entity above in a spreadsheet that includes the output, but then if I try to filter in Excel, I would have to manually select "A, INC", "A INC.", and "A."

 

Is there a way to avoid this, so that the workflow matches

 "A, INC." = "A INC." = "A INC"?

 

Perhaps I need to ignore the commas and the periods? If anyone has ideas on how to approach this, I would really appreciate it! Many thanks in advance.

Alteryx Partner

Hello @vanorve ,

 

If your problem is the punctuation you could either:

 

1.- Duplicate the field and use the data cleansing for it as a field only to filter in your excel.

2.- Use directly the data cleansing on the field but losing all the punctuation.

 

If there are field which are more different in your WF you would need to use the fuzzy match as you say.

 

In summary: As far as I know there is no way to keep the information you want to filter through later in the excel file. You would need to add another field just for the sorting.

 

Cheers

Bolide

Take a look at the Example under the Make Group tool

 

It combines total sales amounts for vendors with similar names, using a fuzzy match

 

Uses tools: Fuzzy Match, Make Group, Find Replace

 

Example:

GLEN & ALLENS HARDWARE CO
GLEN AND ALLENS HARDWARE CMPY

 

BROWNSVILLE DEPARTMENT OF REVENUE
BROWNSVILLE DEPT OF REVENUE

 

CO TEXTILE AND VARNISH
COLORADO TEXTILE AND VARNISH

Labels