Fuzzy Matching - numbers spelt vs digits
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hello Alteryx Community! I do hope someone can shed some light on the Fuzzy Matching options.
I'm looking find the best set of options to help with the situation of identifying when a number is in text form using letters vs. digits e.g. "TEN" vs. "10".
Would the Soundex w/Digits on the Generate Keys and "Words & Digits: Jaro Distance" on Match function.
I would really appreciate the community's thoughts!
Cheers
Dave
- Labels:
- Fuzzy Match
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
@Data_Dave
Do you have a sample input and output?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi Qiu - thanks for joining in. I don't have a sample set as yet - this will be pulled from another system with many many company names in various formats. The idea is that looking through two lists where person 1 might have noted down a company name using letters to spell out a number, and person 2 might have used digits.
But have some random examples:
1 and 1 --> one and one
Big 2 Toyota could be Big Two Toyota
Big 5 Sporting Goods could be Big Five Sporting Goods
21st Century Fox could be Twenty First Century Fox
Century 21 could be Century Twenty One
Forever 21 could be Forever Twenty One
concrete5 --> concrete five
four peaks brewery --> 4peaks brewery
Brothers Four Car Wash --> Brothers 4 Car Wash
Five Guys Burgers --> 5 guys burgers
The Ten Network --> 10 network/network 10
Does this help?
Thanks again, Dave
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
@Data_Dave
I wish I did not follow up 😁
I dont have a clue how to do this with Fuzzy Match, and I gave a few tries with it, did not get any luck.
The last I can think of is to have a matching table, quite brutal though.
1 --> one
2--> two
something like that.
sorry...
