I have a table that is unidentifiable linked to other data sources except for the fact that I can reproduce a composed key that would fully match in some cases and partially match in many other cases.
I'm trying to join letters on the address field (like John Doe Wallstreet 12 1000AA Amsterdam). On the other hand I have three data sources for different types of receivers. From all three sources I've reproduced a column that follows the format pattern in [letters].[address] an union joined them together to a single source with a field [AddressType].
In my last step I want to join my [letters] and union joined table together on the address fields. I've noticed 97 exact matches but also 903 unmatched from my 1000 letter sample file. I've applied data cleansing on new lines, double white spaces and trimming whitespaces (not all)
Is there a way to fuzzy join the tables? E.g. when the addresses have a similarity up to 90% or so?