how to compare columns and find similarity and output the similarities in new column
above this is 2 column(remarks & item name) for which i would like to compare them. In the remarks column, it contains data which are not structured like ( for one same items each personnel had given their own terminology .) i need to filter it out and make into new column and then filter it out later for analysis
sample file attached.
i need to filter out remarks column with respect to OWS(component). but due to manual data recording , the terminology used are different . OWS , 15 PPM calibration, oily water separator, calibration & certification 15ppm all these words denote - OWS component.
\
Basically i want to filter out component - OWS
1. fuzzy match can be used to group these words and assume it as OWS
2. or can i find multiple words and replace them with OWS
User | Count |
---|---|
106 | |
82 | |
72 | |
54 | |
40 |