Description:
For a project, I need to anonymize a set of values in my data set. For example, there is a reporter row with a user name such as "mtruss". Every occurrence of "mtruss" should be replaced with "Reporter 1".
I have to repeat this for every unique user name.
Because I need to build a classification / pattern recognition based on the users and their relationship to other rows, it is important that "mtruss" always becomes "Reporter 1" and that the next user, e.g. "jdoe", always becomes "Reporter 2". In other words, the relationship of "Reporter 1" to a given entry should remain intact and representative.
Conditions:
I have access to a list (CSV) of every unique value that needs to be anonymized, in this case all user names.
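
To make the requirement concrete, here is a rough sketch of the mapping I have in mind, written in plain Python rather than Dataprep. The function names, the "reporter" and "issue" column names, and the sample rows are all hypothetical; the in-memory CSV only stands in for my real list of unique user names.

```python
import csv
import io


def build_mapping(unique_names, prefix="Reporter"):
    """Give each distinct name a stable label, numbered by first appearance."""
    mapping = {}
    for name in unique_names:
        if name not in mapping:
            mapping[name] = f"{prefix} {len(mapping) + 1}"
    return mapping


def anonymize(rows, column, mapping):
    """Return copies of the rows with `column` replaced by its label.

    Raises KeyError if a value is missing from the mapping, which should
    not happen as long as the CSV really lists every unique value.
    """
    return [{**row, column: mapping[row[column]]} for row in rows]


# Stand-in for the CSV of unique user names (hypothetical contents).
unique_csv = io.StringIO("mtruss\njdoe\n")
names = [line[0] for line in csv.reader(unique_csv) if line]
mapping = build_mapping(names)

# Hypothetical data rows; only the "reporter" value is replaced.
rows = [
    {"reporter": "mtruss", "issue": "A-1"},
    {"reporter": "jdoe", "issue": "A-2"},
    {"reporter": "mtruss", "issue": "A-3"},
]
anonymized = anonymize(rows, "reporter", mapping)
```

The point is that the same user name always resolves to the same label, so both "mtruss" rows end up as "Reporter 1" and the links to the other columns are preserved.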
Question:
Is there a standardized way to do this with Dataprep?