I have a file with over 70 millions lines and I want to correlate two columns with some inputs
Column A | Column B | Selection |
C1 | 100001 | X |
C1 | 100002 | X |
C1 | 100002 | X |
C1 | 100003 | X |
S1 | 100001 | |
S1 | 200001 | |
S1 | 200002 | |
S1 | 200003 | |
UA1 | 100001 | X |
UA1 | 100002 | X |
UA1 | 100003 | X |
UA1 | 100004 | X |
UA1 | 100005 |
I would like to relate every line that has the same info of Column B between C1 and UA1. So I wouldn't have none of S1 on it only those who are C1 and UA1 with X.
I tried with the Unique tool but I didn't have success.
Solved! Go to Solution.
Hello @felipemorselli
I've put together the attached example that shows one way this can be achieved. By the way, the UA1|100004 line in your example should not be selected according to your criteria.. Let me know if you have any questions about the suggested methodology.
Hello @CharlieS !
Thanks for pointing the item that shouldn't be on my output file! It worked so fine!
Kind Regards,
Felipe M.