I need to find all duplicates in a CSV file, where several different criteria determine what counts as a duplicate.
This is the logic I need to check against, using a CSV that has millions of records.
IF
(!IsEmpty([FIRSTNAME]) AND [FIRSTNAME]=[FIRSTNAME] AND
!IsEmpty([LASTNAME]) AND [LASTNAME]=[LASTNAME] AND
!IsEmpty([BILLINGADDRESS1]) AND [BILLINGADDRESS1]=[BILLINGADDRESS1] AND
!IsEmpty([BILLINGZIPCODE]) AND [BILLINGZIPCODE]=[BILLINGZIPCODE]
)
OR
(!IsEmpty([FIRSTNAME]) AND [FIRSTNAME]=[FIRSTNAME] AND
!IsEmpty([LASTNAME]) AND [LASTNAME]=[LASTNAME] AND
!IsEmpty([BILLINGCITY]) AND [BILLINGCITY]=[BILLINGCITY] AND
!IsEmpty([BILLINGZIPCODE]) AND [BILLINGZIPCODE]=[BILLINGZIPCODE]
)
OR
(!IsEmpty([FIRSTNAME]) AND [FIRSTNAME]=[FIRSTNAME] AND
!IsEmpty([LASTNAME]) AND [LASTNAME]=[LASTNAME] AND
!IsEmpty([BILLINGZIPCODE]) AND [BILLINGZIPCODE]=[BILLINGZIPCODE]
)
OR
(!IsEmpty([FIRSTNAME]) AND [FIRSTNAME]=[FIRSTNAME] AND
!IsEmpty([LASTNAME]) AND [LASTNAME]=[LASTNAME] AND
!IsEmpty([PHONE]) AND [PHONE]=[PHONE]
)
OR
(!IsEmpty([FIRSTNAME]) AND [FIRSTNAME]=[FIRSTNAME] AND
!IsEmpty([LASTNAME]) AND [LASTNAME]=[LASTNAME] AND
!IsEmpty([EMAIL]) AND [EMAIL]=[EMAIL]
)
THEN 1 ELSE -1 ENDIF

Based on the above criteria, the below would be True or False, if I did not miss anything in my checking.
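
For reference, here is a rough Python sketch of how the rules above could be read. It treats each rule as a composite key, and flags a record with 1 when at least one of its non-empty keys also appears on another record, otherwise -1. The names `RULES`, `rule_keys`, `flag_duplicates` and `flag_csv` are made up for illustration. Trimming whitespace and ignoring case when comparing values is an assumption, not something the expression above states.

```python
import csv
from collections import Counter

# One tuple per rule: every listed column must be non-empty and equal.
RULES = [
    ("FIRSTNAME", "LASTNAME", "BILLINGADDRESS1", "BILLINGZIPCODE"),
    ("FIRSTNAME", "LASTNAME", "BILLINGCITY", "BILLINGZIPCODE"),
    ("FIRSTNAME", "LASTNAME", "BILLINGZIPCODE"),
    ("FIRSTNAME", "LASTNAME", "PHONE"),
    ("FIRSTNAME", "LASTNAME", "EMAIL"),
]


def rule_keys(row):
    """Yield a (rule_index, values) key for each rule whose columns are all non-empty."""
    for i, cols in enumerate(RULES):
        # Assumption: compare values trimmed and case-insensitively.
        values = tuple((row.get(c) or "").strip().lower() for c in cols)
        if all(values):
            yield (i, values)


def flag_duplicates(rows):
    """Return 1 for rows that share at least one rule key with another row, else -1."""
    rows = list(rows)  # held in memory; a two-pass read of the file would avoid this
    counts = Counter(key for row in rows for key in rule_keys(row))
    return [
        1 if any(counts[key] > 1 for key in rule_keys(row)) else -1
        for row in rows
    ]


def flag_csv(path):
    """Hypothetical helper: flag every record of a CSV file with a header row."""
    with open(path, newline="") as f:
        return flag_duplicates(csv.DictReader(f))
```

As written, any two records that match on the first or second rule also match on the third (first name, last name and zip code), so the first two rules never change the result on their own.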