Dear community,
A quick question on comparing two datasets.
I have two large datasets (over 100 columns and 150k rows each) that are largely identical, but with some small differences I want to identify. I used a method described on the forum, which combines transpose and crosstab to compare every single cell between the two tables.
The issue is that the original data, which contains a mixture of strings, numbers and dates, is all converted to V_WString with a size of 2048 after going through the transpose/crosstab combo, so every cell is compared as text, which creates some unusual mismatches.
The best example is table A having a value of, say, 9.48 while table B has 09.48 in the corresponding cell. That leading zero obviously shouldn't be there (I'm curious why it appeared in the first place), and it's creating a lot of false-positive noise that masks the real differences.
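To make the problem concrete, here is a minimal Python sketch of the effect. It is not my actual workflow, and the `normalize` helper is a hypothetical name made up for illustration. It assumes each cell arrives as a plain string, and shows why a direct text comparison flags 9.48 vs 09.48, plus one possible way to put numeric-looking values into a canonical form before comparing.

```python
from decimal import Decimal, InvalidOperation


def normalize(cell: str) -> str:
    """Hypothetical helper: canonicalise numeric-looking strings, leave the rest unchanged."""
    text = cell.strip()
    try:
        number = Decimal(text)
    except InvalidOperation:
        # Not a number (e.g. a name or a date string): keep the text as it is.
        return text
    if not number.is_finite():
        # "NaN" or "Infinity" parse as Decimal but are left untouched here.
        return text
    # normalize() drops trailing zeros; the "f" format avoids exponent notation.
    return format(number.normalize(), "f")


a, b = "9.48", "09.48"
print(a == b)                        # False: the raw strings differ
print(normalize(a) == normalize(b))  # True: both become "9.48"
```

One caveat with this kind of approach: it would also strip leading zeros from values that only look numeric, such as ID codes, so it probably should be applied only to columns that were numeric in the original data.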
Any ideas on how to deal with this? Thank you!