Data validation using multiple data sets
Hello!
I am trying to validate addresses in one data set against two other data sets (one with cleaned states and the other with cleaned zip codes). I need to check for any mismatches and then mark those rows for review. I am not sure how to do this properly, though, and would appreciate any advice on the best approach.
- Labels:
- Workflow
For the states, maybe you could uppercase the state field in both data sets and then join them on that field. Any rows that fall out of the left or right unjoined output (whichever side holds the data set being validated) have a state with no match in the cleaned list, i.e. an invalid state. The same join approach can be used for the zip codes.
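For anyone who wants to see the logic outside of a workflow, here is a rough Python sketch of the same idea. It is only an illustration, not an Alteryx feature: the sample rows, the field names (`id`, `state`, `zip`) and the helper `unmatched` are all made up for this example. It uppercases the values on both sides and keeps the rows that find no match, which is what the unjoined output of a join gives you.

```python
# Hypothetical sample data; the field names are assumptions, not from the thread.
addresses = [
    {"id": 1, "state": "tx", "zip": "75001"},
    {"id": 2, "state": "Texs", "zip": "75001"},
    {"id": 3, "state": "CA", "zip": "9021"},
]
clean_states = ["TX", "CA", "NY"]
clean_zips = ["75001", "90210"]


def unmatched(rows, field, reference):
    """Return the rows whose `field` value has no match in `reference`.

    Both sides are uppercased first, so "tx" matches "TX".
    This mirrors the unjoined side of a join on that field.
    """
    valid = {str(value).upper() for value in reference}
    return [row for row in rows if str(row[field]).upper() not in valid]


bad_states = unmatched(addresses, "state", clean_states)
bad_zips = unmatched(addresses, "zip", clean_zips)

print([row["id"] for row in bad_states])  # [2]
print([row["id"] for row in bad_zips])    # [3]
```

Uppercasing only handles case differences; stray spaces or abbreviations versus full state names would still show up as mismatches and would need their own cleanup step.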
Hi Brandon, thank you! After the join, would I need to use the Union tool to combine everything back together so I can export it as one document?
Yep, that's probably a good idea, but maybe add a Formula tool before the Union that writes "state issue" or "zip issue" to a new field, so that you know which issue exists for each row.
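As a rough sketch of the flag-then-combine step in Python (again just an illustration; the field names, the sample rows and the `flag_rows` helper are hypothetical), each row gets an `issue` field and all rows end up in one list ready for export:

```python
# Hypothetical sample data; the field names are assumptions, not from the thread.
addresses = [
    {"id": 1, "state": "tx", "zip": "75001"},
    {"id": 2, "state": "Texs", "zip": "75001"},
    {"id": 3, "state": "CA", "zip": "9021"},
    {"id": 4, "state": "ZZ", "zip": "00000"},
]
clean_states = ["TX", "CA", "NY"]
clean_zips = ["75001", "90210"]


def flag_rows(rows, state_ref, zip_ref):
    """Return every row with an added "issue" field describing what failed."""
    valid_states = {s.upper() for s in state_ref}
    valid_zips = set(zip_ref)
    flagged = []
    for row in rows:
        issues = []
        if row["state"].upper() not in valid_states:
            issues.append("state issue")
        if row["zip"] not in valid_zips:
            issues.append("zip issue")
        # An empty string means the row passed both checks.
        flagged.append({**row, "issue": "; ".join(issues)})
    return flagged


for row in flag_rows(addresses, clean_states, clean_zips):
    print(row["id"], repr(row["issue"]))
```

One design choice here: a row that fails both checks is kept once with both labels, rather than appearing twice as it would if the state mismatches and zip mismatches were simply stacked on top of each other.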
