I'm adding records to a redshift table, and need to make sure there are no duplicates. I have multiple keys to uniquely identify records. (about 6 keys).
The standard output data tool allows "Update - insert if new". But it is very (too) slow. I want to use the in-db tools to to speed up the processing. I haven't been able to figure out how to use the in-db tools to identify duplicates (join tool doesn't produce non-matches, no unique tool, sample tool doesn't allow group by, summarize doesn't allow "first", etc.). So, I'm requesting a feature that would allow an in-database upsert. Or, if anyone has other ideas, I'd appreciate it! Thank you!