Hi,
I have two tables A on MSSQL and B on Redshift. I need to incrementally send data from A to B and I can't trust any created_at or updated_at fields.

Because table A is 9M records I was doing a join ONLY on the primary key to know which records are missing from my Redshift table.
How can I do an Input Data with an input like ... SELECT field1,field2 FROM A WHERE primary_key IN list, where list is the output of the join ids (the missing ones)
Doing a join primary key is easy and fast, doing a join on the whole table just to select a few records is a waste of CPU.
Any ideas?
Thanks
Joao