Dear all,
I'm looking for a better solution for incremental loading records into a database than I have. Would be appreciated for any ideas.
Case:
I have to perform a daily loading from sftp to MS SQL table.
The database includes historical records over 3 years (~10 mln rows).
External data source (sftp) has a flat file that is updated every day and includes data for 3 latest months.
Every day we need to get data from sftp and override existing records from the existing table.
As Is approach:
1. Extract the entire dataset from the database
2. Filter data to exclude 3 latest months
3. Extract data from SFTP
4. Apply union tool
5. Override table via output tool
The main limitation of the current approach is low performance due to operating under an entire dataset.
I suppose there are should be another way such as extracting records for only 3 latest month from database and appending new from sftp, but I don't fully aware of how to perform that way using Alteryx.