Hi all,
I have a workflow which brings in a single .csv file with 6 million rows of data and about 150 columns. The data covers two years, 2020 and 2021, but my workflow only needs the 2021 data. Am I able to filter the data on input using an SQL statement such as (SELECT * FROM 'file' WHERE YEAR = 2021), so that only part of the input data is brought in?
This would halve the data coming in and speed up the workflow. Also, if I could modify the SELECT, I could then bring in just the columns that I need.
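To illustrate the behaviour I'm after, here is a minimal sketch in Python using only the standard csv module. It is not how the workflow tool does it, just the idea: stream the file, keep only the 2021 rows, and keep only the wanted columns. The function name `read_filtered` and the column names `ID` and `AMOUNT` are made up for the example; `YEAR` is the column from my SELECT above.

```python
import csv


def read_filtered(path, year, columns, year_column="YEAR"):
    """Stream a CSV file and yield only the rows for `year`,
    restricted to the requested `columns`.

    Rows are read one at a time, so the whole file is never
    held in memory at once.
    """
    with open(path, newline="") as f:
        reader = csv.DictReader(f)  # uses the first line as the header
        for row in reader:
            # CSV values are strings, so compare against str(year)
            if row[year_column].strip() == str(year):
                yield {col: row[col] for col in columns}
```

For example, `list(read_filtered("file.csv", 2021, ["ID", "AMOUNT"]))` would be the rough equivalent of SELECT ID, AMOUNT FROM 'file' WHERE YEAR = 2021. Note that the whole file still has to be scanned; the saving is in what gets passed downstream.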
Any ideas?
Many thanks.
David.