I am currently pulling in files using a directory tool with a wildcard on the file name and then feeding that to a dynamic input tool. An issue that has come up is that the number of overall columns can vary from each xlsx, however the main columns that I want will always exist- those other columns can be dropped.
Using the current method, starting at the top of the files in the folder, they will all be brought in unless the schema doesnt match the first file that was read in.
I am thinking i need to be able to loop through each file, select the main columns that i know exist in each from that specific sheet, then append them as those columns are extracted.
Any ideas on what I might need to do to get this working? The above was just my thoughts on what i think i need to do, please feel free to recommend other options.
Thanks for the help!