Hi There. I am trying to perform simple data profiling on a large number of data files but have run into some problems. I hope someone can help me out.
For a single data input file, I am able to use the Field Summary tool to perform the required data profiling and write the results to an Excel workbook. I now wish to extend this to be run on (for example) 100 data input files and output the results into separate sheets in an Excel workbook. I have used a macro to read in my 100 input files (this works fine) but because I feed this into a Select tool, I end up with a single dataset (encompassing the 100 files I inputted) on which the data profiling is being performed. Please see below a screenshot of my current workflow.
To be clear, what I want is as follows:
[input x files] > [perform data profiling on each of the x files] > [output results for each of the x files into separate CSV files/Excel sheets]
Any help would be appreciated.
Thanks!
