Hello
I have a list of URL's to be downloaded from a server. The files are of different types like csv, pdf, text etc. For all those days when the Stock Markets were open, the new files are to be downloaded from that server.
Once I download all these individual files, I need to do some data cleaning tasks on them and then upload them into the database into their Respective Tables. One table for each file type, already exist in the database and the new data simply needs to be appended to the respective tables on a daily basis.
Now, I am not being able to figure out, if I should be downloading all these 10+ Data Files into a single folder for "EACH DATE".
OR
if I should have a single folder for each "FILE CATEGORY", and then put all the files for all different dates into that particular folder alone. Which would look something like this, for example -
C:\Data\Alteryx\PR-Files
Date1File
Date2File
Date3File
C:\Data\Alteryx\DAT-Files
Date1File
Date2File
Date3File
C:\Data\Alteryx\EQUITIES-Files
Date1File
Date2File
Date3File
Should the Download Directory Structure be "DATE WISE" or "FILE NAME WISE" in my particular case, when there are 10+ different files to be downloaded on a Daily Basis and needs to be processed and then appended into the Respective Database Tables.
Which particular method makes more sense from the Alteryx Workflow Efficiency Perspective? Which method will allow me to have more control on the workflow designing and then running the workflow on a daily basis.
I even need to include a step at the end of these workflows which will actually VERIFY that all these 10+ files have been downloaded, processed and uploaded to the respective database tables. I need to get this FINAL REPORT for each of these 10+ individual files, on a Daily Basis, so that I can just look at this report and be sure that the whole workflow has happened successfully for all of those files. And if there was a failure for any particular file then this report will automatically HIGHLIGHT/NOTIFY that issue and then I can fix the problem for that particular file.
So I am thinking about choosing that particular directory structure, which will allow me to perform the above type of VERIFICATION and generate the Final Report of Success or Failure for Each Data File, for Every Single Date.
Please suggest any ideas that you have in this regards.
Thanks a lot
PS: You might have a look at my previous thread as well, where I generated these URL lists to be downloaded on a daily basis - https://community.alteryx.com/t5/Alteryx-Designer-Discussions/Workflow-to-generate-List-of-URL-to-be-downloaded/m-p/739044