Hi All –
I was asked to be part of a proof-of-concept project at my company that potentially could be a big deal for me. What’s going on is that our Agencies send our Network heads various client information templates that all get put into a single workbook by Network and sent to corporate. Some workbooks could have several tabs with information. Much of the information is somewhat standard, but the format could be wildly different. Now in the past, I have aggregated hundreds of templates via batch macro, but my experience has always been that the templates are all the same and locked down – so not to bad. However, these templates are not locked down with no intention of doing so) and the format could change, but the information should always be somewhat consistent.
What happens right now is that all these templates are sent to a couple of (non-Alteryx) developers in Hong Kong and they spend some time getting the information into PowerBi via the internally built Power query. Much of it is still manual and laborious. There could be 10-15 workbook with varying amount of tabs in them,
Technology management decided that Alteryx could be the answer, but I have always predicated this with some consistency in the source data format. I am now grappling more with how to make this more dynamic where Alteryx could adjust for some of these differences.
Attached are some examples of how data could come in. Each workbook might have a mix of these in different tabs. The goal would be to line up as much of the data in an automated way as possible. I am looking for advice on handling workbooks like the one attached where each workbook could have different data structures like the one attached.
If you have a workflow to share – or want to try your hand at these – I would love to see the techniques, you employ in bringing structure to the unknown (Never stop learning.).
Each template has some core information, like entity, client, Net Revenue (USD), but not all of it. The goal is to create a single dataset that lines up as best as possible (all entities in column A, Name in B, Net Revenue (USD) in a column and Net Revenue (Local) in another column…etc. Not every bit of information will be in every spreadsheet, but there is some sort of consistency. But dynamic enough to adjust to some changes or even new templates that get added.
Thanks in advance for your assistance!
The data has all been randomized.
Seth