
Alteryx Designer Desktop Discussions


Best Practices for Standardizing Client Data

dillonrispoli
5 - Atom

Hi Alteryx Community,

 

I’m kicking off a new use tax project and would love to get your thoughts on how best to structure it in Alteryx.

 

Project Context:

For each new client, our team receives data files with varying formats. The first step in our workflow is to transform client data into a standardized column template (see attached Excel file for the headers we plan to use). This template will then feed into downstream use tax analysis.

 

Initial Approach:

My current thought is: Each time we onboard a new client, the team will analyze the client’s raw data file. They’ll then map each column from the client file to the corresponding standardized template column in our Excel file. Once mapped, we’ll run an Alteryx workflow to restructure the data into the standardized template format.

 

Open Questions:

Is there a more scalable or automated way to do the column mapping step across different clients?

Has anyone built something similar (e.g., using Alteryx macros or a mapping table approach)?

Are there recommended best practices for maintaining flexibility while still enforcing a standardized structure?

 

Would really appreciate any ideas, lessons learned, or examples from your own projects. Thanks in advance!

3 REPLIES
tofu159mac
5 - Atom

Hello,

To scale the column mapping in Alteryx, use a mapping table (Excel or database) with two columns: Client Column and Standard Column. Combine this with a Dynamic Rename tool (driven by the mapping table) inside a Batch Macro that runs once for each client.
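Outside of Alteryx, the same two-column mapping idea can be sketched in a few lines of Python. This is only an illustration of the logic, not how the Dynamic Rename tool is implemented; the column names, the sample data, and the helper names `load_mapping` and `rename_columns` are all made up for the example.

```python
import csv
import io

# Hypothetical mapping table with the two columns described above.
MAPPING_CSV = """Client Column,Standard Column
Inv No,invoice_number
Vendor,vendor_name
Amt,invoice_amount
"""

# Hypothetical client extract; "Notes" has no entry in the mapping table.
CLIENT_CSV = """Inv No,Vendor,Amt,Notes
1001,Acme,250.00,rush
1002,Globex,75.50,
"""


def load_mapping(text):
    """Read the mapping table into a dict: client column -> standard column."""
    reader = csv.DictReader(io.StringIO(text))
    return {row["Client Column"]: row["Standard Column"] for row in reader}


def rename_columns(rows, mapping):
    """Rename mapped columns; unmapped columns keep their original name."""
    return [{mapping.get(name, name): value for name, value in row.items()}
            for row in rows]


mapping = load_mapping(MAPPING_CSV)
client_rows = list(csv.DictReader(io.StringIO(CLIENT_CSV)))
standardized = rename_columns(client_rows, mapping)
print(list(standardized[0].keys()))
```

Onboarding a new client then means adding rows to the mapping table rather than editing the logic, which is the point of the mapping-table approach.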

 

Best regards,

 

Tofu

Gaurav_Dhama_
12 - Quasar

First question: what does the input data look like?

Second question, where are you planning to run this workflow? Local or gallery?

 

It largely depends on the source of the data. If each client uses a different source system, then I don't believe there is any easy way to standardize that step. For example, Oracle output will differ from SAP output, and these may vary again based on the template each client uses.

 

In that case, you can have multiple individual workflows that each do the transformation and map the data, and one bigger workflow that takes this mapped data through to the final output.
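As a rough sketch of that layout in Python terms (not Alteryx itself): one small transform per source system, and one shared step that only ever sees the standardized shape. The field names such as `INVOICE_NUM` and `DOC_NO`, the `SOURCE_TRANSFORMS` registry, and the sample rows are assumptions for illustration; real Oracle or SAP extracts will look different.

```python
def from_oracle(row):
    """Hypothetical per-source transform for an Oracle-style extract."""
    return {"invoice_number": row["INVOICE_NUM"],
            "invoice_amount": float(row["AMOUNT"])}


def from_sap(row):
    """Hypothetical per-source transform for an SAP-style extract."""
    return {"invoice_number": row["DOC_NO"],
            "invoice_amount": float(row["GROSS_AMT"])}


# Registry of individual "workflows", keyed by source system.
SOURCE_TRANSFORMS = {"oracle": from_oracle, "sap": from_sap}


def standardize(source, rows):
    """Route rows to the transform for their source system."""
    if source not in SOURCE_TRANSFORMS:
        raise ValueError(f"no transform registered for source {source!r}")
    transform = SOURCE_TRANSFORMS[source]
    return [transform(row) for row in rows]


def total_amount(rows):
    """Stand-in for the bigger downstream workflow on standardized rows."""
    return sum(row["invoice_amount"] for row in rows)


oracle_rows = [{"INVOICE_NUM": "A-1", "AMOUNT": "100.00"}]
sap_rows = [{"DOC_NO": "9001", "GROSS_AMT": "50.50"}]
combined = standardize("oracle", oracle_rows) + standardize("sap", sap_rows)
print(total_amount(combined))
```

The downstream step never needs to know which system a row came from, so adding a new source only means registering one more transform.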

apathetichell
20 - Arcturus

Hey, this is a fairly normal process in data modelling/data engineering. My go-to on this is a spreadsheet which contains mapping information (file type/client/column name) and which maps via dynamic rename/join/whatever to create the naming context I need downstream for processing. My downstream processing (i.e. your core workflow) becomes a batch macro, and in the outer workflow/macros which sit above that, I'd do my column name mapping. This may include minor prep (case/trimming white space), but ideally it should have a master list of client/filetype schema -> workflow required schema. This isn't really an Alteryx problem per se; it's more of a general data model issue, and it's something Alteryx is no worse at than most systems (I'd posit it's actually something Alteryx is quite good at).
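To make the master-list idea concrete, here is a minimal Python sketch, assuming a mapping keyed by client, file type, and column name, with the minor prep (case and whitespace) applied before lookup. The client names, file types, column names, and the `REQUIRED_COLUMNS` list are hypothetical placeholders, not the actual template from the attached Excel file.

```python
# Hypothetical required schema for the downstream (core) workflow.
REQUIRED_COLUMNS = ["invoice_number", "vendor_name", "invoice_amount"]

# Hypothetical master list: (client, file type, client column, required column).
MASTER_MAPPING = [
    ("client_a", "ap_ledger", "inv no", "invoice_number"),
    ("client_a", "ap_ledger", "vendor", "vendor_name"),
    ("client_a", "ap_ledger", "amt", "invoice_amount"),
    ("client_b", "gl_export", "document", "invoice_number"),
]


def normalize(name):
    """Minor prep: trim, lower-case, and collapse internal whitespace."""
    return " ".join(name.strip().lower().split())


def build_lookup(master, client, file_type):
    """Select the mapping rows that apply to one client/file type pair."""
    return {normalize(src): dst
            for c, f, src, dst in master
            if c == client and f == file_type}


def to_required_schema(row, lookup):
    """Rename mapped columns, drop the rest, and enforce the required schema."""
    renamed = {}
    for name, value in row.items():
        target = lookup.get(normalize(name))
        if target is not None:
            renamed[target] = value
    missing = [col for col in REQUIRED_COLUMNS if col not in renamed]
    if missing:
        raise ValueError(f"missing required columns: {missing}")
    return {col: renamed[col] for col in REQUIRED_COLUMNS}


lookup = build_lookup(MASTER_MAPPING, "client_a", "ap_ledger")
raw_row = {"  Inv No ": "1001", "VENDOR": "Acme", "Amt": "250.00", "Notes": "x"}
print(to_required_schema(raw_row, lookup))
```

Failing loudly on a missing required column is one way to keep the structure enforced while the mapping itself stays flexible; whether unmapped columns should be dropped or carried along is a design choice for the team.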
