Alteryx Designer: Input XML Child Element Name values duplicates

mhoogstraten
5 - Atom

Recently we started looking into converting XML / XAF files in Alteryx. We had some success, but then ran into severe performance issues due to field bloat.

 

To explain the issue: when choosing journal as the XML Child Element Name, the XML looks as follows:

[Image: 2024-03-20_12h24_44.png (screenshot of the XML structure)]

The above XML string would then be converted into the following Alteryx data:

| journal | jrnID | desc | jrnTp | transaction | transaction_2 |
|---|---|---|---|---|---|
|  | 010 | Sales Ledger | A |  |  |
|  | 010 | Sales Ledger | A |  |  |

 

This in itself would not immediately be an issue. However, a problem occurs when working with a large XML file. In a recent example, a specific journal had a maximum of [transaction_29882]. This means the Input tool output has that number of fields for all imported rows (even those that do not actually have that many transactions).

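| journal | jrnID | desc | jrnTp | transaction | transaction_2 | ... | transaction_29882 |
|---|---|---|---|---|---|---|---|
|  | 010 | Sales Ledger | A |  |  |  | NULL |
|  | 010 | Sales Ledger | A |  |  |  | NULL |
|  | 020 | Purchase Ledger | B |  |  |  |  |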

 

If you have a total of 8 million entries for the journal XML Child Element Name, one of which has 29,882 transactions, this amounts to roughly 239 billion cells of data (the majority being NULL values). This of course takes a very long time to import.
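
For reference, the cell count above can be checked with a few lines of Python. This is only a back-of-the-envelope sketch: the function name `wide_cell_count` is made up for illustration, and it counts only the transaction columns, ignoring the handful of fixed fields such as jrnID and desc.

```python
def wide_cell_count(rows: int, max_children: int) -> int:
    """Cells produced when every row carries one column per possible child."""
    return rows * max_children


rows = 8_000_000           # journal entries, as stated in the post
max_transactions = 29_882  # widest journal, i.e. the highest transaction_N field

cells = wide_cell_count(rows, max_transactions)
print(f"{cells:,}")  # 239,056,000,000
```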

 

Question:

Is there a way to import such XML files more effectively?

1 REPLY
dsandmann
8 - Asteroid

It would be helpful if you provided an example of what you have tried, as well as an example that could be copied and pasted into a workflow instead of an image of the structure.

 

This should be possible and may require some iterative/batch macros, but you'll need to be a bit more intentional about the way you interact with the structure.
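
As a rough illustration of being more intentional about the structure, the sketch below streams the file and emits one row per transaction (long format) instead of one column per transaction. It is plain Python using the standard library, not an Alteryx macro, and it rests on assumptions: the element names journal, jrnID, desc, jrnTp and transaction come from the post, while the `auditfile` root, the `nr` field and the function names are hypothetical and would need to be adapted to the real XAF file.

```python
import io
import xml.etree.ElementTree as ET


def local_name(tag: str) -> str:
    """Strip an XML namespace, e.g. '{uri}journal' -> 'journal'."""
    return tag.rsplit("}", 1)[-1]


def iter_transactions(source, parent="journal", child="transaction"):
    """Yield one dict per child element, repeating the parent's simple fields."""
    for _event, elem in ET.iterparse(source, events=("end",)):
        if local_name(elem.tag) != parent:
            continue
        # Leaf fields of the parent (jrnID, desc, jrnTp in the post's example).
        header = {
            local_name(e.tag): (e.text or "").strip()
            for e in elem
            if local_name(e.tag) != child and len(e) == 0
        }
        for tr in elem:
            if local_name(tr.tag) != child:
                continue
            row = dict(header)
            # Prefix child fields so they cannot overwrite parent fields.
            for field in tr:
                if len(field) == 0:
                    key = f"{child}_{local_name(field.tag)}"
                    row[key] = (field.text or "").strip()
            yield row
        elem.clear()  # drop the finished parent's children to limit memory use


# Hypothetical sample; the real structure was only posted as an image.
sample = b"""<auditfile>
  <journal>
    <jrnID>010</jrnID><desc>Sales Ledger</desc><jrnTp>A</jrnTp>
    <transaction><nr>1</nr></transaction>
    <transaction><nr>2</nr></transaction>
  </journal>
  <journal>
    <jrnID>020</jrnID><desc>Purchase Ledger</desc><jrnTp>B</jrnTp>
    <transaction><nr>7</nr></transaction>
  </journal>
</auditfile>"""

rows = list(iter_transactions(io.BytesIO(sample)))
for row in rows:
    print(row)
```

With this layout the number of columns stays fixed no matter how many transactions a single journal contains, so the widest journal no longer dictates the width of every row.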
