Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

How to improve the performance when comparing two large datasets?

YSHuang
6 - Meteoroid

We are using Alteryx to compare two very large datasets, with file sizes around 20-30GB and approximately 150-200 columns. Is it possible to shorten the execution time or reduce memory usage by changing the comparison method (for example, by converting the data using a hash function first)? Any suggestions or ideas would be greatly appreciated!

1 REPLY 1
kamanivk
8 - Asteroid

As you mentioned, you can use MD5 hash for the columns needed to be compared.

 

Search for 3 MD5 functions here.

https://help.alteryx.com/current/en/designer/functions/string-functions.html#example-6846024-12

Labels
Top Solution Authors