
Alteryx Designer Desktop Discussions

SOLVED

How to improve the performance when comparing two large datasets?

YSHuang
6 - Meteoroid

We are using Alteryx to compare two very large datasets, with file sizes around 20-30GB and approximately 150-200 columns. Is it possible to shorten the execution time or reduce memory usage by changing the comparison method (for example, by converting the data using a hash function first)? Any suggestions or ideas would be greatly appreciated!

kamanivk
8 - Asteroid

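As you mentioned, you can compute an MD5 hash over the columns that need to be compared, and then compare the hash values instead of comparing each column individually.

Alteryx provides three MD5 functions (MD5_ASCII, MD5_UNICODE, and MD5_UTF8). They are documented under String Functions: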

https://help.alteryx.com/current/en/designer/functions/string-functions.html#example-6846024-12
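
To illustrate the idea outside of Alteryx, here is a minimal Python sketch of hash-based comparison. It is not an Alteryx workflow: the function names (`row_hash`, `hash_index`, `compare`), the `id` key column, and the sample rows are all hypothetical, and it assumes both datasets share a unique key column. Each row is reduced to a single MD5 digest over the compared columns, so the comparison touches one short value per row rather than 150-200 fields.

```python
import hashlib


def row_hash(row, columns, sep="\x1f"):
    """Return the MD5 hex digest of the selected columns of one row."""
    # Join with a separator that is unlikely to occur in the data, so that
    # ("ab", "c") and ("a", "bc") do not produce the same input string.
    # Note: missing/None values are treated the same as empty strings here.
    joined = sep.join("" if row.get(c) is None else str(row[c]) for c in columns)
    return hashlib.md5(joined.encode("utf-8")).hexdigest()


def hash_index(rows, key, columns):
    """Map each row's key to the hash of its compared columns."""
    return {row[key]: row_hash(row, columns) for row in rows}


def compare(left_rows, right_rows, key, columns):
    """Return (keys only in left, keys only in right, keys whose hashes differ)."""
    left = hash_index(left_rows, key, columns)
    right = hash_index(right_rows, key, columns)
    only_left = sorted(left.keys() - right.keys())
    only_right = sorted(right.keys() - left.keys())
    changed = sorted(k for k in left.keys() & right.keys() if left[k] != right[k])
    return only_left, only_right, changed


# Hypothetical sample data: "id" is the assumed unique key.
left_rows = [
    {"id": 1, "name": "Ann", "city": "Oslo"},
    {"id": 2, "name": "Bob", "city": "Rome"},
    {"id": 3, "name": "Cy", "city": "Lima"},
]
right_rows = [
    {"id": 1, "name": "Ann", "city": "Oslo"},
    {"id": 2, "name": "Bob", "city": "Paris"},
    {"id": 4, "name": "Di", "city": "Kyiv"},
]

only_left, only_right, changed = compare(left_rows, right_rows, "id", ["name", "city"])
print(only_left, only_right, changed)
```

Keep in mind that hashing still has to read every value once, so any saving would come from the later join and comparison working on a key plus one short hash field instead of the full row. In the same spirit, the three Alteryx functions differ in how the input string is encoded before hashing, so both datasets should use the same one.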
