This site uses different types of cookies, including analytics and functional cookies (its own and from other sites). To change your cookie settings or find out more, click here. If you continue browsing our website, you accept these cookies.
Summary: Improved Data Cleanse tool processing speeds by up to 80% using Detour Tools.
[edit -- several images didn't show on original post]
As I was reviewing materials to teach an Alteryx course, one of the first pages discussed the cleanse tool. As many know, the cleanse tool in Alteryx is an Alteryx macro. It's commonly used and also one of the slower tools especially when working with large datasets. Next thought: Could it be made faster with the same results? (not studying each cleanse formula, only how the flow works)
Let's look at the first couple Action Tools: The first action tool first tests is to see if Modify Case is checked.. Etc.. 2nd action tool first tests to see if Replace Blanks is checked….
Notice all the data flows through all the multi-field tools. So, assuming the user did not "choose an item" each multi-field tool processes every record of every checked field flowing through the tool even if it just keeping the same value _CurrentField_
When working with large datasets this becomes very significant and wastes server time, your time. Thankfully, Alteryx provides a detour tool. Hook the action tool to the detour tool to divert it around the multi-field tool. Be sure to close the bypass with a detour end tool. Depending on the selections they make, it will be equal to or faster than the current cleanse macro structure. Testing showed up to an 80% improvement depending on the selections made.
In Summary, understanding how Alteryx processes the data can provide insights into drastically improving performance of your workflows.