Hello Alteryx Community,
I'm working on a project that involves data cleansing and transformation using Alteryx Designer, with an additional requirement to integrate the processed data into Splunk for further analysis. I've encountered a challenge that I hope you can assist me with.
I have a large dataset containing sales data, and I'm trying to optimize a workflow that includes joining multiple tables, filtering specific columns, performing some aggregations, and then sending the transformed data to Splunk. Although the workflow is functional, it's running slower than I'd like, especially during the joining process and Splunk integration.
Here's a brief overview of my workflow:
Input Data: Reading data from multiple CSV files.
Joining Tables: Using the Join tool to combine sales data with product and customer information.
Data Cleansing: Removing null values, correcting data types, etc.
Aggregations: Summing sales by region and product category.
Splunk Integration: Sending the cleaned and aggregated data to splunk for further analysis using the Alteryx Splunk Connector.
Output: Writing the final result to an Excel file.
Could anyone please provide tips on how to optimize this workflow? Specifically, I'm looking for advice on improving the performance of the joining process and the integration with Splunk. Are there any specific tools or techniques within Alteryx or settings within the Splunk connector that can help in this scenario? If you need more information, please let me know, and I can provide additional details.
Thank you in advance for your help! Your insights and experience with both Alteryx and Splunk will be greatly appreciated.
Hi @emma_Wilson - Very succinct explanation and clarity is given, so thank you for that.
Before I give the tips below, it is important for you to have the latest version of Alteryx and also to have AMP engine turned on (if you turned it off). Generally, the bulk of your slowness comes from the Joins and Summarize tools.
With regards to connection to splunk, have you followed the best practice guide here: https://community.alteryx.com/t5/Engine-Works/Splunk-will-it-Alteryx/ba-p/554043 ?
With regards to your request, here are some tips for you:
Beyond that, it's using less tools wherever possible - and being dynamic helps.
If the above solved your need, kindly like & mark as accepted solution so that you may help others find the solution more quickly + to close the thread as is. Thanks!
Best regards,
Calvin Tang
https://www.linkedin.com/in/calvintangkw/