Some thoughts below. I'll put the note in a text file so that I have something to attach to my response.
Done.
Here's my solution:
Looks like a memory issue in the append cluster tool.
cannot allocate vector of size 7531.1 Gb
Suggestions below.
Memory issue on the Append Cluster tool, which suggests the clustering itself is working but the labeling step isn't. So we can batch the latter.
Nice to know!
As an Alteryx CSE I would investigate the error message and the tools used in the workflow and quickly find out:
"Error messages beginning 'cannot allocate vector of size' indicate a failure to obtain memory, either because the size exceeded the address-space limit for a process or, more likely, because the system was unable to provide the memory."
Then I'd ask the community, and they'd suggest a batch macro, since only the Append Cluster tool is throwing the error. I'd then send the user this example of a batch macro workflow that uses the Append Cluster tool. It uses the Tile tool to split the data into X tiles (batches) and then runs those batches through the macro.
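For anyone who wants to see the batching idea outside of Designer, here is a rough Python sketch. It is not how the Tile or Append Cluster tools are implemented; it only assumes that labeling means assigning each record to its nearest cluster centre, and the function names (`tiles`, `nearest_centroid`, `append_clusters_in_batches`) are made up for illustration.

```python
import math
from typing import Iterator, List, Sequence

Point = Sequence[float]


def tiles(n_rows: int, n_tiles: int) -> Iterator[range]:
    """Split row indices into at most n_tiles contiguous batches (hypothetical helper)."""
    size = max(1, math.ceil(n_rows / n_tiles))
    for start in range(0, n_rows, size):
        yield range(start, min(start + size, n_rows))


def nearest_centroid(point: Point, centroids: Sequence[Point]) -> int:
    """Return the index of the centroid closest to point (Euclidean distance)."""
    return min(range(len(centroids)), key=lambda k: math.dist(point, centroids[k]))


def append_clusters_in_batches(
    rows: Sequence[Point], centroids: Sequence[Point], n_tiles: int
) -> List[int]:
    """Label rows one batch at a time, so only one batch is being scored at once."""
    labels: List[int] = []
    for batch in tiles(len(rows), n_tiles):
        # Assumption: in a real workflow this is where each batch would be
        # passed through the labeling step on its own.
        labels.extend(nearest_centroid(rows[i], centroids) for i in batch)
    return labels


rows = [(0.0, 0.0), (0.1, 0.2), (5.0, 5.0), (4.9, 5.2), (0.2, 0.1)]
centroids = [(0.0, 0.0), (5.0, 5.0)]
labels = append_clusters_in_batches(rows, centroids, n_tiles=2)
```

The point of the sketch is the shape of the loop: the cluster centres are fixed once, and each batch is labeled independently, so the batch count can be raised until a single batch fits in memory.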
I would also suggest that the user consider the K-Centroids Diagnostics tool, which outputs a 'K-Means Cluster Assessment Report'. Per the tool mastery article, "The K-Centroids Diagnostics Tool provides information to assist in determining how many clusters to specify," which would likely benefit the user as well.
Sources:
https://stat.ethz.ch/R-manual/R-devel/library/base/html/Memory-limits.html