Alteryx Designer Desktop Discussions

bouhayao · ‎03-12-2024

Hi everyone,

I am looking for a way to check whether any of the strings in one column of a dataset (125 distinct entries, so 125 strings to search for) appears anywhere in a second dataset (a multi-row, multi-column search: about 70k rows and 40+ columns). When a match is found, I would like to return information about the row in which it was found (the second dataset has a unique identifier).

TL;DR: I am looking for a way to find any of the 125 strings held in a single column of Dataset_1 within Dataset_2 (70k rows, 40+ columns).

Thanks for any help you can give!

gawa · ‎03-12-2024

hi @bouhayao

I can propose a kind of brute force approach:

1) Transpose Dataset_2 (the 70k-row one) into a vertical shape, using the unique ID as the key column.

2) Append Dataset_1 (the one with 125 records) to every transposed record.

3) Filter the records with Contains([Value],[Find]).

The Append Fields tool would produce 70k x 40 x 125 = 350 million records, so the AMP engine should be ON to process the data faster.
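For readers who want to see the logic outside Alteryx, here is a minimal pandas sketch of the same transpose / append / filter idea. The frames `records` and `search_terms`, the column names `ID`, `ColA`, `ColB`, and the helper `find_matches` are all made up for illustration; only `Value` and `Find` echo the field names used in the Contains expression. Note that Python's `in` test is case-sensitive, so lower-case both sides first if case should be ignored.

```python
import pandas as pd

# Hypothetical stand-ins for the two datasets in the thread.
search_terms = pd.DataFrame({"Find": ["apple", "kiwi"]})
records = pd.DataFrame({
    "ID": [1, 2, 3],
    "ColA": ["green apple pie", "banana", "pear"],
    "ColB": ["none", "kiwi juice", "plum"],
})


def find_matches(records, search_terms, id_col="ID"):
    """Return every (ID, column, value, search string) combination that matches."""
    # Step 1: transpose to a vertical shape, one row per (ID, column) cell.
    long = records.melt(id_vars=id_col, var_name="Name", value_name="Value")
    long = long.dropna(subset=["Value"])
    long["Value"] = long["Value"].astype(str)

    # Step 2: append every search string to every cell (a cross join).
    # The row count is cells x search strings, so this grows very quickly.
    pairs = long.merge(search_terms, how="cross")

    # Step 3: keep the rows whose value contains the search string literally.
    mask = [find in value for value, find in zip(pairs["Value"], pairs["Find"])]
    return pairs.loc[mask].reset_index(drop=True)


matches = find_matches(records, search_terms)
print(matches)
```

As in the workflow, the cross join is the expensive part: memory and run time scale with the number of cells multiplied by the number of search strings.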

bouhayao · ‎03-13-2024

Hello Gawa,

Thanks for the solution. Sadly, while the first list of strings I have to search for has 125 entries (doable), there is another with 19K+ entries haha...

Regards,
