Let’s talk Alteryx Copilot. Join the live AMA event to connect with the Alteryx team, ask questions, and hear how others are exploring what Copilot can do. Have Copilot questions? Ask here!
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Find string from one value in a column of different values in a dataset

bouhayao
5 - Atom

Hi everyone,

 

I am looking for a way to find if one of the strings contained into one column in a dataset (125 different entries in that said column, so 125 different strings to be searched) is contained into another data set (multi row/column search, like 70k rows with 40+ columns) and if found to return an info about the row in which it was found (a unique identifier is present in the second dataset).

 

TLDR : looking for a way to find one of 125 strings contained in a single column in Dataset_1 in a Dataset_2 with 70k row/40+ columns 

 

Thanks for any help you can give!

2 REPLIES 2
gawa
16 - Nebula
16 - Nebula

hi @bouhayao 

I can propose a kind of brute force approach:

1) Transpose Dataset1 into vertical shape (key column shall be unique ID if present)

2) Append Dataset 2 (that has 125 records)

3) Filter the record by Contains([Value],[Find])

# of records from Append Fields tool would be 70k x 40 x 125 = 350 millions. AMP engine should be ON to process data faster.

 

 

image.png

bouhayao
5 - Atom

Hello Gawa,

 

Thanks for the solution, sadly while the first dataset I have to use is 125 (doable), there is another with 19K+ entries haha...

 

Regards,

Labels
Top Solution Authors