Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Get unique values from multiple files without joining

45179902
8 - Asteroid

I have two types of files in the same directory that I will use for sending out letters. These two types of files have ELS and ILS in their filenames.

 

Before sending out letters, I must make sure that there're no duplicate records in these files. I must find out unique values from these files without joining them together because the two types of files have different layouts yet they must go through the same workflow to output letters.

 

Below are examples of the ELS and ILS files:

 

ELS_group_1.xlsx

CODEFirst_NameLast_NameProfession
10238JohnWhiteTeacher
12049KellyKrenDean
17253Hsiu-wenChenStudent

 

ILS_group_1.xlsx

CODECareersLastNameFirstName
39880StudentLandsbergAlan
10238TeacherWhiteJohn
15042ArtistDanielsLindsey

 

Suppose that before I have these files go through the flows that output letters, I want to ensure that there're no duplicate records in these files, distinguished by CODEs (note 10238 in both files). There could be 20 files in the directory with different numbers of ELS and ILS files. What can I do?

5 REPLIES 5
apathetichell
20 - Arcturus

As a "you cannot use Join" challenge - here's a way. Summarize the codes in one list to create a list of codes. Append the list to the other the other list. filter with [ELSE_CODE] in [concat_ILS_code].

 

You've now found which ELS entries aren't in ILS - but I mean, I'd just use Join.

45179902
8 - Asteroid

@apathetichell I guess you can, but how to join so many files together?

ArtApa
Alteryx
Alteryx

@45179902 - You can use "*" or "?" masks in your Input Data tool. The easiest way to achieve that replace filename with ILS_*.xlsx and ELS_*.xlsx and then simply use a Union tool. 

saveeshkumar
9 - Comet

Hi @45179902 ,

Please find this logic to get unique records and output.

 

step 1

saveeshkumar_0-1642741352122.png

 

step 2

With this output join to the xlsx file to get output.

saveeshkumar_1-1642741776095.png

 

 Simple batch to read xlsx file .

saveeshkumar_2-1642741925097.png

Hope this help you.

 

 

 

apathetichell
20 - Arcturus

oh. interesting. Forgot the 20 file part. union is kind of the way to go. You can union and then summarize/count to see what's repeated. so what i'd do is create a field to signify what the potential workflow is and then do a groupby/count by code and see which are 2 or more.

Labels
Top Solution Authors