Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

How to compare a csv file and excel file to identify duplicates?

Sunithaprdp
8 - Asteroid

I am new to Alteryx. I wanted compare a csv file with 136 characters with an excel worksheet to identify lines with common identifier. The output should be similar to the csv input file. But it should contain only the details of common identifiers. Please help. I am attaching samples.

As per the sample : there is a common identifier between csv SAMPLE and excel sample that is '00012-0123-45 with name ABCDE. So the output should be the eliminated csv file with no second line that is:

QTY0001205789622021STUVW00000001.23400000000000123.0000000000009999.99000000020000000002379.670000000000000.000000000002345.78.

The text file should only contain first line of SAMPLEs.

67 REPLIES 67
Sunithaprdp
8 - Asteroid

Likewise we would have 3 Programs or more.. wherein which respective excel files would be compared with multiple text files with different labelers for a specific program to get desired output for that program

binuacs
21 - Polaris

@Sunithaprdp using one program we can make read all the input text files and the excel files then do the compare based on the requirements. This can be achieved using some other tools. The program names can be extracted from the file name. All these options available in the directory tool. If you provide some test files and  the expected  output files I can help you on that 

Sunithaprdp
8 - Asteroid

Hi,

 

I am attaching few samples. Please consider this as two programs likewise we would get multiple programs. So what I am looking onto is that : There would be an 'INPUT' folder which consists of two subfolders wherein which in one subfolder we can copy all desired excel worksheets and to the other subfolder we can copy all text files.

So in short 2 subfolders named : Worksheet and Text (For example)

Then we will have one 'OUTPUT' folder wherein which we would receive desired output files with respective program names.

 

Samples attached :

Considering as 2 different programs AB PROGRAM & CD PROGRAM with 2 unique program numbers and one specific worksheet to compare

 

Wanted to get desired output for each program , each program number like 1Q21 AB PROGRAM FINAL_12345/ 1Q21 AB PROGRAM FINAL 56798/ 1Q21 CD PROGRAM FINAL _12345/ 1Q21 CD PROGRAM FINAL _56798

Sunithaprdp
8 - Asteroid

Hi,

 

I am attaching few samples. Please consider this as two programs likewise we would get multiple programs. So what I am looking onto is that : There would be an 'INPUT' folder which consists of two subfolders wherein which in one subfolder we can copy all desired excel worksheets and to the other subfolder we can copy all text files.

So in short 2 subfolders named : Worksheet and Text (For example)

Then we will have one 'OUTPUT' folder wherein which we would receive desired output files with respective program names.

 

Samples attached :

Considering as 2 different programs AB PROGRAM & CD PROGRAM with 2 unique program numbers and one specific worksheet to compare

 

Wanted to get desired output for each program , each program number like 1Q21 AB PROGRAM FINAL_12345/ 1Q21 AB PROGRAM FINAL 56798/ 1Q21 CD PROGRAM FINAL _12345/ 1Q21 CD PROGRAM FINAL _56798

Sunithaprdp
8 - Asteroid

Below is the attached pending file of a test program.

 

Could you please help me to figure out on this.

 

Thanks in advance!

Sunithaprdp
8 - Asteroid

Please let me know if any further test samples are required.

 

Looking forward to your support. Thanks in advance!

binuacs
21 - Polaris

@Sunithaprdp sure, I will work on the workflow and will let you know if there are any questions

Sunithaprdp
8 - Asteroid

Thank you so much!

binuacs
21 - Polaris

@Sunithaprdp Are you expecting 4 output files based on the file name?

binuacs
21 - Polaris

@Sunithaprdp Steps to follow before executing the workflow

 

1. update your input folder/CSV file folder in the first directory tool

binuacs_1-1642598209304.png

2. update your xlsx input path in the second directory tool

binuacs_2-1642598261088.png

3. for the output please mention your ourtput folder path in the formula tool

 

binuacs_3-1642598405712.png

 

 

Im expecting some differences in the output. Please review and let me know what are the issues, will fix one by one

 

 

Labels
Top Solution Authors