Hi, can someone help with this question? I have a file as attached and want to split it by full name or Chinese name, ensuring that there are no duplicate contents in the full name or Chinese name field within each output. Actually, I have no idea how many times a value will be duplicated before I get the real file, so I guess I may need a macro to solve this problem, but I have no idea how to set up the macro. I attached the input and the wanted output (the output should be 5 Excel files). Thanks.
input:
outputs:
You could use the Unique tool in Alteryx to remove the duplicates; just select the columns you want to be unique. If this helps, feel free to like the comment and select it as the solution.
@LeiCheng 
I have a few questions about your output.
1. What are the duplicate criteria?
You said "ensure that there are no duplicate contents in each field of full name or Chinese name. " but looking at Excel #1, the Chinese name is the same for all 3 records.
So do you actually mean "ensure that no record duplicates another in both full name and Chinese name"?
2. What are the grouping criteria?
I can see that ID#1 does not duplicate any other record if we look at both full name and Chinese name.
So can the grouping actually vary? Does it not need to be the same as your sample output?
I need more details on the logic behind your current sample grouping.
Hi @Qiu , Thank you very much for your rigorous thinking.
1. What are the duplicate criteria? I mean that I need to make sure there is no duplicated information in any one of the outputs, by full name or (not and) Chinese name. That is:
By full name, the output should be two Excel files: Excel 1 and 2; by Chinese name, there should be three files: Excel 3, 4 and 5.
I need to avoid having people with the same name in the same table, so that I can use personal names as unique values to match other information (such as non-unique addresses) without missing any suspected locations of people with the same name.
Address file:

| address | name |
| --- | --- |
| A | 楊志勤 |
| B | 楊志勤 |
| C | 楊志勤 |

Output:

| name | full name (surname + "/" + first name) | address |
| --- | --- | --- |
| 楊志勤 | Yang/ZhiQin | A |
| 楊志勤 | Young/ZhiQin | A |
| 楊志勤 | Yang/Welsome ZhiQin | A |
| 楊志勤 | Yang/ZhiQin | B |
| 楊志勤 | Young/ZhiQin | B |
| 楊志勤 | Yang/Welsome ZhiQin | B |
| 楊志勤 | Yang/ZhiQin | C |
| 楊志勤 | Young/ZhiQin | C |
| 楊志勤 | Yang/Welsome ZhiQin | C |
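
For readers who want to reproduce the address example outside Alteryx, here is a minimal pandas sketch of the same match. pandas is my own assumption (the thread itself only uses Alteryx), and the frame and column names (`addresses`, `people`, `full_name`) are made up for illustration. It shows that matching on a non-unique name pairs every person with every address, which gives the nine output rows listed above.

```python
import pandas as pd

# Address file from the post: three addresses recorded under one Chinese name.
addresses = pd.DataFrame({
    "address": ["A", "B", "C"],
    "name": ["楊志勤"] * 3,
})

# Name table (hypothetical layout): three full names sharing that Chinese name.
people = pd.DataFrame({
    "name": ["楊志勤"] * 3,
    "full_name": ["Yang/ZhiQin", "Young/ZhiQin", "Yang/Welsome ZhiQin"],
})

# An inner join on a non-unique key yields every person/address combination.
joined = addresses.merge(people, on="name")[["name", "full_name", "address"]]
print(len(joined))  # 9 rows: 3 full names x 3 addresses
```
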
2. What are the grouping criteria?
Two parallel criteria are required, namely the full name and the Chinese name.
@LeiCheng 
Thank you for your clarifications.
I now understand that the data stream will be separated into two, based on full name or (not and) Chinese name, and then unioned back together.
I am using the Tile tool and am able to get the same result as your sample output. Please take a look.
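
The attached workflow is not reproduced in the thread, so the following is only a rough pandas sketch of the idea as I understand it, not the actual Alteryx solution. It assumes the split works by numbering each record within its name group (first occurrence, second occurrence, and so on) and then writing one output per occurrence number. The sample data, the column names, and the helper `split_unique` are all hypothetical.

```python
import pandas as pd

# Hypothetical sample data; the real input attachment is not available here.
df = pd.DataFrame({
    "ID": [1, 2, 3, 4, 5],
    "full_name": ["Yang/ZhiQin", "Young/ZhiQin", "Yang/ZhiQin", "Li/Ming", "Li/Ming"],
    "chinese_name": ["楊志勤", "楊志勤", "楊志勤", "李明", "黎明"],
})

def split_unique(frame, key):
    """Split frame into pieces in which each value of `key` appears at most once."""
    # Occurrence number of each row within its key group: 0, 1, 2, ...
    seq = frame.groupby(key).cumcount()
    # One piece per occurrence number, in ascending order.
    return [piece for _, piece in frame.groupby(seq)]

by_full_name = split_unique(df, "full_name")        # 2 pieces for this sample
by_chinese_name = split_unique(df, "chinese_name")  # 3 pieces for this sample
```

With this approach the number of pieces equals the largest duplicate count in the key column, so it does not need to be known in advance. Each piece could then be written out with `DataFrame.to_excel`.
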