Problem Reading Fixed Width Unicode

Garrett
11 - Bolide

I'm getting some unexpected results trying to read in a fixed width TXT file that contains Unicode characters. When Alteryx reads in columns with Unicode characters, the field length is no longer "fixed" and the values seem to get shifted by a couple of positions.
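To illustrate the kind of shift described above, here is a minimal Python sketch. It assumes (this is not confirmed anywhere in the thread) that the shift comes from column widths being counted in bytes rather than characters, since a non-ASCII character can occupy more than one byte in UTF-8. The two-column layout and the sample line are hypothetical and are not taken from the attached file.

```python
# Hypothetical layout: "name" is 6 characters wide, "code" is 3 characters wide.
line = "Zoë   001"

# Slicing by character positions gives the expected columns.
name_by_chars = line[0:6]
code_by_chars = line[6:9]

# Slicing the UTF-8 bytes at the same offsets shifts the columns,
# because "ë" is encoded as 2 bytes.
raw = line.encode("utf-8")
name_by_bytes = raw[0:6].decode("utf-8")
code_by_bytes = raw[6:9].decode("utf-8")

print(repr(name_by_chars), repr(code_by_chars))
print(repr(name_by_bytes), repr(code_by_bytes))
```

In this sketch the byte-based slices come out one position short for every extra byte in the line, which resembles the symptom in the screenshots, though the actual cause inside the Input Data tool may differ.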

 

Expectation:

Garrett_2-1605727192445.png

 

Actual Result:

Garrett_1-1605727107230.png

 

I've got the code page set to "Unicode UTF-8".

 

I've also tried hacking the XML to force the field to be "V_WString" rather than "V_String" (because this isn't an option in the Fixed Width dialog), but this still doesn't resolve the issue.

Garrett_3-1605727582588.png

 

Sample data and workflow attached. Any solutions would be very much appreciated! 

3 REPLIES
Qiu
21 - Polaris

@Garrett 
Can you share your original file rather than the processed one?

Are you open to another parsing method, as long as it gives you the desired output?

Garrett
11 - Bolide

@Qiu - for this purpose test.txt is the "original file".

 

And I think I'm all set to use string parsing as a last-resort workaround. What I'm really looking for is a solution to get the initial Input Data tool configuration to work correctly. Thanks though!

 

 

AkimasaKajitani
17 - Castor

I made a workflow.

First, I read the text file as a file with only one field, then I split that field into columns with the RegEx tool.

 

 

When reading test.txt, you have to read it as a CSV file, not as a flat (fixed width) text file. The options are shown below.
It is important that Delimiters is set to \0, so that each line comes in as a single field.
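The same idea (read each line whole, then split it by character count with a regular expression) can be sketched outside Alteryx in Python. The pattern, the column widths, and the sample lines here are hypothetical; the actual expression used in the RegEx tool is only visible in the screenshot and the attached workflow.

```python
import re

# Hypothetical layout: a 6-character name followed by a 3-character code.
# "." matches one character, not one byte, so multi-byte characters
# do not shift the columns.
PATTERN = re.compile(r"^(.{6})(.{3})$")


def split_line(line):
    """Split one fixed-width line into trimmed fields, or return None on mismatch."""
    match = PATTERN.match(line)
    if match is None:
        return None
    return [field.rstrip() for field in match.groups()]


rows = [split_line(text) for text in ["Zoë   001", "Bob   002"]]
print(rows)
```

Because the groups count characters, a line containing "ë" is split at the same positions as a plain ASCII line.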

 

Input Data Setting:

AkimasaKajitani_1-1605794901162.png

 

RegEx tool Setting:

AkimasaKajitani_2-1605796081710.png

 

Result

AkimasaKajitani_0-1605794583720.png

 

Please refer to the attached workflow.
