ACT NOW: The Alteryx team will be retiring support for Community account recovery and Community email-change requests Early 2026. Make sure to check your account preferences in my.alteryx.com to make sure you have filled out your security questions. Learn more here
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
解決済み

Remove Non printable character from text file

Thinesh_t
メテオール

Dear Friends,

 

i used to work with the clients ERP data which i used to receive in text file format and at time the file will be in broken because of Non printable characters(non keyboard characters) , New line , Carriage return, white space.

 

I am finding difficult to handle that data cleanse work , i need your suggestion/ if any workflow to help me handle this.

 

Thanks for your support.

8件の返信8
Jonathan-Sherman
15 - Aurora
15 - Aurora

Hi @Thinesh_t,

 

Could you attach a (mock up) sample?

 

Regards,
Jonathan

Thinesh_t
メテオール

Please copy the below data after the commented line (delimiter is "#|#") and paste in txt file, as i couldn't upload due to some restrictions,

 

"�" non printable characters like this to be removed (line above 104)

Remove New line & Carriage return & White space/blanks

'--------------------------------------------------------------------------------------------------------------------------------------------

MANDT#|#MBLNR#|#MJAHR#|#ZEILE#|#PARENT_ID#|#BWART#|#XAUTO#|#MATNR#|#WERKS#|#LGORT#|#CHARG#|#INSMK#|#ZUSTD#|#SOBKZ#|
103#|#1111111113#|#2020#|#11111#|#000000#|#222#|##|#000000000010002222#|#XX02#|#YY01#|#X444444#|##|##|##|##|#1000111111#|##|#000000#|#0000#|#Z#|#EUR#|#11.21#|#11.55#|#0.00#|##|#5.000#|#BLS#|#5.000#|#CAR#|##|#00000#|##|##|#0000#|##|##|#0000#|##|#8315#|##|#2020#|#7929#|##|#000#|##|##|##|#L#|##|##|##|##|#0001364101#|#0000101401#|#00#|#000000000010002042#|#MA01#|#WA01#|#X#|#X#|#9074.000#|#136490.29#|#S
�#|#0.00#|##|#F
104#|#111111111�#|#2020#|#11111#|#000000#|#222#|##|#000000000010002222#|#XX02#|#YY01#|#X444444#|##|##|##|##|#1000111111#|##|#000000#|#0000#|#Z#|#EUR#|#11.21#|#11.55#|#0.00#|##|#5.000#|#BLS#|#5.000#|#CAR#|##|#00000#|##|##|#0000#|##|##|#0000#|##|#8315#|##|#2020#|#7929#|##|#000#|##|##|##|#L#|##|##|##|##|#0001364101#|#0000101401#|#00#|#000000000010002042#|#MA01#|#WA01#|#X#|#X#|#9074.000#|#136490.29#|#S#|#0.00#|##|#F

kirans666
アステロイド

Hi,

 

You can use the below to cleanse the data to remove the unnecessary data from the files. Please let me know if this helps or if you were looking for something else.  You can give this in a multi field formula to apply over all the fields post parsing with a delimiter. 

regex_Replace([field],"[^\w+]",'')

 

PhilipMannering
16 - Nebula
16 - Nebula

Or if you're trying to get rid of those question marks,

 

REGEX_Replace([Field1], '[^\x00-\x7f]', '')
Thinesh_t
メテオール

Thanks for your response but the solution you provided to handle the non printable characters is not working.

 

Thinesh_t_1-1588769893338.png

 

Can you try with the Data i provided in the previous post, copy and paste in a text file and create a solution if possible.

Thinesh_t
メテオール

Thanks for your response but the solution you provided to handle the non printable characters is not working. i tried using replace too but still its not handled.

 

Thinesh_t_1-1588769893338.png

 

Can you try with the Data i provided in the previous post, copy and paste in a text file and create a solution if possible.

PhilipMannering
16 - Nebula
16 - Nebula

Hi Thinesh_t

 

Take a look at the attached file. A few solutions in there...

 

 

Thinesh_t
メテオール

Thanks Philip,

 

It worked using the same you provided but i implemented in formula.

 

Thinesh_t_0-1589208566813.png

 

ラベル
トップのソリューション投稿者