Cast your vote for the official 2025 Inspire Pin! Designs were submitted by fellow Community members and reflect the creativity and passion of Alteryx users across the globe. Vote now!

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLUCIONADO

Remove Non printable character from text file

Thinesh_t
Meteoro

Dear Friends,

 

i used to work with the clients ERP data which i used to receive in text file format and at time the file will be in broken because of Non printable characters(non keyboard characters) , New line , Carriage return, white space.

 

I am finding difficult to handle that data cleanse work , i need your suggestion/ if any workflow to help me handle this.

 

Thanks for your support.

8 RESPOSTAS 8
Jonathan-Sherman
15 - Aurora
15 - Aurora

Hi @Thinesh_t,

 

Could you attach a (mock up) sample?

 

Regards,
Jonathan

Thinesh_t
Meteoro

Please copy the below data after the commented line (delimiter is "#|#") and paste in txt file, as i couldn't upload due to some restrictions,

 

"�" non printable characters like this to be removed (line above 104)

Remove New line & Carriage return & White space/blanks

'--------------------------------------------------------------------------------------------------------------------------------------------

MANDT#|#MBLNR#|#MJAHR#|#ZEILE#|#PARENT_ID#|#BWART#|#XAUTO#|#MATNR#|#WERKS#|#LGORT#|#CHARG#|#INSMK#|#ZUSTD#|#SOBKZ#|
103#|#1111111113#|#2020#|#11111#|#000000#|#222#|##|#000000000010002222#|#XX02#|#YY01#|#X444444#|##|##|##|##|#1000111111#|##|#000000#|#0000#|#Z#|#EUR#|#11.21#|#11.55#|#0.00#|##|#5.000#|#BLS#|#5.000#|#CAR#|##|#00000#|##|##|#0000#|##|##|#0000#|##|#8315#|##|#2020#|#7929#|##|#000#|##|##|##|#L#|##|##|##|##|#0001364101#|#0000101401#|#00#|#000000000010002042#|#MA01#|#WA01#|#X#|#X#|#9074.000#|#136490.29#|#S
�#|#0.00#|##|#F
104#|#111111111�#|#2020#|#11111#|#000000#|#222#|##|#000000000010002222#|#XX02#|#YY01#|#X444444#|##|##|##|##|#1000111111#|##|#000000#|#0000#|#Z#|#EUR#|#11.21#|#11.55#|#0.00#|##|#5.000#|#BLS#|#5.000#|#CAR#|##|#00000#|##|##|#0000#|##|##|#0000#|##|#8315#|##|#2020#|#7929#|##|#000#|##|##|##|#L#|##|##|##|##|#0001364101#|#0000101401#|#00#|#000000000010002042#|#MA01#|#WA01#|#X#|#X#|#9074.000#|#136490.29#|#S#|#0.00#|##|#F

kirans666
Asteroide

Hi,

 

You can use the below to cleanse the data to remove the unnecessary data from the files. Please let me know if this helps or if you were looking for something else.  You can give this in a multi field formula to apply over all the fields post parsing with a delimiter. 

regex_Replace([field],"[^\w+]",'')

 

PhilipMannering
16 - Nebula
16 - Nebula

Or if you're trying to get rid of those question marks,

 

REGEX_Replace([Field1], '[^\x00-\x7f]', '')
Thinesh_t
Meteoro

Thanks for your response but the solution you provided to handle the non printable characters is not working.

 

Thinesh_t_1-1588769893338.png

 

Can you try with the Data i provided in the previous post, copy and paste in a text file and create a solution if possible.

Thinesh_t
Meteoro

Thanks for your response but the solution you provided to handle the non printable characters is not working. i tried using replace too but still its not handled.

 

Thinesh_t_1-1588769893338.png

 

Can you try with the Data i provided in the previous post, copy and paste in a text file and create a solution if possible.

PhilipMannering
16 - Nebula
16 - Nebula

Hi Thinesh_t

 

Take a look at the attached file. A few solutions in there...

 

 

Thinesh_t
Meteoro

Thanks Philip,

 

It worked using the same you provided but i implemented in formula.

 

Thinesh_t_0-1589208566813.png

 

Rótulos
Autores com maior número de soluções