community
cancel
Showing results for 
Search instead for 
Did you mean: 

Alteryx designer Discussions

Find answers, ask questions, and share expertise about Alteryx Designer.
Upgrade Alteryx Designer in 10 Steps

Debating whether or not to upgrade to the latest version of Alteryx Designer?

LEARN MORE
SOLVED

Need help on a Parse Regex

Atom

Hello the community,

 

I have some troubles on a Parse Regex for a txt file. This file is a products info list and I just want to extract some columns.

 

- Example (or see the image or file "Data set") :Data set.png

 

- What I am looking for is underlined (this is just 1 line):

18495746201812PROBIOTICA DARMBALANS CAPS STRIP 30ST 200910 C01A 8711744029548 15527557ONBEKEND CAPSULE 000000000000N.V.T. STRIP 00003000STUK 00001500STUK VEMEDIA# SNEDERLAND 000000000022PS

 

- What I want as output is:

RegEx out 1RegEx out 2RegEx out 3RegEx out 4RegEx out 5RegEx out 6RegEx out 7RegEx out 8RegEx out 9RegEx out 10RegEx out 11
18495746201812PROBIOTICACAPS STRIP 30STVEMEDIA#SNEDERLAND000000000022PS

 

I succeeded to parse the first three parts with this expression (\d+)(\d{6})(\u*) and I get blocked to continue and to extract the rest.

(see my attached workflow "Test regex").

 

Can anyone help me please? Should I continue with RegEx or maybe is there another tool which could do the trick?

 

Yanopoff

Pulsar

This gets you a bit closer - just have to figure out a rule for CAPS STRIP 30ST

 

(\d+)(\d{6})(\u*).+\s(.+)\s(\w)(\w+)\s(\d{10})(\d)(\d)(\w{2})

Bolide

Hi,

 

why don't you use an input with a flat ascii file format to get the data you want, it seems to me that your data set.txt would be perfect to do so!

Pulsar

I like @Ladarthur 's idea, but I've not given up on regex...

 

Granted, I've made a few assumptions that you might have to tweak, but here you go:

 

(\d+)(\d{6})(\u*)\s\w+\s(.+)\s\d{6}\s.+\s(.+)\s(\w)(\w+)\s(\d{10})(\d)(\d)(\w{2})

Highlighted
Alteryx Certified Partner

Hello @Yanopoff maybe this help!

best regards..

 

Thanks

Atom

Hi @DavidP , @Ladarthur  and @carlosteixeira2005 ,

 

So great to have some help.

 

Thanks DavidP and carlosteixeira2005 for your regex. It took some time to try DavidP and carlosteixeira2005 solutions. Both were great and I learned a lot. However, it looks like the Regex cannot work for my file. From what I understand, it is because some products do not have all the columns filled (missing or nonexistent data). I tried modify them with the website regex101 but I failed!

 

I also tried Ladarthur solution. It worked just fine and you are right, it fit perfectly my file. Thanks a lot, I did not know about that extension and the way it works.

 

Thanks again everyone for your fast reply ;^)

Yanopoff

Labels