In case you missed the announcement: The Alteryx One Fall Release is here! Learn more about the new features and capabilities here
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Extract dates from free field text

suwenchuan
7 - Meteor

Hi Alteryx community!

 

I'm working on extracting dates from free-form text in user descriptions. Snippet below. I've attached an Excel file as well.

 

Output Sample.png

 

Here's what I've tried so far:

 

1. Creating regex patterns for the date formats:

- New data often includes date formats I haven't accounted for, which means some dates may be missed

- With the large volume of data, manually spotting dates that regex misses (and distinguishing them from text without dates) is challenging

2. Using the Named Entity Recognition tool: 

- The NER tool hasn't been tagging dates accurately, and I can't seem to find any additional settings to improve its precision

 

Any insights or alternative methods to automate this process would be greatly appreciated. Thank you in advance!

3 REPLIES 3
gawa
16 - Nebula
16 - Nebula

hi @suwenchuan 

I'm not sure this is the one you want or you already tried...but please try Regex tool with 'Tokenize' mode like this.

image.png

suwenchuan
7 - Meteor

Hi @gawa, thank you for your reply. I'd like to add a bit more context to my query. Basically, the raw data that I receive will only be column "Description". what I need to get is the "Desired Output",

gawa
16 - Nebula
16 - Nebula

@suwenchuan  OK...then it becomes very difficult problem...I need sometime to come up with solution.

Labels
Top Solution Authors