Start your journey with Alteryx Machine Learning - Take our Interactive Lesson today!

Alteryx Machine Learning Discussions

Find answers, ask questions, and share expertise about Alteryx Machine Learning.
Getting Started

Start your learning journey with Alteryx Machine Learning Interactive Lessons

Go to Lessons

NER (Named Entity Recognition) tool throwing error saying don't have enough data

Manojkumar
8 - Asteroid

Hi Team,

 

As shown below, i have more than 500 entities under label "Name", but when i ran it with NER tool , it is throwing the following error which doesn't make sense, please let me know if i am doing something wrong.

 

Error: " Named Entity Recognition (5) Some labels don’t have enough data points to train an accurate model. Each of these labels has less than 20 data points: ['NAME']."

 

Manojkumar_0-1663928329041.png

 

4 REPLIES 4
IraWatt
17 - Castor
17 - Castor

Hey @Manojkumar,

If you put a summarize tool before the NER tool and configure it to group by the Name column and do a count on Name column. In the output of the summarize tool does each record have more then 20 in count?

Manojkumar
8 - Asteroid

Hi Ira,

 

Thanks for your response, yes when summarise and count i have 565. please find the screenshot below.

 

Manojkumar_0-1663929356487.png

 

IraWatt
17 - Castor
17 - Castor

@Manojkumar can you group on and count with the 'Name' Column, you have grouped and counted on the 'Label' Column.

Manojkumar
8 - Asteroid

Hi Ira,

 

Sorry if i had confused you, 

 

NER entities = Column "Value"

NER label = Column "Label"

 

You can ignore the column "Name"

 

Thanks