Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Filter and Verify Large Batch of Email Addresses

MarieRendle
5 - Atom

How do I validate a large batch of email addresses (70,000)? Is there was a formula flow to help me do that? How do I filter out the data that seems fake (i.e. no @, spaces, etc.)? 

 

Thank you,

 

Marie

3 REPLIES 3
alexnajm
17 - Castor
17 - Castor

You can build logic that checks if the data contains certain things. For example, depending on your data, you may want to:

- confirm they all contain the @ symbol

- confirm that there is always a domain after the @ symbol (.com, .edu, etc)

 

All of these are statements that you can build in a Filter tool in some way! If you decide on some criteria, we can try to help from there - otherwise hope this helps

MarieRendle
5 - Atom

Thank you for reaching out.

 

I would like to make sure that the data:

 

  • Has only 1 @ symbol
  • Is not missing an @ symbol
  • Has at least 3 characters in the domain extension (e.g., .com,.org,.net)
  • Contains no whitespace
  • Has no consecutive dots in the domain part (e.g., example..com)
  • Does not contain domains with no dot (e.g., example@domain)
  • Lists only valid characters in the domain name
  • Restricts special characters to the local part only
  • Does not start or end with a hyphen (e.g., -example.com)
  • Remove invalid patterns used in spam or bots
alexnajm
17 - Castor
17 - Castor

You could probably figure out a way to create all these individually, but my suggest is RegEx_Match - filtering based on the values matching a pattern. Try something like this in your Filter: regex_match([Field1], "^\w+@\w+.\w{3}$")

 

Here's another one I found online: How to create regular expressions (Regex) for email address validation? (boldsign.com). Feel free to search for more - good luck!

Labels
Top Solution Authors