Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Masking Data for Security

rushabh_shah
8 - Asteroid

I want to mask certain fields in my data like email id and mobile number since it's sensitive data before sharing the same with anyone. Both the fields vary in length. 

Please help me with a formula

5 REPLIES 5
jdunkerley79
ACE Emeritus
ACE Emeritus

For masking of text files, probably the easiest thing to do is to use a Multi Field formula. Remember to untick the Copy Output Fields.

2016-08-09_09-43-04.jpg

 

A simple method is to just use the MD5 Hash of the input string. In order to avoid length issues you can truncate it to the length of the original string:

 

LEFT(MD5_UNICODE([_CurrentField_]), LENGTH([_CurrentField_]))

The benefit of this approach is the mapping will always produce the same output. This means keys and joins should still work.

 

 

Masking numbers can be done in a separate multifield formula by multiplying the input by a Random number. If you want to ensure that the mapping is consistent this is a harder process.

 

 

Bharath
8 - Asteroid

Hi,

 

Thanks for your post about masking. it works now but I have a quick question. actually I have my file in SQL server and its large in size so I am using the in - database function (see the attached file). so how to use this formula using the in-database function in Alteryx ?

 

Thanks,

VJ

 

 

 

 

 

 

 

Sailor
7 - Meteor

Hi

 

Thansk for a very good masking solution. It is very useful and would be even more useful if I could unmask the same data.

 

How can this be done?

 

Regards Mats

StephenR
Alteryx
Alteryx

@SailorIf you could unmask the data, that would defeat the purpose of masking it.  Keep a source version that is private and a masked version that can be shared.

Regards,
Stephen Ruhl
Principal Customer Support Engineer

Namshad
7 - Meteor

Hi,

 

Instead of MD5_UNICODE , which all the other encryption keys available.

Can you please share the function to be used in multi field formula like MD5_UNICODE([_CurrentField_])  for a few of the below hash types.

 

SHA-1

SHA-256

Whirlpool

xxHash

MurmurHash
SpookyHash
CityHash

Checksum
CRC-16
CRC-32
CRC-32 MPEG-2
CRC-64

BLAKE-256
BLAKE-512
BLAKE2s
BLAKE2b
ECOH
FSB
GOST
Grøstl
HAS-160
HAVAL
JH
MD2
MD4
MD6
RadioGatún
RIPEMD
RIPEMD-128
RIPEMD-160
RIPEMD-320
SHA-224
SHA-256
SHA-384
SHA-512
SHA-3 (originally known as Keccak)
Skein
Snefru
Spectral Hash
Streebog
SWIFFT
Tiger

Labels