Weekly Challenge

Solve the challenge, share your solution and summit the ranks of our Community!
IDEAS WANTED

We're actively looking for ideas on how to improve Weekly Challenges and would love to hear what you think!

Submit Feedback
We've recently made an accessibility improvement to the community and therefore posts without any content are no longer allowed. Please use the spoiler feature or add a short message in the message body in order to submit your weekly challenge.

Challenge #205: Taynalysis

Highlighted
8 - Asteroid
 
Highlighted
5 - Atom

At first, I was having trouble with getting my unique vs. duplicate values to tie...see spoiler.

Spoiler
Spoiler
Using the Unique tool before any data cleansing (to remove punctuation, stop words) got me to my solution!

AI-vs.-TS.png

 

Highlighted
8 - Asteroid

Hi! Here my solution

Highlighted
8 - Asteroid

Hard to shake off the punctuation and symbol discrepancies depending how you approach the problem.  It was also interesting to play with different ways of tokenizing what a 'word' is.  I.e. is everything after a space (\s) a word? Does Regex (\<\w+\>) Identify a word best? Overall very close to the solution depending on tweaks in cleansing or parsing.

 

Spoiler
Zenon_0-1583423598631.png

 

Highlighted
8 - Asteroid

It's not 100 percents accuracy. Hihi

Spoiler
Capture.PNGCapture.PNG
Highlighted
5 - Atom
 
Highlighted
8 - Asteroid

Uniqueness was definitely different. But ended up with the same top ten words. I feel like my cleaning was more robust which ended up in less unique and more duplicate words.

Highlighted
5 - Atom

Not getting the exact numbers but pretty close though...

 

Attached is my approach

Highlighted
6 - Meteoroid

Here is my two cents

Highlighted
Alteryx Partner

Got there eventually on which characters to exclude:

 

Spoiler
bstroh_0-1583465577417.png