Weekly Challenges

Dani_Lin · ‎06-08-2023

solved

ed_hayter · ‎06-09-2023

Slightly different output but what I have was reading very well so happy to submit.

AHill06 · ‎06-14-2023

So here is my version, could do a group by i guess on the challenge numbers to see the number of comments, but unable to work out if positive or negative comment as dont have those features but pretty happy with the data cleansing done.

tristank · ‎06-22-2023

Like many others my numbers weren't exact but I was actually pretty happy with my output (validated with some random sampling). I used a lot of regex tools and sampling to keep testing different cases. In hindsight once I realized the text was wrapped in P's I could've likely eliminated everything else around it. By getting rid of empty rows and nulls my row count went down by a lot.

Spoiler

Eliot_Rich · ‎06-27-2023

Hello.

The REGEX shown in the proposed solution is instructive, thank you.

However, the Part 1 solution record counts do not tie out.

On the left the record count after the regex and filter for BODY and before the summation is16325.

On the right the input record coming into the file has 16319.

Where did the six records go?

As a number of authors note that they did not "quite" tie out, might the admins review the post?

Thanks.

caltang · ‎06-27-2023

Done. You can get 16,325 by changing the empty text of the field [body] into something other than empty/null(), then the concatenate will pick it up as its own piece whilst the regex (/n) will pick it up as well.

This ensures 16,325 in, and 16,325 out. I've tweaked the solution given by Alteryx to better fit this criteria - makes more sense now.

Spoiler