The solution results are not very good... there are numbers, underscores, words stuck together...
I spent most of my time cleaning up my output. Very happy with the results. It removes all punctuation, numbers, spaces, and nulls without damaging the actual words (e.g. turning sister-in-law into sisterinlaw) and counts only the main body text, and not the front/end fluff.
Proud of this one!