Data Science

MeganBowers · ‎07-19-2022

Introduction

Do you ever feel “happy, free, confused and lonely at the same time?” Well, Taylor Swift does. (Or she did, at 22.)

There’s a lot of emotion in this song—some positive, some negative, and some neutral. And this got me thinking: can I determine the overall mood of her albums with data? With text mining and sentiment analysis, it turns out I can. And since Taylor Swift has written songs since she was 16, I can also look at how her songwriting sentiment has changed.

…Are you ready for it?

Data Pre-processing

I downloaded the lyrics data from Github. The data was structured with one record per lyric, labeled with the song and album. Additionally, each record has the previous and next lyrics in the song.

Screen Shot 2022-07-15 at 10.12.34 AM.png

I brought this dataset into Alteryx Designer with an Input tool. I filtered out the singles and EPs in the data (since I wanted to look at the whole album sentiment) and joined in data on the album release dates.

The Text Pre-processing tool (in the Text Mining tab of the Intelligence Suite tools) took care of the data prep needed to generate word clouds, which I will discuss in more detail in the Visuals section. I selected English as the language and the Lyric field as the text to be processed.

Additionally, I filtered out stop words so the graphic would be informative and not show how often filler words like “and” occur in the lyrics. I learned from the tool documentation that the list of default stop words could be found here on Github. I added a few more stop words after seeing the “oh”s and “ah”s were included in the word clouds even after filtering out the default stop words.

Screen Shot 2022-07-15 at 10.13.24 AM.png

Note: The true Taylor Swift fans out there know that recently Taylor has re-recorded and released some of her old songs as “Taylor’s Version” (so she owns the song rights). For this analysis, I only count one version of a given song since they have the same lyrics, and the release date is based on the original release, as that is when the song was first written.

Line by Line Sentiment Analysis

Since the data records were structured with one lyric per record, for my first analysis, I determined the sentiment of each lyric and then aggregated that score to get an average for each album.

The Sentiment Analysis tool documentation indicated that it is best not to use the Text Pre-processing before sentiment analysis as it can remove some features that the tool uses to detect sentiment. So, I used the unprocessed data and configured the Sentiment Analysis tool like this:

Screen Shot 2022-07-15 at 10.14.17 AM.png

Since the data is already at a “sentence” level, I left the sentence level option unchecked. Also, I wanted the sentiment score, not the category, to aggregate the outputs. So I left the second option unchecked as well.

Ok, so let’s look at the immediate output here:

Screen Shot 2022-07-15 at 10.16.56 AM.png

You can see the sentiment score for each line broken out into negative, neutral, and positive. For each category, the score ranges from 0 to 1, with 0 being the least like that sentiment and 1 being the most. For example, the first record above scored mostly neutral with some positive sentiment. The compound score shows the overall sentiment of the field: -1 is the most negative score, 0 is the most neutral score, and 1 is the most positive score.

What caught my attention immediately here is that lyrics that are classified as neutral, like “can’t remember” or “to be double-crossed,” feel negative when put into the context of the rest of the verse: “Guess I’m feeling unmoored, can’t remember what I used to fight for. I rewind the tape, but all it does is pause on the very moment all was lost; sending signals to be double-crossed.” That feels pretty sad, but if we aggregate the compound sentiment scores for this song ("evermore"), it comes out to -0.11, indicating it is only slightly negative.

This is a lesson in sentiment analysis—sometimes you may want to analyze the sentence level, and other times you want to look at the body of text as a whole. You must determine which makes the most sense for your use case. And sometimes, it isn’t until you look at the results and compare them to your expectations that you realize a different analysis method is needed.

Given these observations, I decided to try analyzing at the song level next.

Song by Song Sentiment Analysis

I used a summarize tool to concatenate the lyrics for each song together. I then ran the field with the entire song text through the sentiment analysis tool to get the compound sentiment scores.

Screen Shot 2022-07-15 at 10.17.46 AM.png

Results

The sentiment analysis at the song level gave a much more extensive range in the compound sentiment scores than the line level analysis. The summary statistics below are from the line-by-line analysis, aggregated at the album level: the sentiment ranges from .000032 (essentially neutral) to .082 (still very neutral).

Screen Shot 2022-07-15 at 10.19.29 AM.png

This next set of statistics is from the song-by-song analysis, aggregated at the album level: the sentiment ranges from 0.13 to 0.71. Since the max sentiment score is 1, the song level analysis covers a broader range of the possible sentiment values.

Screen Shot 2022-07-15 at 10.20.51 AM.png

Are you curious about the most and least positive albums? I was. Here are Taylor Swift’s full albums, ranked from most positive to most negative:

Screen Shot 2022-07-15 at 10.22.48 AM.png

As an avid Taylor Swift listener, I was impressed with the ability of the text mining tools to detect the emotion in her songs correctly. I would categorize Lover, Fearless, Red, and 1989 as generally upbeat and positive. Reputation has a more angry tone, but it's not entirely negative. And the last two albums, folklore and evermore, have a sad and contemplative vibe throughout (along with being more acoustic and slow).

Some of her songs are poetic and contain figures of speech, double entendres, and other language subtleties. My hypothesis for the accurate sentiment scores is that the sentiment analysis tool is powerful and that Taylor Swift has enough songs on each album (and a consistent theme for each album) so that minor errors in language detection are inconsequential when aggregating scores for the whole album.

Outliers

Some notable song-level outliers were “Shake It Off” and "Sad, Beautiful, Tragic."

Taylor’s hit song “Shake It Off” scored a -0.999, a.k.a. as negative as it gets. I can see why negativity is detected here because she sings about lots of examples of people hating on her, and then her response is to “shake it off.” The instruments/composition sets the tone when you listen—it is very upbeat and pop-y. A total dance bop. But Alteryx can’t know this, and with the negative lyrics and the only (seemingly) positive sentiment being “just shake it off,” I can see why it got a negative score.

"Sad, Beautiful, Tragic" scored a 0.997—very positive—which surprised me, given the song's title and tone. The chorus mixes positive and negative emotions, with the lyric "What a sad, beautiful, tragic love there" repeated many times. It also has strictly positive lines, like "We had a beautiful magic love there." Perhaps the repetition of words such as beautiful, love, magic, and dreams throughout the song led the algorithm astray.

It's also worth noting that the sentiment analysis tool classified "This Is Why We Can't Have Nice Things" as positive, failing to pick up on the sarcastic tone of the entire song.

Visuals

Time for some fun data visuals—word clouds!

To create my first visual, I used the data coming out of the Text Pre-processing tool and filtered it down to the most positive album: Lover. Then, I pulled in a simple image of a music note with the image input to add a shape to the word cloud.

Here is the set-up of this portion of the workflow:

Screen Shot 2022-07-15 at 10.48.01 AM.png

And below is the configuration of the word cloud tool. I customized the word cloud by selecting a color scheme and choosing the image I input as the mask (to customize the shape).

Screen Shot 2022-07-15 at 10.48.58 AM.png

Below is the resulting world cloud, which can help with data validation. It allows me to see the most commonly used words in the Lover album, which gives me insight into why it was classified with positive sentiment.

Screen Shot 2022-07-15 at 10.50.04 AM.png

I repeated the same process as above with the lyrics from the most negative album—evermore. (I also chose a different color scheme.) Here is the resulting word cloud:

Screen Shot 2022-07-15 at 10.50.48 AM.png

These visuals provide some insight into the sentiment scores for these albums, but they also remove all context, so they might not be the most helpful visuals for data like song lyrics.

I made one final chart using the Interactive Chart tool to look at how Taylor’s average album sentiment has changed over time.

Screen Shot 2022-07-15 at 10.52.11 AM.png

The graph shows that Taylor’s songs have gotten less positive over time... 2020 hit Taylor hard too, apparently.

Conclusion

This exercise showed me how powerful the text mining tools are with little configuration and how accurate the results can be. I made playlists for the top 20 most positive and negative songs—give them a listen to see how the sentiment analysis tool did!

I attached my workflow to this post if you want to try out a similar analysis. And, if you do not have the Intelligence Suite tools, you can download a free trial here to start playing around with text mining.

sstevens33 · ‎07-29-2022

As a fellow Swiftie, this makes me very happy :)