Sentiment of Security Now! over time
If you believe some people, everything is getting worse [1], and even more so in infosec. Over the past few years I have listened to many, many hours of podcasts, and a lot of those hours were spent on the weekly show Security Now!. The hosts, Steve Gibson and Leo Laporte, have been talking about security-related news every week for over 13 years. The content has changed over time (there used to be more explanations, while most of the time is now filled with news), but we can still use the sentiment in the episodes to see if ‘everything is getting worse’. Has the sentiment of the Security Now! podcast changed over time? It helps that every episode is transcribed into text, so we can use natural language processing tools to work through this problem.
Extracting the data
To see how I gathered and extracted the relevant information from the transcripts, I kindly point you to a separate GitHub page where I explain how I downloaded every episode and extracted the structure.
I asked permission to scrape all the transcripts, but I'm not entirely sure whether I can share the content.
What I ended up with is a data frame with 664 rows (the number of episodes as of today) and 9 columns.
The text column contains a tibble with a row for every time someone speaks until the other host takes over. The length
Steve talks a lot more than Leo, and we can see that in the number of words per line of a single episode:
Interestingly, my scraper does not seem to have detected who said the words on line 45. It was Steve.
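The words-per-line count described above can be sketched in a few lines of base R. This is a minimal illustration, not the code used for this post: the column names (`line`, `speaker`, `text`) and the sentences are invented, and a plain data frame stands in for the nested tibble of a single episode. A missing speaker is included to mimic the unattributed line.

```r
# Toy stand-in for one episode's text table: one row per speaker turn.
# Column names and sentences are hypothetical, not taken from the transcripts.
episode <- data.frame(
  line    = 1:5,
  speaker = c("Leo", "Steve", "Leo", NA, "Leo"),
  text    = c(
    "Time for the show.",
    "This week we have a lot of security news to cover, so let us get started.",
    "Sounds good.",
    "There were three big stories and one follow-up from last week.",
    "Wow."
  ),
  stringsAsFactors = FALSE
)

# Count whitespace-separated words in each turn.
episode$n_words <- lengths(strsplit(trimws(episode$text), "\\s+"))

# Lines where no speaker was detected (the scraper's blind spots).
unattributed <- episode$line[is.na(episode$speaker)]

# Total words per speaker; tapply() silently drops turns with an NA speaker.
words_per_speaker <- tapply(episode$n_words, episode$speaker, sum)

print(episode[, c("line", "speaker", "n_words")])
print(unattributed)
print(words_per_speaker)
```

Checking for `NA` speakers before aggregating is worthwhile, because grouped summaries drop those turns without warning and would undercount whoever actually said them.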
State of the machine
At the moment of creation (when I knitted this document), this was the state of my machine:
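One way to capture such a snapshot is base R's `sessionInfo()`. That this is the call behind the report is an assumption; the exact function used when knitting is not shown.

```r
# Record the R version, platform, locale, and attached packages.
# Assumption: base R's sessionInfo(); the post may have used another tool.
info <- sessionInfo()
print(info)
```

Including this output at the end of a knitted document makes it easier to reproduce the analysis later, since package versions change over time.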
