In my last article I showed an analysis of 617 movie scripts, identifying the most said words in those movies and also the trending of positive and negative words. That was done using different data sets, which means I had to do some data cleaning and blending. Today I’ll show you exactly what I did to clean and prepare the final data set using Pentaho Data Integration, a.k.a. Kettle.
The impact of words. So strong that even if they’re not directed to us, they can change how we feel. Movies bring a lot of emotions to us spectators, with all the sceneries, the action, the history, the characters — but what about the impact of the words in movies? I did a small analysis and I found out some interesting things.