Perform Sentiment Analysis with SciKit-Learn

Performing Sentiment Analysis on a Popular IMDB Dataset Using SciKit-Learn

Screen Shot 2020-10-01 at 11 01 15 AM

Screen Shot 2020-10-01 at 11 02 41 AM

KEY CONCEPTS:

Build and employ a logistic regression classifier using scikit-learn

Clean and pre-process text data

Perform feature extraction with nltk

Tune model hyperparameters and evaluate model accuracy

PROJECT PURPOSE:

In this project-based course from Coursera Project Network, I learned the fundamentals of sentiment analysis, and built a logistic regression model that could classify movie reviews as either positive or negative. The popular IMDB data set was used for this project. The goal was to use a simple logistic regression estimator from SciKit-Learn for document classification.