IMDB Movie Reviews - Text preprocessing and classification. Includes BOW model, TF_IDF, VADER entiment analysis, Topic Modelling using Latent Dirichlet Allocation and Word Embeddings. (Python)
This project contains a dataset comprising of information about Hollywood movies released between 1990 and 2019 and it was collected using a polite webscraper. Please keep in mind that IMDB doesn't permit the usage of its data for commercial purposes and this project was solely made for educational purpose.
It analyses the movie review entered by a user for any specific movie and analyses what is the sentiment of the review. It helps the companies rate the movie and understand crowd sentiment regarding it. Sentiment analysis is a natural language processing problem where text is understood and the underlying intent is predicted.
Deploying a Sentiment Analysis Model on Amazon Sagemaker which consists of deploying a Sentiment Analysis model using Recurrent Neural Networks in the Amazon AWS SageMaker tool. The notebook and Python files provided here result in a simple web application which interacts with a deployed recurrent neural network performing sentiment analysis on movie reviews. In the final architecture AWS API Gateway and AWS Lambda functions is used as well.