BERT is a state-of-the-art natural language processing model from Google. Using its latent space, it can be repurposed for various NLP tasks, such as sentiment analysis.
I used Hugging Face Transformers and PyTorch; the task is predicting the positivity/negativity of IMDB reviews.
First, you need to prepare the IMDB data, which is publicly available. The format used here is one review per line, with the first 12500 lines being positive, followed by 12500 negative lines. Positive is encoded as 0 and negative as 1.
You can download data and weights (in the correct format) directly from my drive link here.
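As an illustration, here is a minimal sketch of reading the data in the format described above into review/label lists. The filename `imdb_reviews.txt` is a placeholder for wherever you saved the downloaded file.

```python
def load_imdb_reviews(path):
    """Read one review per line; first 12500 are positive (0), rest negative (1)."""
    with open(path, encoding="utf-8") as f:
        reviews = [line.strip() for line in f]
    # Labels follow the file layout described above.
    labels = [0] * 12500 + [1] * 12500
    return reviews, labels

# Placeholder path; point this at your downloaded file.
reviews, labels = load_imdb_reviews("imdb_reviews.txt")
```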
I used three models (a minimal loading sketch for the two Hugging Face variants follows this list):
- BertForSequenceClassification (Hugging Face)
- BertModel (Hugging Face)
- PyTorch pretrained BERT (not from Hugging Face)
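A sketch of loading the two Hugging Face variants, assuming the standard `bert-base-uncased` pretrained checkpoint (the exact checkpoint and settings in my experiments may differ):

```python
from transformers import BertTokenizer, BertForSequenceClassification, BertModel

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

# Variant 1: BERT with a ready-made classification head (2 labels: positive/negative).
clf_model = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2
)

# Variant 2: the bare encoder; you attach your own classifier on top of its output.
base_model = BertModel.from_pretrained("bert-base-uncased")
```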
- BertForSequenceClassification:
| | precision | recall | f1-score | support |
|---|---|---|---|---|
| 0.0 | 0.90 | 0.93 | 0.91 | 12500 |
| 1.0 | 0.93 | 0.90 | 0.91 | 12500 |
| accuracy | | | 0.91 | 25000 |
| macro avg | 0.91 | 0.91 | 0.91 | 25000 |
| weighted avg | 0.91 | 0.91 | 0.91 | 25000 |
Accuracy achieved: 91%
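For reference, the table above has the layout of scikit-learn's `classification_report`, so given ground-truth labels and model predictions you can reproduce it like this (`y_true` and `y_pred` below are toy placeholders):

```python
from sklearn.metrics import classification_report

y_true = [0, 0, 1, 1]   # ground-truth labels (0 = positive, 1 = negative)
y_pred = [0, 1, 1, 1]   # model predictions
print(classification_report(y_true, y_pred))
```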
- After optimization experiments, the base BertModel does better, with an accuracy of 93% (see the classifier-head sketch below).
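Since BertModel is only the bare encoder, classification requires attaching your own head. Below is a minimal sketch of one common way to do it (pooled [CLS] output into a dropout plus linear layer); this is an illustrative assumption, not necessarily the exact head used in my experiments.

```python
import torch.nn as nn
from transformers import BertModel

class BertClassifier(nn.Module):
    def __init__(self, num_labels=2):
        super().__init__()
        self.bert = BertModel.from_pretrained("bert-base-uncased")
        self.dropout = nn.Dropout(0.1)
        self.head = nn.Linear(self.bert.config.hidden_size, num_labels)

    def forward(self, input_ids, attention_mask):
        outputs = self.bert(input_ids=input_ids, attention_mask=attention_mask)
        # pooler_output is the [CLS] representation (transformers v4+ dict-style output).
        pooled = outputs.pooler_output
        return self.head(self.dropout(pooled))
```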
I will optimize the hyperparameters further later, to get as close to the state of the art (SOTA) as possible.
You can view the optimization experiments here.
The code has been uploaded both as a notebook and as a .py file.
Note: for the .py file, make sure transformers is installed (`pip install transformers`) and set the correct paths on lines 76 and 227.
Code with the base BertModel can be found here.
Useful comments and links to tutorials are provided inside the notebook to guide you through the code.