Skip to content

Created a model which predicts a probability of each type of toxicity for each comment.

Notifications You must be signed in to change notification settings

Sanskrutiii/Toxic-comments-classification-using-NLP

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

Toxic-comments-classification-using-NLP

Created a model which predicts a probability of each type of toxicity for each comment.

Dataset - https://www.kaggle.com/competitions/jigsaw-toxic-comment-classification-challenge/data

Toxic comment classification- The dataset has 312735 comments. Out of these the training set has 1,59,571 comments while the training set has 1,53,164 comments. These comments are classified into 6 toxic behaviours. The classes are “Toxic, Severe toxic, Obscene, Threat, Insult, Identity hate”.

Results : I've used 2 classification algorithm- Multinomial Naive Bayes model with 54.003% and Logistic regression with 96% accuracy. We use the logistic regression model because of the better performance metric.

About

Created a model which predicts a probability of each type of toxicity for each comment.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published