Skip to content

Kaggle Competition: Toxic Comment Classification Challenge - Identify and classify toxic online comments

Notifications You must be signed in to change notification settings

iamkrut/Toxic-Comment-Classification

Repository files navigation

Downloading the dataset:

Here's how to run all the three models

For Logistic Regression

  • run python3 Log_reg/log_regression.py

For LSTM

  • Download crawl-300d-2M.vec and glove.840B.300d.txt. Put them in data folder
  • run python3 LSTM/LSTM.py

For TextCNN

  • Download crawl-300d-2M.vec.zip and extract it in data folder
  • run python3 TextCNN/textCNN.py

NOTE: If something doesn't work just clone the project directory from https://csil-git1.cs.surrey.sfu.ca/krutp/nlpclass-1197-g-lexchunkers/tree/master/project. Word embedding still would have to be downloaded separately. NOTE: Report file project.ipynb contains images so Images folder needs to be downloaded from gitlab

Checking the output files: The output.zip contains all the submission predictions generated by the three models. They should be submitted to Kaggle for evaluation

About

Kaggle Competition: Toxic Comment Classification Challenge - Identify and classify toxic online comments

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published