The objective was to rate the severity of toxic social media comments and classify them as toxic or non-toxic across more than seven languages. My approach combined multilingual transformer modeling with Kaggle's TPU support and targeted strategies for the central challenge of cross-lingual toxicity detection: labeled training data was available only in English, while the test data spanned many languages.
Data Collection:
- I used the English training data from Jigsaw's previous competitions, which included comments labeled with varying degrees of toxicity.
- The test data comprised Wikipedia talk page comments in multiple languages; a loading sketch follows.
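For concreteness, here is a minimal loading sketch. The directory and file names are assumptions based on the Kaggle competition's published data layout; adjust them if your copy of the data differs.

```python
import pandas as pd

# Assumed Kaggle data layout for this competition; adjust paths as needed.
DATA_DIR = "/kaggle/input/jigsaw-multilingual-toxic-comment-classification"

# English-only training labels carried over from the earlier Jigsaw competitions.
train = pd.read_csv(f"{DATA_DIR}/jigsaw-toxic-comment-train.csv")
# Multilingual validation and test sets of Wikipedia talk page comments.
valid = pd.read_csv(f"{DATA_DIR}/validation.csv")
test = pd.read_csv(f"{DATA_DIR}/test.csv")

print(train.shape, valid["lang"].value_counts().to_dict())
```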
Text Cleaning and Tokenization:
- Implemented text preprocessing steps to normalize special characters, punctuation, and emojis.
- Used language-specific tokenizers so that the nuances of each language were preserved during tokenization (a tokenization sketch follows).
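One way to implement this step is a light normalization pass followed by a shared multilingual subword tokenizer, whose SentencePiece vocabulary covers all target languages. A minimal sketch, where the `clean_text` helper and the 192-token limit are illustrative choices rather than the exact pipeline:

```python
import re

import emoji  # pip install emoji
from transformers import AutoTokenizer

def clean_text(text: str) -> str:
    """Light normalization: spell out emojis, drop URLs, collapse whitespace."""
    text = emoji.demojize(text)                # "🙂" -> ":slightly_smiling_face:"
    text = re.sub(r"https?://\S+", " ", text)  # remove URLs
    return re.sub(r"\s+", " ", text).strip()

# XLM-RoBERTa's SentencePiece tokenizer shares one vocabulary across ~100
# languages, so accented and non-Latin characters survive tokenization.
tokenizer = AutoTokenizer.from_pretrained("xlm-roberta-base")

encoded = tokenizer(
    [clean_text(t) for t in ["This is rude! 😡", "C'est très impoli !"]],
    padding="max_length", truncation=True, max_length=192, return_tensors="np",
)
```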
Multilingual Transformer Models:
- Leveraged state-of-the-art multilingual models such as BERT, XLM-RoBERTa, and T5, which are designed to handle multiple languages.
- Fine-tuned these models on the English training data, using transfer learning so the models could recognize toxicity in other languages (see the fine-tuning sketch below).
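Continuing the sketch above, fine-tuning might look like the following with the Keras API. The `comment_text`/`toxic` column names and the hyperparameters are assumptions, not the exact configuration used.

```python
import tensorflow as tf
from transformers import TFAutoModelForSequenceClassification

# One logit per comment; BinaryCrossentropy(from_logits=True) applies the sigmoid.
model = TFAutoModelForSequenceClassification.from_pretrained(
    "xlm-roberta-base", num_labels=1)

# Tokenize the English training comments (tokenizer from the sketch above).
enc = tokenizer(train["comment_text"].tolist(), padding="max_length",
                truncation=True, max_length=192, return_tensors="np")

model.compile(
    optimizer=tf.keras.optimizers.Adam(learning_rate=1e-5),
    loss=tf.keras.losses.BinaryCrossentropy(from_logits=True),
    metrics=[tf.keras.metrics.AUC(from_logits=True)],
)
# English-only supervision: cross-lingual transfer comes from the shared
# multilingual pretraining, not from labeled data in the other languages.
model.fit(dict(enc), train["toxic"].values, epochs=2, batch_size=64)
```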
Few-Shot and Zero-Shot Learning:
- Explored few-shot and zero-shot learning techniques to improve performance on languages with little or no labeled training data.
- Implemented strategies such as meta-learning to strengthen the model's ability to generalize across languages (a zero-shot evaluation sketch follows).
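In the zero-shot setting, the English-fine-tuned model is scored directly on languages it never saw labels for. A per-language evaluation sketch, reusing `model`, `tokenizer`, and `valid` from the sketches above:

```python
import numpy as np
from sklearn.metrics import roc_auc_score

# Per-language ROC-AUC measures cross-lingual transfer directly: the model
# received no labels in these languages during fine-tuning.
for lang, subset in valid.groupby("lang"):
    enc = tokenizer(subset["comment_text"].tolist(), padding=True,
                    truncation=True, max_length=192, return_tensors="np")
    logits = model.predict(dict(enc), verbose=0).logits.ravel()
    probs = 1.0 / (1.0 + np.exp(-logits))  # sigmoid over the logits
    print(f"{lang}: AUC = {roc_auc_score(subset['toxic'], probs):.4f}")
```

A few-shot variant would continue fine-tuning on a handful of labeled comments per target language before scoring.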
Model Architecture:
- Developed multi-headed models that share one encoder between the main toxicity-detection task and sub-tasks such as classifying toxicity types (e.g., obscene, threat, insult); see the sketch below.
- Employed ensemble learning, combining predictions from multiple models for more robust performance.
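A multi-headed architecture can be expressed as a shared encoder with one sigmoid head per label. The sketch below uses the Keras functional API; the head names and sequence length are illustrative assumptions.

```python
import tensorflow as tf
from transformers import TFAutoModel

HEADS = ["toxic", "obscene", "threat", "insult"]  # main task + sub-tasks
SEQ_LEN = 192

# One shared multilingual encoder feeds every classification head.
encoder = TFAutoModel.from_pretrained("xlm-roberta-base")
input_ids = tf.keras.Input(shape=(SEQ_LEN,), dtype=tf.int32, name="input_ids")
attention_mask = tf.keras.Input(shape=(SEQ_LEN,), dtype=tf.int32,
                                name="attention_mask")

hidden = encoder(input_ids=input_ids,
                 attention_mask=attention_mask).last_hidden_state
pooled = hidden[:, 0, :]  # representation of the first (<s>) token

outputs = {h: tf.keras.layers.Dense(1, activation="sigmoid", name=h)(pooled)
           for h in HEADS}

multi_head = tf.keras.Model([input_ids, attention_mask], outputs)
multi_head.compile(optimizer=tf.keras.optimizers.Adam(1e-5),
                   loss={h: "binary_crossentropy" for h in HEADS})
```

For the ensemble, the simplest combiner is to average each model's predicted probabilities, e.g. `np.mean([m.predict(x) for m in models], axis=0)`, optionally with per-model weights.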
TPU Utilization:
- Took advantage of Kaggle's TPU support for faster training and fine-tuning of large transformer models (bootstrap sketch below).
- Optimized the training pipelines to handle large-scale data efficiently, reducing training time.
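Kaggle's TPU runtime follows the standard TensorFlow bootstrap: detect the TPU, initialize it, and build the model inside a `TPUStrategy` scope. A minimal sketch:

```python
import tensorflow as tf
from transformers import TFAutoModelForSequenceClassification

try:
    # Detect and initialize the TPU attached to the Kaggle kernel.
    resolver = tf.distribute.cluster_resolver.TPUClusterResolver()
    tf.config.experimental_connect_to_cluster(resolver)
    tf.tpu.experimental.initialize_tpu_system(resolver)
    strategy = tf.distribute.TPUStrategy(resolver)
except ValueError:
    strategy = tf.distribute.get_strategy()  # CPU/GPU fallback

print("Replicas:", strategy.num_replicas_in_sync)  # 8 on a v3-8 TPU

# Variables created inside the scope are replicated across all TPU cores.
with strategy.scope():
    model = TFAutoModelForSequenceClassification.from_pretrained(
        "xlm-roberta-base", num_labels=1)
    model.compile(optimizer=tf.keras.optimizers.Adam(1e-5),
                  loss=tf.keras.losses.BinaryCrossentropy(from_logits=True))

# Scale the global batch size with the replica count to keep the cores busy.
batch_size = 16 * strategy.num_replicas_in_sync
```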
Hyperparameter Tuning:
- Conducted extensive hyperparameter tuning with grid search and Bayesian optimization to identify the best-performing configurations (see the Optuna sketch below).
- Focused on the learning rate, batch size, and number of training epochs.
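For the Bayesian side of the search, a library such as Optuna (TPE sampler by default) keeps the loop short. In this sketch, `build_model`, `evaluate_auc`, `train_ds`, and `valid_ds` are hypothetical helpers standing in for the actual training and validation code, and the search ranges are illustrative.

```python
import optuna  # pip install optuna

def objective(trial: optuna.Trial) -> float:
    # Search space mirrors the parameters highlighted above.
    lr = trial.suggest_float("learning_rate", 1e-6, 5e-5, log=True)
    batch_size = trial.suggest_categorical("batch_size", [16, 32, 64])
    epochs = trial.suggest_int("epochs", 1, 4)

    model = build_model(learning_rate=lr)   # hypothetical model factory
    model.fit(train_ds.batch(batch_size), epochs=epochs, verbose=0)
    return evaluate_auc(model, valid_ds)    # hypothetical validation helper

study = optuna.create_study(direction="maximize")
study.optimize(objective, n_trials=20)
print(study.best_params)
```

Grid search is the degenerate case: swap in `optuna.samplers.GridSampler`, or a plain nested loop over the same ranges.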
Performance Metrics:
- Evaluated model performance with F1-score, precision, recall, and ROC-AUC for a balanced assessment of toxicity detection (metric helper sketched below).
- Conducted cross-validation to confirm that performance held up across different data splits.
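A small helper makes the metric set explicit; here `X` and `y` stand in for the featurized comments and binary labels. Note that ROC-AUC is threshold-free, while F1, precision, and recall depend on the chosen decision threshold.

```python
import numpy as np
from sklearn.metrics import f1_score, precision_score, recall_score, roc_auc_score
from sklearn.model_selection import StratifiedKFold

def report(y_true: np.ndarray, y_prob: np.ndarray, threshold: float = 0.5) -> dict:
    y_pred = (y_prob >= threshold).astype(int)
    return {
        "f1": f1_score(y_true, y_pred),
        "precision": precision_score(y_true, y_pred),
        "recall": recall_score(y_true, y_pred),
        "roc_auc": roc_auc_score(y_true, y_prob),  # uses raw scores, no threshold
    }

# Stratified folds keep the toxic/non-toxic ratio stable in every split.
skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)
for fold, (tr_idx, va_idx) in enumerate(skf.split(X, y)):
    ...  # train on X[tr_idx], then print(fold, report(y[va_idx], val_probs))
```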
Bias Detection and Mitigation:
- Implemented techniques to detect and mitigate unintended bias in toxicity classification, e.g., comparing per-subgroup AUC against overall AUC (sketched below).
- Used fairness-aware learning approaches so the model behaved consistently across diverse conversation contexts.
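One standard diagnostic, borrowed from Jigsaw's earlier Unintended Bias competition, compares the AUC restricted to comments mentioning an identity subgroup against the overall AUC; a large gap flags unintended bias. The sketch assumes a DataFrame `df` with identity columns, a `toxic` label, and model scores in `pred`.

```python
from sklearn.metrics import roc_auc_score

def subgroup_auc(df, subgroup: str, label="toxic", score="pred") -> float:
    """AUC over only the comments that mention one identity subgroup."""
    mask = df[subgroup] > 0.5
    return roc_auc_score(df.loc[mask, label] > 0.5, df.loc[mask, score])

overall = roc_auc_score(df["toxic"] > 0.5, df["pred"])
for group in ["female", "muslim", "black"]:  # identity columns in the bias data
    gap = overall - subgroup_auc(df, group)
    print(f"{group}: subgroup AUC gap = {gap:+.4f}")
```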
Real-Time Inference:
- Used Jigsaw's Perspective API to serve toxicity predictions in real time (a query sketch follows).
- Implemented efficient inference pipelines to handle large volumes of comments with quick, accurate toxicity detection.
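Perspective API is Jigsaw's hosted endpoint for real-time toxicity scoring. A minimal query sketch using the official Google API client; `API_KEY` is a placeholder for your own key.

```python
from googleapiclient import discovery  # pip install google-api-python-client

API_KEY = "YOUR_PERSPECTIVE_API_KEY"  # placeholder; enable the Comment Analyzer API

client = discovery.build(
    "commentanalyzer", "v1alpha1",
    developerKey=API_KEY,
    discoveryServiceUrl="https://commentanalyzer.googleapis.com/$discovery/rest?version=v1alpha1",
    static_discovery=False,
)

request = {
    "comment": {"text": "You are an idiot."},
    "requestedAttributes": {"TOXICITY": {}},
    # Omit "languages" to let the API auto-detect the comment's language.
    "languages": ["en"],
}
response = client.comments().analyze(body=request).execute()
print(response["attributeScores"]["TOXICITY"]["summaryScore"]["value"])  # 0..1
```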
Multilingual Support:
- Ensured that the deployed models supported toxicity detection across all target languages.
- Continuously monitored and evaluated per-language performance to drive iterative improvements.
Results:

| # | Model Name | Accuracy (%) |
|---|---|---|
| 1 | RoBERTa | 90.49 |
| 2 | ELECTRA | 89.90 |
| 3 | ALBERT | 88.80 |
| 4 | BERT | 88.46 |
| 5 | DistilBERT | 87.41 |
| 6 | XLM-RoBERTa | 85.83 |
| 7 | Embeddings + Conv | 85.00 |
| 8 | Simple Embeddings | 65.68 |
By combining multilingual transformer modeling, few-shot and zero-shot transfer, and TPU-accelerated training, my approach aimed to produce a toxicity classification system that is both robust and fair. It achieved strong performance in detecting toxic comments while serving the broader goal of fostering healthier, more collaborative online conversations across languages.
Team:
- Kayvan Shah
- Abneet Wats
- Rahil Merchant