Text Classification with BERT

Finetuned BERT model on a custom dataset containing tweets to classify their sentiment. Each tweet can be classified into 8 sentiments and can have more than one categories

For e.g a tweet can be both 'joyous' and 'optimistic'

Dataset

Download link: https://huggingface.co/datasets/sem_eval_2018_task_1

A subset of this dataset containing only English was selected. It has more than 50,000 pairs of text and corresponding label already splitted into train and test part.

Training:

--BERT uncased model is used for the training
--First normal training was done on the pre-trained model for 5 epochs
--Then PEFT was done using Low Rank Adaptation(LoRA) approach which gave better results.

Result:

--Fine tuned model was evaluated using classification metrics Accuracy,F1,ROC-AUC curve as the task is classification

Challenges:

--Due to infrastructural challenges model could be trained for 5 epochs only which can be increased significantly.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
LICENSE		LICENSE
README.md		README.md
Text_Classification_with_BERT.ipynb		Text_Classification_with_BERT.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Text Classification with BERT

Dataset

Training:

Result:

Challenges:

About

Releases

Packages

Languages

License

arka57/Text-Classification-with-BERT

Folders and files

Latest commit

History

Repository files navigation

Text Classification with BERT

Dataset

Training:

Result:

Challenges:

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages