Skip to content

Honghui-Du/AI_genertaed_text_detection_Debert

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AI-Generated Text Detection with DeBERTa-Xlarge

This repository contains code of a course project for fine-tuning a DeBERTa-Xlarge model to detect AI-generated text. The model is trained using all active layers and 5 cross-validation approach for performance evaluation.

Model Overview

  • Model: DeBERTa-Xlarge
  • Cross-Validation: 5-fold cross-validation is employed to assess model performance and generalization capability.

Dataset

The dataset used for fine-tuning the model consists of a combination of several publicly available datasets, as well as generated samples:

  1. Persaude Corpus 2
  2. LLM Detect AI Generated Text Competition Dataset, where the data lodaers were also referred.
  3. DAIGT V4 Train Dataset
  4. In addition, data samples were also generated using LLaMA and GPT-2 models.

The final merged datasets can be downloaded in https://file.io/Wz3JI0DVhXF1

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages