AI-Generated Text Detection with DeBERTa-Xlarge

This repository contains the code for a course project that fine-tunes a DeBERTa-Xlarge model to detect AI-generated text. All layers are kept trainable during fine-tuning (none are frozen), and performance is evaluated with 5-fold cross-validation.

Model Overview

Model: DeBERTa-Xlarge
Cross-Validation: 5-fold cross-validation is employed to assess model performance and generalization capability.
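
The 5-fold procedure can be illustrated with a minimal, standard-library-only sketch. It is not the project's training code: the stratified splitting, the function name stratified_kfold, the seed, and the toy labels (0 = human-written, 1 = AI-generated) are all assumptions made for illustration, since the README does not say how the folds are built.

```python
import random
from collections import defaultdict


def stratified_kfold(labels, n_splits=5, seed=42):
    """Yield (train_idx, val_idx) pairs with similar label ratios in each fold.

    Hypothetical helper for illustration; stratification is an assumption.
    """
    rng = random.Random(seed)

    # Group sample indices by their label.
    by_label = defaultdict(list)
    for idx, label in enumerate(labels):
        by_label[label].append(idx)

    # Deal each label's shuffled indices round-robin across the folds,
    # so every fold receives a similar share of each class.
    folds = [[] for _ in range(n_splits)]
    for indices in by_label.values():
        rng.shuffle(indices)
        for pos, idx in enumerate(indices):
            folds[pos % n_splits].append(idx)

    # Each fold serves once as the validation set; the rest form the training set.
    for k in range(n_splits):
        val_idx = sorted(folds[k])
        train_idx = sorted(
            i for j, fold in enumerate(folds) if j != k for i in fold
        )
        yield train_idx, val_idx


# Toy labels (assumed convention): 0 = human-written, 1 = AI-generated.
labels = [0] * 10 + [1] * 10
splits = list(stratified_kfold(labels))

for fold, (train_idx, val_idx) in enumerate(splits):
    # In a real run, the model would be fine-tuned on train_idx
    # and scored on val_idx at this point.
    print(f"fold {fold}: {len(train_idx)} train / {len(val_idx)} validation")
```

Every sample lands in exactly one validation fold, so averaging the five validation scores gives an estimate of how the model generalizes to unseen text.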

Dataset

The dataset used for fine-tuning combines several publicly available datasets with additionally generated samples:

Persuade Corpus 2
LLM Detect AI Generated Text Competition Dataset, whose data loaders were also used as a reference
DAIGT V4 Train Dataset

In addition, data samples were generated using LLaMA and GPT-2 models.

The final merged dataset can be downloaded from https://file.io/Wz3JI0DVhXF1
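
As a rough illustration of how several sources might be combined into one training set, the sketch below concatenates (text, label) records and drops empty or duplicate texts. The function merge_datasets, the whitespace normalization, the deduplication step, the label convention (0 = human-written, 1 = AI-generated), and the toy records are all hypothetical; the README does not describe the actual merging procedure.

```python
def merge_datasets(*sources):
    """Concatenate (text, label) records, skipping empty and duplicate texts.

    Hypothetical helper for illustration only.
    """
    seen = set()
    merged = []
    for source in sources:
        for text, label in source:
            # Collapse runs of whitespace so near-identical copies compare equal.
            key = " ".join(text.split())
            if not key or key in seen:
                continue
            seen.add(key)
            merged.append({"text": key, "label": label})
    return merged


# Toy records standing in for the public corpora and the generated samples.
public_corpus = [("Essay one.", 0), ("Essay  two.", 0)]
generated_samples = [("Essay two.", 0), ("Model output.", 1), ("   ", 1)]

merged = merge_datasets(public_corpus, generated_samples)
for row in merged:
    print(row["label"], row["text"])
```

Deduplicating before splitting into folds helps keep the same essay from appearing in both the training and validation sets, which would otherwise inflate the validation score.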

Repository: Honghui-Du/AI_genertaed_text_detection_Debert
