This repository hosts the implementation and datasets for two main components: GAVEL and CNBN. CNBN serves as the baseline for our comparisons, while GAVEL is our newly proposed system designed to improve upon the baseline in specific tasks.
-
CNBN (Baseline):
gh_action_dataset
: Includes datasets likeall.csv
,code_block.csv
, and results from multiple 5-fold cross-validation setups.tools
: Contains scripts such asdataLoader.py
,main.py
, andpredictor.py
to facilitate data handling and predictions.
-
GAVEL (Proposed System):
CONFS
: Configuration files (C1.csv
toC5.csv
) for setting up experimental conditions.RESULTS
: Contains experimental outputs likeFold_1_Results.csv
, utilities for converting classification reports to LaTeX (classification_report2latex.py
), and statistical analysis tools (stats.py
).TOOLS
: Includes scripts likegavel.py
which are central to the GAVEL system.