python3 shuffle_dataset.py --dataset 'xsum' --orig_source_length 512 --max_target_length 36 --seed 0
python3 src/extraction.py --approach 'head+tail0.5'
python3 src/extraction.py --approach 'head+tail0.5' --shuffle True -seed 0
To combine the extracted document into train_set.csv, val_set.csv and test.csv
python3 helper/helper.py --approach 'head+tail0.5'
python3 helper/helper.py --approach 'head+tail0.5' --shuffle True -seed 0