git clone [email protected]:HEP-KBFI/ml-tau-en-reg.git --recursive
If, however, you forgot to pull the submodules using ```--recursive``, you need to initialize the submodules as such:
git submodule update --init
The latest merged ntuples for training are here:
$ du -csh /scratch/persistent/joosep/ml-tau/20240701_lowered_ptcut_merged/*
474M /scratch/persistent/joosep/ml-tau/20240701_lowered_ptcut_merged/qq_test.parquet
1.9G /scratch/persistent/joosep/ml-tau/20240701_lowered_ptcut_merged/qq_train.parquet
112M /scratch/persistent/joosep/ml-tau/20240701_lowered_ptcut_merged/zh_test.parquet
445M /scratch/persistent/joosep/ml-tau/20240701_lowered_ptcut_merged/zh_train.parquet
97M /scratch/persistent/joosep/ml-tau/20240701_lowered_ptcut_merged/z_test.parquet
386M /scratch/persistent/joosep/ml-tau/20240701_lowered_ptcut_merged/z_train.parquet
$ du -csh /local/joosep/ml-tau-en-reg/results/240626_train_on_z/*/*/*
7.9M /local/joosep/ml-tau-en-reg/results/240626_train_on_z/v1/dm_multiclass/LorentzNet
11M /local/joosep/ml-tau-en-reg/results/240626_train_on_z/v1/dm_multiclass/ParticleTransformer
7.5M /local/joosep/ml-tau-en-reg/results/240626_train_on_z/v1/dm_multiclass/SimpleDNN
9.2M /local/joosep/ml-tau-en-reg/results/240626_train_on_z/v1/jet_regression/LorentzNet
12M /local/joosep/ml-tau-en-reg/results/240626_train_on_z/v1/jet_regression/ParticleTransformer
8.3M /local/joosep/ml-tau-en-reg/results/240626_train_on_z/v1/jet_regression/SimpleDNN
7.9M /local/joosep/ml-tau-en-reg/results/240626_train_on_z/v2/dm_multiclass/LorentzNet
11M /local/joosep/ml-tau-en-reg/results/240626_train_on_z/v2/dm_multiclass/ParticleTransformer
7.5M /local/joosep/ml-tau-en-reg/results/240626_train_on_z/v2/dm_multiclass/SimpleDNN
9.2M /local/joosep/ml-tau-en-reg/results/240626_train_on_z/v2/jet_regression/LorentzNet
12M /local/joosep/ml-tau-en-reg/results/240626_train_on_z/v2/jet_regression/ParticleTransformer
8.4M /local/joosep/ml-tau-en-reg/results/240626_train_on_z/v2/jet_regression/SimpleDNN
8.0M /local/joosep/ml-tau-en-reg/results/240626_train_on_z/v3/dm_multiclass/LorentzNet
11M /local/joosep/ml-tau-en-reg/results/240626_train_on_z/v3/dm_multiclass/ParticleTransformer
7.4M /local/joosep/ml-tau-en-reg/results/240626_train_on_z/v3/dm_multiclass/SimpleDNN
9.3M /local/joosep/ml-tau-en-reg/results/240626_train_on_z/v3/jet_regression/LorentzNet
12M /local/joosep/ml-tau-en-reg/results/240626_train_on_z/v3/jet_regression/ParticleTransformer
8.6M /local/joosep/ml-tau-en-reg/results/240626_train_on_z/v3/jet_regression/SimpleDNN
All the necessary packages are installed to the singularity image used in the run.sh
script.
In order to run the scripts do:
./run.sh python3 [XYZ]
To test locally on a fraction of the data
./run.sh python3 enreg/scripts/trainModel.py output_dir=training-outputs/mytest fraction_train=0.1 fraction_valid=0.1 training.num_epochs=2 model_type=SimpleDNN training_type=jet_regression
./run.sh python3 enreg/scripts/trainModel.py output_dir=training-outputs/mytest fraction_train=0.1 fraction_valid=0.1 training.num_epochs=2 model_type=SimpleDNN training_type=dm_multiclass
The configuration the models starts at enreg/config/model_training.yaml
.
To submit the training of the models to gpu0
, check and run
./enreg/scripts/submit-pytorch-gpu-all.sh
For comet-ml logging one needs to create an account at https://www.comet.com/ and get the API key.
comet login
This will prompt you to enter the API key and will create a file in ~/.comet.config
, where you should add the workspace name and project, such that the config file will look as such:
[comet]
api_key = YOUR_API_KEY
project_name = YOUR_PROJECT
workspace = YOUR_WORKSPACE
Change enreg/config/benchmarking.yaml
and enreg/config/metrics/regression.yaml
as needed.
./run.sh python3 enreg/scripts/calculate_regression_metrics.py
and
notebooks/DM_CM.ipynb
notebooks/losses.ipynb
./run.sh jupyter notebook --no-browser
Edit sim/run_sim.sh
to change the output directory, then generate 100 events as follows:
cd sim
./run_sim.sh 1 p8_ee_ZH_Htautau_ecm380.cmd
To produce the jet-based ML ntuples from the .root and .hepmc files
./run.sh python3 enreg/scripts/ntupelize_edm4hep.py
bash /home/joosep/tmp/NA7VJ17OVH/executables/execute0.sh