transformer-omr-spellchecker

Experiments in training a transformer network to identify "errors" in symbolic music using a small corpus of training data. See the paper "A Transformer-based 'Spellchecker' for Detecting Errors in OMR Output," Timothy de Reuse and Ichiro Fujinaga, in ISMIR 2022 for a detailed description of our data augmentation method.

Setup

Requires Python >= 3.8 and the following packages:

numpy>=1.22.0
h5py>=3.0.0
scikit-learn>=1.0.0
torch>=1.10
numba>=0.56.0
pytorch-fast-transformers==0.2.2
music21>=7.0.0
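
To see which of these packages are already present in an environment, a small check along the following lines may help. This is only a sketch and not part of the repository: the helper name installed_versions is hypothetical, and the list uses scikit-learn, the distribution name that provides the sklearn module.

```python
from importlib.metadata import PackageNotFoundError, version

# Distribution names taken from the requirements list above.
# "scikit-learn" is the distribution that provides the sklearn module.
REQUIRED = [
    "numpy",
    "h5py",
    "scikit-learn",
    "torch",
    "numba",
    "pytorch-fast-transformers",
    "music21",
]


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if absent."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = None
    return found


for name, ver in installed_versions(REQUIRED).items():
    print(f"{name}: {ver if ver is not None else 'not installed'}")
```

The check only reports what is installed; it does not compare against the minimum versions listed above.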

Preparing Data And Training

Data must be processed from .krn or .musicxml files (or any other symbolic music format parseable by music21 that contains information from which a sensible engraving can be inferred) into several .hdf5 files. We require a corpus of OMR'ed music files accompanied by their corrected versions (to serve as test and validation data), as well as a set of correct music files of a similar genre to be used for training through data augmentation (again, see the ISMIR paper cited above). The run_all_data_preparation.py script processes these files and produces five intermediary files:

A .h5 file containing only correct musical pieces in agnostic format (for training, with data augmentation)
A .h5 file containing OMR'ed musical pieces aligned with corresponding correct musical files
A .h5 file containing supervised input / target training data based on alignments of the OMR'ed / correct musical pieces in the previous .h5 file
A vocab.txt file that assigns a single natural number index to each agnostic encoding token
A .joblib file created by training a simple regression model on part of the OMR'ed musical pieces, for inserting OMR-like errors into the non-OMR'ed corpus of musical files
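
As a rough illustration of two of the files listed above, the sketch below builds a token-to-index vocabulary and derives per-token error labels by aligning an OMR'ed sequence against its corrected version. It is not the repository's implementation: the function names build_vocab and label_errors and the token strings are hypothetical, indices are assumed to start at 0, and Python's difflib stands in for whatever alignment method the project actually uses.

```python
from difflib import SequenceMatcher


def build_vocab(sequences):
    """Assign an index to each distinct token, in first-seen order (assumed 0-based)."""
    vocab = {}
    for seq in sequences:
        for token in seq:
            if token not in vocab:
                vocab[token] = len(vocab)
    return vocab


def label_errors(omr_tokens, correct_tokens):
    """Return one 0/1 label per OMR token; 1 marks a disagreement with the correct sequence."""
    labels = [0] * len(omr_tokens)
    matcher = SequenceMatcher(a=omr_tokens, b=correct_tokens, autojunk=False)
    for tag, i1, i2, _j1, _j2 in matcher.get_opcodes():
        if tag in ("replace", "delete"):
            # OMR tokens that are wrong or should not be there.
            for i in range(i1, i2):
                labels[i] = 1
        elif tag == "insert" and omr_tokens:
            # Tokens missing from the OMR output: flag the position where they belong.
            labels[min(i1, len(omr_tokens) - 1)] = 1
    return labels


# Hypothetical agnostic tokens, for illustration only.
correct = ["clef.G", "timeSig.4/4", "note.quarter", "note.half", "barline"]
omr = ["clef.G", "timeSig.4/4", "note.eighth", "note.half", "barline"]

vocab = build_vocab([correct, omr])
targets = label_errors(omr, correct)
inputs = [vocab[token] for token in omr]
```

Here inputs holds the index-encoded OMR sequence and targets marks the single misread token, which mirrors the input / target pairing described for the supervised .h5 file.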

Versions of these are already included in the GitHub repository for this project under /processed_datasets and /data_management. If you want to make your own, you will have to edit the paths in the run_all_data_preparation.py script so that they point to the folders on your local machine where your symbolic music files are kept.

The model is trained and tested by setting a parameters file and running train_lstut_model.py with the relevant arguments (run python train_lstut_model.py -h for details).

