librivox-errors

For the project in course DT2112 VT24, "Detecting and correcting speech errors produced on LibriVox audiobooks".

Phoneme model and fine-tuning with 5-gram

We used a wav2vec2 model and fed it a 5-gram model provided by our supervisor: https://www.kaggle.com/code/julwan/dubliners-wav2vec2-xls-r-300m-timit-phoneme

Error metrics

Word error rate

https://www.kaggle.com/code/julwan/wer-dubliners
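
For readers unfamiliar with the metric, word error rate is the word-level edit distance (substitutions, deletions, and insertions) between a reference transcript and a hypothesis, divided by the number of reference words. The following is a minimal sketch of that definition, not the code from the notebook above; the function name `word_error_rate` and the example sentences are assumptions made for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return the word-level edit distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Hypothetical example sentences, not taken from the project data.
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

Because insertions count as errors, the value can exceed 1.0 when the hypothesis is much longer than the reference.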

Distance alignment confusion matrix

https://www.kaggle.com/code/julwan/distance-alignment-confusion-matrix-dubliners

Computed over characters and phones, comparing the two different accents.
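
One way to build such a matrix is to align two symbol sequences by edit distance and count which symbol in one sequence lines up with which symbol in the other. The sketch below illustrates that idea under stated assumptions: it uses a plain Levenshtein alignment with unit costs, which may differ from the alignment in the notebook above, and the names `align`, `confusion_counts`, and `GAP` are hypothetical. The sequences can be characters or lists of phone symbols.

```python
from collections import Counter

GAP = "-"  # placeholder for a symbol with no counterpart (assumed convention)


def align(ref, hyp):
    """Align two sequences by Levenshtein distance; return (ref, hyp) symbol pairs."""
    n, m = len(ref), len(hyp)
    dist = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dist[i][0] = i
    for j in range(1, m + 1):
        dist[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            dist[i][j] = min(dist[i - 1][j] + 1,
                             dist[i][j - 1] + 1,
                             dist[i - 1][j - 1] + cost)

    # Trace back from the bottom-right cell to recover one optimal alignment.
    pairs = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            if dist[i][j] == dist[i - 1][j - 1] + cost:
                pairs.append((ref[i - 1], hyp[j - 1]))
                i, j = i - 1, j - 1
                continue
        if i > 0 and dist[i][j] == dist[i - 1][j] + 1:
            pairs.append((ref[i - 1], GAP))  # symbol only in ref
            i -= 1
        else:
            pairs.append((GAP, hyp[j - 1]))  # symbol only in hyp
            j -= 1
    pairs.reverse()
    return pairs


def confusion_counts(ref, hyp):
    """Count how often each ref symbol is aligned with each hyp symbol."""
    return Counter(align(ref, hyp))


if __name__ == "__main__":
    # Hypothetical character-level example, not taken from the project data.
    for (r, h), count in sorted(confusion_counts("kitten", "sitten").items()):
        print(r, h, count)
```

Summing the counters over all utterance pairs gives the entries of the confusion matrix; off-diagonal cells such as `("k", "s")` mark substitutions, and cells involving `GAP` mark symbols present in only one of the sequences.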

Report

The report can be found at https://www.overleaf.com/read/qhtvzkrjthms#83b9f5

Credit

Thanks to our supervisor Jim O'Regan for helping us through the project and providing the materials we needed to work on it.

Team

Julia Wang
Kevin Wenström
Peter Cady
