Building an automatic speech recognition pipeline using the LibriSpeech dataset. For the acoustic model, both spectrograms and MFCCs are extracted from the data and fed to a DNN. Several DNN architectures are implemented with Keras on a TensorFlow backend to find the best-performing option. The final model consists of a 1D CNN layer that extracts features from the spectrogram, two bidirectional GRU layers each followed by a batch normalization layer, and a time-distributed dense layer with a softmax activation that outputs a probability distribution over characters at each time step.
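A minimal sketch of that final architecture, assuming Keras with the TensorFlow backend. The layer sizes (filters, GRU units, output vocabulary) are illustrative assumptions and not values taken from this repository.

```python
from tensorflow.keras.models import Model
from tensorflow.keras.layers import (Input, Conv1D, BatchNormalization,
                                     Bidirectional, GRU, TimeDistributed,
                                     Dense, Activation)

def final_model(input_dim=161, filters=200, kernel_size=11,
                units=200, output_dim=29):
    """1D CNN -> 2x (bidirectional GRU + batch norm) -> time-distributed softmax."""
    # Acoustic features: (time_steps, input_dim) spectrogram frames
    input_data = Input(name='the_input', shape=(None, input_dim))

    # 1D convolution extracts local features from the spectrogram
    x = Conv1D(filters, kernel_size, strides=2, padding='valid',
               activation='relu', name='conv1d')(input_data)
    x = BatchNormalization(name='bn_conv')(x)

    # Two bidirectional GRU layers, each followed by batch normalization
    x = Bidirectional(GRU(units, return_sequences=True), name='bi_gru_1')(x)
    x = BatchNormalization(name='bn_gru_1')(x)
    x = Bidirectional(GRU(units, return_sequences=True), name='bi_gru_2')(x)
    x = BatchNormalization(name='bn_gru_2')(x)

    # Time-distributed dense layer with softmax yields a character
    # probability distribution at every time step
    x = TimeDistributed(Dense(output_dim), name='time_dense')(x)
    y_pred = Activation('softmax', name='softmax')(x)

    return Model(inputs=input_data, outputs=y_pred)
```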
moelkhawaga/DNN_Speech_Recognition
About
Exploring various model architectures for an automatic speech recognition pipeline