abrari/tensorflow_speech_recognition

 
 

Tensorflow Speech Recognition

An example showing how to get the CTC (connectionist temporal classification) cost function working with TensorFlow for automatic speech recognition.
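To make the CTC cost concrete, here is a minimal NumPy sketch of the quantity it computes: the negative log-probability of a label sequence, summed over every frame-level alignment that collapses to it. This is not the code used in this repository (TensorFlow provides the cost as `tf.nn.ctc_loss`). The function name `ctc_neg_log_likelihood` and the choice of index 0 for the blank symbol are assumptions made for illustration only.

```python
import numpy as np

def ctc_neg_log_likelihood(probs, labels, blank=0):
    """Illustrative CTC cost via the forward algorithm.

    probs  : (T, C) array of per-frame class probabilities (rows sum to 1).
    labels : list of target class indices, without blanks.
    blank  : index of the blank symbol (assumed to be 0 in this sketch).
    """
    T = probs.shape[0]

    # Extended target: a blank before, between, and after every label.
    ext = [blank]
    for label in labels:
        ext += [label, blank]
    S = len(ext)

    # alpha[t, s] = total probability of all partial alignments that
    # end in extended symbol s at frame t.
    alpha = np.zeros((T, S))
    alpha[0, 0] = probs[0, ext[0]]
    if S > 1:
        alpha[0, 1] = probs[0, ext[1]]

    for t in range(1, T):
        for s in range(S):
            total = alpha[t - 1, s]              # stay on the same symbol
            if s >= 1:
                total += alpha[t - 1, s - 1]     # advance by one symbol
            # Skipping a blank is allowed only between two different labels.
            if s >= 2 and ext[s] != blank and ext[s] != ext[s - 2]:
                total += alpha[t - 1, s - 2]
            alpha[t, s] = total * probs[t, ext[s]]

    # A valid alignment ends on the last label or on the trailing blank.
    likelihood = alpha[T - 1, S - 1]
    if S > 1:
        likelihood += alpha[T - 1, S - 2]
    return -np.log(likelihood)
```

The sketch works in plain probabilities, so it underflows on long utterances; practical implementations work in log space. It is meant only to show what the cost measures on short inputs.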

Requirements

  • python 2.7+
    • tensorflow 1.0+
    • python_speech_features
    • numpy
    • scipy
  • sox (to convert MP3 to WAV)

Speech Data

I'm trying to transcribe recitations of the Quran by various reciters. The verse-by-verse recitations can be downloaded here. Convert them to WAV format using the 2wav.sh script. Some WAV files of surah Al-Fatihah, verse 2, are included in the wav directory to get started.
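For readers who prefer to do the conversion from Python, the step might look roughly like the sketch below. It is not the contents of 2wav.sh, which are not shown here. It assumes sox is on the PATH and was built with MP3 support, and the helper names `sox_command` and `convert_all` are hypothetical.

```python
import os
import subprocess

def sox_command(mp3_path, wav_dir):
    """Build the sox invocation that converts one MP3 file to WAV.

    sox infers the output format from the ".wav" file extension.
    """
    base = os.path.splitext(os.path.basename(mp3_path))[0]
    wav_path = os.path.join(wav_dir, base + ".wav")
    return ["sox", mp3_path, wav_path]

def convert_all(mp3_dir, wav_dir):
    """Convert every .mp3 file in mp3_dir, writing the results to wav_dir."""
    if not os.path.isdir(wav_dir):
        os.makedirs(wav_dir)
    for name in sorted(os.listdir(mp3_dir)):
        if name.lower().endswith(".mp3"):
            # check_call raises CalledProcessError if sox exits with an error.
            subprocess.check_call(sox_command(os.path.join(mp3_dir, name), wav_dir))
```

Calling `convert_all("mp3", "wav")` would convert a directory of downloaded recitations; both directory names are placeholders.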

Learning Materials

Some useful introductory materials to get started:

License

This project is licensed under the terms of the MIT license.

See README for more information.
