Introduction

This repo contains an end-to-end speech recognition system implemented using TensorFlow. It uses bidirectional LSTMs in the encoding layer and attention in the decoding layer. The LSTMs in the encoding layer are strided, so each encoding layer reduces the time dimension by a factor of 2.
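To make the effect of striding concrete, here is a minimal sketch of how the number of time steps shrinks as it passes through the encoder. The function name `encoder_output_lengths` is hypothetical and not part of this repo, and rounding up odd lengths is an assumption; the actual code may pad or truncate differently.

```python
def encoder_output_lengths(num_frames, num_layers=3, stride=2):
    """Return the sequence length after each strided encoder layer.

    Hypothetical helper for illustration only; not part of deepsphinx.
    """
    lengths = []
    length = num_frames
    for _ in range(num_layers):
        # Ceiling division: assumes a trailing partial window still
        # produces one output step (the repo may handle this differently).
        length = -(-length // stride)
        lengths.append(length)
    return lengths


if __name__ == "__main__":
    # 800 input frames through 3 layers, each halving the time dimension.
    print(encoder_output_lengths(800))  # [400, 200, 100]
```

With three such layers, the attention decoder would attend over roughly one eighth as many time steps as there are input frames.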

Model

The default configuration contains 3 bidirectional layers for the encoding and decoding layers.

Current results

I am getting around 15% WER on dev93 when training on the si284 set.
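For reference, WER (word error rate) is the word-level edit distance between the reference transcript and the recognized text, divided by the number of reference words. The sketch below is a generic illustration of that definition, not the scoring script used to produce the number above; the function name `word_error_rate` and whitespace tokenization are assumptions.

```python
def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length.

    Generic illustration; tokenizes on whitespace (an assumption).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # substitution or exact match
            ))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution ("sat" -> "sit") and one deletion ("the"): 2 / 6.
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

A WER of 15% therefore means about 15 word errors (substitutions, deletions, and insertions combined) per 100 reference words.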

cmusphinx/deepsphinx
