Skip to content

speechRecognitionTranscriber: Implementation of video / audio to text transcriber using speech recognition with the Google Speech API.

Notifications You must be signed in to change notification settings

davidvelascogarcia/speechRecognitionTranscriber

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

speechRecognitionTranscriber Homepage Latest Release Build Status

Speech Recognition Transcriber: speechRecognitionTranscriber (Python API)

Introduction

speechRecognitionTranscriber module use Google Speech API in python. This module performs speech recognition and converts to text. Admits video and audio files to be transcribed. Use ffmpeg to convert video and audio files to .wav to be recognized in Google Speech API. Also use fragments division based on silence.

Documentation available on docs.

Running software

speechRecognitionTranscriber requires video/audio like input. The process to running the program:

  1. Execute programs/speechRecognitionTranscriber.py, to start de program.
python speechRecognitionTranscriber.py
  1. Introduce your file path.
yourfile.extension

NOTE:

  • Transcribed text is saved in transcribedText.txt.
  • Transcribed text is saved in transcribedText.pdf.
  • Audio fragments are saved in /fragments.
  • Converted source is saved as convertedFile.wav.

Temporal files like cconvertedFile.wav and /fragments are deleted when program ends.

Requirements

speechRecognitionTranscriber requires:

pip install SpeechRecognition
  • Install fpdf:
pip install fpdf
  • Install pydub
pip install pydub
  • Install ffmpeg

Linux

sudo apt-get install ffmpeg

Microsoft Windows Download binaries and set path in system variables.

Tested on: windows 10,ubuntu 14.04, ubuntu 16.04, ubuntu 18.04, lubuntu 18.04 and raspbian.

Status

Build Status

Issues

Related projects

About

speechRecognitionTranscriber: Implementation of video / audio to text transcriber using speech recognition with the Google Speech API.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages