Skip to content

Latest commit

 

History

History
86 lines (56 loc) · 3.01 KB

README.md

File metadata and controls

86 lines (56 loc) · 3.01 KB

speechRecognitionTranscriber Homepage Latest Release Build Status

Speech Recognition Transcriber: speechRecognitionTranscriber (Python API)

Introduction

speechRecognitionTranscriber module use Google Speech API in python. This module performs speech recognition and converts to text. Admits video and audio files to be transcribed. Use ffmpeg to convert video and audio files to .wav to be recognized in Google Speech API. Also use fragments division based on silence.

Documentation available on docs.

Running software

speechRecognitionTranscriber requires video/audio like input. The process to running the program:

  1. Execute programs/speechRecognitionTranscriber.py, to start de program.
python speechRecognitionTranscriber.py
  1. Introduce your file path.
yourfile.extension

NOTE:

  • Transcribed text is saved in transcribedText.txt.
  • Transcribed text is saved in transcribedText.pdf.
  • Audio fragments are saved in /fragments.
  • Converted source is saved as convertedFile.wav.

Temporal files like cconvertedFile.wav and /fragments are deleted when program ends.

Requirements

speechRecognitionTranscriber requires:

pip install SpeechRecognition
  • Install fpdf:
pip install fpdf
  • Install pydub
pip install pydub
  • Install ffmpeg

Linux

sudo apt-get install ffmpeg

Microsoft Windows Download binaries and set path in system variables.

Tested on: windows 10,ubuntu 14.04, ubuntu 16.04, ubuntu 18.04, lubuntu 18.04 and raspbian.

Status

Build Status

Issues

Related projects