In this tutorial, you will learn how to build a Serverless application for speech (audio) to text conversion using Amazon Transcribe, AWS Lambda and the Go programming language.
Amazon Transcribe is a service that utilizes machine learning models to convert speech to text automatically. It offers various features that can enhance the accuracy of the transcribed text, such as language customization, content filtering, multi-channel audio analysis, and individual speaker speech partitioning. Amazon Transcribe can be used as a standalone transcription service or to add speech-to-text capabilities to any application. You can transcribe media in real time (streaming) or you can transcribe media files located in an Amazon S3 bucket (batch).
Here is how the application works:
- MP3 audio files uploaded to a
S3
bucket will trigger a Lambda function. - The Lambda function will submit an asynchronous job to Amazon Transcribe which will finally store the resulting file (containing the converted text) in another
S3
bucket.
See CONTRIBUTING for more information.
This library is licensed under the MIT-0 License. See the LICENSE file.