Build software better, together

aishoot / Speech_Feature_Extraction

Feature extraction of speech signal is the initial stage of any speech recognition system.

signal-processing speech feature-extraction speech-dataset speech-feature-extraction speech-features speech-preprocess

Updated Sep 3, 2020
Python

hetpandya / youtube_tts_data_generator

A python library to generate speech dataset from Youtube videos

text-to-speech youtube python-library tts speech-dataset dataset-generator youtube-dataset youtube-dataset-generator tts-dataset text-to-speech-dataset

Updated Jun 7, 2024
Python

ruslan-corpus / ruslan-corpus.github.io

Star

text-to-speech tts russian speech-dataset speech-corpus

Updated Aug 29, 2019
HTML

fjxmlzn / RNN-SM

Star

[T-IFS] RNN-SM: Fast Steganalysis of VoIP Streams Using Recurrent Neural Network

algorithm steganalysis idc rnn-sm ss-qccn speech-dataset

Updated May 24, 2018
Python

manankshastri / Trigger-Word-Detection

Star

Construct a speech dataset and implement an algorithm for trigger word detection (sometimes also called keyword detection, or wakeword detection).

python deep-learning rnn gated-recurrent-units speech-dataset trigger-word-detection

Updated Apr 14, 2019
Jupyter Notebook

Rumeysakeskin / Speech-Datasets-for-ASR

Sponsor

Star

Download speech datasets (English and non-English) for Automatic Speech Recognition

speech-synthesis speech-recognition speech-to-text speech-processing asr speech-dataset audio-datasets voice-datasets common-voice-dataset voxforge-dataset

Updated Jan 22, 2023
Jupyter Notebook

gauthelo / kallaama-speech-dataset

Star

A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.

natural-language-processing agriculture speech-processing speech-dataset senegal-language

Updated Apr 29, 2024

MahtaFetrat / ManaTTS-Persian-Speech-Dataset

Star

ManaTTS is the largest open Persian speech dataset with 86+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.

text-to-speech tts speech-synthesis persian data-collection data-preprocessing speech-processing forced-alignment speech-dataset speech-corpus dataset-preparation persian-speech tts-dataset text-to-speech-dataset mana-tts speech-data-collection

Updated Sep 13, 2024
Jupyter Notebook

petrichorwq / DECRO-dataset

Star

Deepfake cross-lingual evaluation dataset (DECRO) is constructed to evaluate the influence of language differences on deepfake detection.

speech-dataset deepfake-detection

Updated Sep 14, 2023

revsic / speechset

Star

Numpy-librosa implementation of Speech dataset pipeline

preprocessor tts vocoder speech-dataset

Updated Jan 18, 2023
Python

Ralireza / PSDR

Star

Persian spoken digit recognition

speech-recognition persian speech-recognizer speech-analysis speech-dataset persian-speech-recognition persian-spoken-digit persian-dataset

Updated Jul 28, 2019
Python

KanishkNavale / Speech-Emotion-Recognition

Star

A simple CNN-LSTM deep neural model using Tensorflow to classify emotions from a speech dataset

deep-learning tensorflow cnn lstm speech-emotion-recognition speech-dataset

Updated Jun 1, 2022
Jupyter Notebook

ina-foss / InaGVAD

Star

Voice activity detection and speaker gender segmentation audiovisual corpus

radio benchmark corpus tv dataset gender audio-segmentation voice-activity-detection gender-prediction speech-dataset gender-bias speech-activity-detection speaker-gender speech-corpus audio-dataset audiovisual-dataset acoustic-diversity gender-representation

Updated Jun 6, 2024
Jupyter Notebook

MahtaFetrat / GPTInformal-Persian-Speech-Dataset

Star

A free licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject

text-to-speech tts speech-synthesis persian data-collection data-preprocessing speech-processing forced-alignment speech-dataset speech-corpus dataset-preparation persian-speech tts-dataset text-to-speech-dataset mana-tts speech-data-collection manatts

Updated Sep 22, 2024

neuralwork / speech-collector

Star

A full-stack webapp for collecting and managing speech datasets.

collection dataset dataset-generation speech-dataset voice-dataset dataset-collection

Updated Nov 30, 2024
TypeScript

mborsdorf / GlobalPhoneMS_Scripts

Star

multilingual python deep-learning matlab speech-separation speech-dataset auditory-attention

Updated Sep 6, 2021
MATLAB

mborsdorf / TargetLanguageExtraction

Star

audio multilingual python deep-learning matlab pytorch speech-processing audio-processing source-separation speech-separation speech-dataset auditory-attention speech-corpus speaker-extraction speech-database

Updated Feb 8, 2022

EmoTa is an open-access Tamil Speech Emotion Recognition dataset with 936 utterances from 22 native speakers, covering five emotions (anger, happiness, sadness, fear, and neutrality). It supports emotion classification tasks and advances Tamil language processing.

ser speech-emotion-recognition speech-dataset emotional-speech coling2025 tamil-speech-emotion-recognition emota tamil-language-processing chipsal

Updated Jan 14, 2025

PanosAntoniadis / fast-recorder

Star

Simple script that creates a speech dataset quickly

recorder speech-to-text sphinx-4 speech-dataset

Updated Jul 13, 2019
Python

nafiuny / voice_conversion_dataset

Star

top dataset for voice conversion models

python text-to-speech tts dataset speech-to-text datasets pyth voice-conversion vc speech-dataset audio-datasets voice-dataset voice-datasets audio-dataset tts-dataset vc-dataset

Updated Oct 28, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-dataset

Here are 25 public repositories matching this topic...

aishoot / Speech_Feature_Extraction

hetpandya / youtube_tts_data_generator

ruslan-corpus / ruslan-corpus.github.io

fjxmlzn / RNN-SM

manankshastri / Trigger-Word-Detection

Rumeysakeskin / Speech-Datasets-for-ASR

gauthelo / kallaama-speech-dataset

MahtaFetrat / ManaTTS-Persian-Speech-Dataset

petrichorwq / DECRO-dataset

revsic / speechset

Ralireza / PSDR

KanishkNavale / Speech-Emotion-Recognition

ina-foss / InaGVAD

MahtaFetrat / GPTInformal-Persian-Speech-Dataset

neuralwork / speech-collector

mborsdorf / GlobalPhoneMS_Scripts

mborsdorf / TargetLanguageExtraction

aaivu / EmoTa

PanosAntoniadis / fast-recorder

nafiuny / voice_conversion_dataset

Improve this page

Add this topic to your repo