audio-segmentation

Star

Here are 22 public repositories matching this topic...

BingLingGroup / autosub

Star

Command-line utility to transcribe/translate from video/audio/subtitles to subtitles

subtitles substation-alpha audio-segmentation xfyun cloud-speech-api voice-activity-detection baidu-api xunfei-api

Updated Dec 21, 2023
Python

amsehili / auditok

Star

An audio/acoustic activity detection and audio segmentation tool

vad audio-data audio-activities audio-segmentation voice-detection voice-activity-detection

Updated Dec 11, 2024
Python

Appen / UHV-OTS-Speech

Star

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

speech-recognition speech-processing audio-segmentation gender-classification speaker-diarization synthetic-speech-detection topic-detection speech-seperation speaker-identification accent-detection speech-transcription speech-annotation

Updated Mar 25, 2023
Forth

mt-upc / SHAS

Star

SHAS: Approaching optimal Segmentation for End-to-End Speech Translation

speech speech-to-text audio-segmentation speech-translation wav2vec2

Updated Feb 9, 2023
Python

nianlonggu / WhisperSeg

Star

Code for ICASSP 2024 paper WhisperSeg: Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection

transformer whisper audio-segmentation voice-activity-detection icassp2024 animal-sound-detection whisperseg

Updated Dec 5, 2024
Python

dangvansam / pyannote-onnx

Star

PyAnnote Voice Activity Detection (ONNX version)

vad audio-segmentation speech-separation onnx speech-activity-detection audio-split audio-splitter pyannote voice-ac

Updated Sep 9, 2023
Jupyter Notebook

huzaifakhan04 / music-recommendation-web-application-based-on-rhythmic-similarity-using-locality-sensitive-hashing

Star

This repository contains a web application that integrates with a music recommendation system, which leverages a dataset of 3,415 audio files, each lasting thirty seconds, utilising a Locality-Sensitive Hashing (LSH) implementation to determine rhythmic similarity, as part of an assignment for the Fundamental of Big Data Analytics (DS2004) course.

music spotify data-science machine-learning big-data music-recommendation lsh web-application music-information-retrieval flask-application locality-sensitive-hashing ann cosine-distance audio-segmentation audio-processing audio-recommendation music-recommendation-system approximate-nearest-neighbors

Updated Mar 1, 2024
Jupyter Notebook

ina-foss / InaGVAD

Star

Voice activity detection and speaker gender segmentation audiovisual corpus

radio benchmark corpus tv dataset gender audio-segmentation voice-activity-detection gender-prediction speech-dataset gender-bias speech-activity-detection speaker-gender speech-corpus audio-dataset audiovisual-dataset acoustic-diversity gender-representation

Updated Jun 6, 2024
Jupyter Notebook

yxlijun / solfege-segmentation

Star

pitch detection,CNN

cnn audio-segmentation f0-detection solfege-segmentation

Updated Sep 21, 2018
Python

Metiu-Metiu / Neural-Texture-Sound-synthesis---data-sets

Star

Synthetic sounds datasets and real sounds datasets of waterflow sounds for the repo 'Neural-Texture-Sound-Synthesis-with-physically-driven-continuous-controls'.

data-augmentation audio-segmentation synthetic-dataset-generation audio-datasets synthetic-dataset real-dataset audio-dataset-for-machine-learning