wavlm

Here are 12 public repositories matching this topic...

yl4579 / StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

text-to-speech deep-learning pytorch tts speech-synthesis gan speaker-adaptation adversarial-training diffusion-models wavlm latent-diffusion latent-diffusion-models

Updated Aug 10, 2024
Python

s3prl / s3prl

Star

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Updated Dec 22, 2024
Python

wenet-e2e / wespeaker

Star

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Updated Dec 25, 2024
Python

mjhydri / Singing-Vocal-Beat-Tracking

Star

This repo contains the source code of the first deep learning-base singing voice beat tracking system. It leverages WavLM and DistilHuBERT pre-trained speech models to create vocal embeddings and trains linear multi-head self-attention layers on top of them to extract vocal beat activations. Then, it uses HMM decoder to infer signing beats and t…

music music-information-retrieval beat-tracking self-supervised singing-voice hubert linear-transformer wavlm

Updated Sep 4, 2022
Python

lucadellalib / discrete-wavlm-codec

Star

A neural speech codec based on discrete WavLM representations

clustering pytorch speech-synthesis codec k-means quantization self-supervised-learning hifi-gan wavlm token-extraction neural-speech-coding

Updated Aug 28, 2024
Python

alessandropec / data_driven_ai_voice_cloning

Star

This repository contain the code of the main part of my master thesis degree at Politecnico di Torino in Data science & Engineering

machine-learning text-to-speech ai deep-learning speaker-verification zero-shot-learning speaker-embeddings voice-cloning tacotron2 fastspeech2 ecapa-tdnn wavlm generative-ai

Updated Mar 5, 2023
Python

Sarasadeghii / Sharif-WavLM

Star

In this repository, the wavLM model is used for quality and poor quality data for speaker verification task, and the PyCM library is used for evaluation.

confusion-matrix speaker-verification farsi-datasets wavlm pycm

Updated May 27, 2023
Jupyter Notebook

lucadellalib / audiocodecs

Star

A collections of audio codecs with a standardized API

text-to-speech pytorch speech-synthesis codec quantization mimi dac self-supervised-learning encodec wavlm speech-coding speechtokenizer speech-language-model

Updated Nov 24, 2024
Python

theolepage / wavlm_ssl_sv

Star

SOTA method for self-supervised speaker verification leveraging a large-scale pretrained ASR model.

pytorch speaker-recognition speaker-verification asr dino self-supervised-learning voxceleb meta-project-show meta-project-color-8877f4 meta-project-order-2 wavlm

Updated Sep 19, 2024
Python

zhu00121 / Universal-representation-dynamics-of-deepfake-speech

Star

This repo contains code used in the paper "Characterizing the temporal dynamics of universal speech representations for generalizable deepfake detection"

self-supervised deepfake-detection wav2vec2 wavlm modulation-transformation

Updated Oct 19, 2023
Python

aitor-alvarez / acoustic-transformer-models

Star

Acoustic Transformer Models for Audio Classification

classification acoustic transformer-models pytorch-lightning hubert wav2vec2 wavlm

Updated Nov 2, 2024
Python

lucadellalib / cryceleb2023

Star

CryCeleb2023 experiments

metric-learning speaker-verification triplet-loss eer am-softmax ecapa-tdnn titanet wavlm cryceleb2023

Updated Jul 5, 2023
Jupyter Notebook

Improve this page

Add a description, image, and links to the wavlm topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the wavlm topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

wavlm

Here are 12 public repositories matching this topic...

yl4579 / StyleTTS2

s3prl / s3prl

wenet-e2e / wespeaker

mjhydri / Singing-Vocal-Beat-Tracking

lucadellalib / discrete-wavlm-codec

alessandropec / data_driven_ai_voice_cloning

Sarasadeghii / Sharif-WavLM

lucadellalib / audiocodecs

theolepage / wavlm_ssl_sv

zhu00121 / Universal-representation-dynamics-of-deepfake-speech

aitor-alvarez / acoustic-transformer-models

lucadellalib / cryceleb2023

Improve this page

Add this topic to your repo