k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning #1745

yfyeung · 2024-09-07T15:24:02Z

Libri-Light data processing script
Libri-Light Zipformer multi-node-multi-gpu pre-train recipe
LibriSpeech Zipformer bpe-level prund rnn-t fine-tune recipe
LibriSpeech Zipformer letter-level ctc fine-tune recipe
Release all resource and results for Zipformer Base
Release all resource and results for Zipformer Large

update Update ssl_datamodule.py Update pretrain.py Update pretrain.sh Update pretrain.sh Update hubert_ce.py Update pretrain.py

update

yfyeung · 2024-09-25T03:49:10Z

Part of the resources have been released:

Zipformer Base pre-trained with cross entropy loss: Checkpoints, logs, and scripts
Zipformer Base fine-tuned with pruned RNN-T loss: Checkpoints, logs, and scripts
Zipformer Base fine-tuned with letter-level CTC loss: Checkpoints, logs, and scripts
Zipformer Base pre-trained manifests including kmeans labels: Dataset

With these resources, I believe anyone with 8 V100 32G GPUs can easily reproduce our experiments.

ezerhouni · 2024-09-25T06:48:43Z

@yfyeung Thank you very much for the PR and sharing the model weights. Do you plan to release a paper also ?

yfyeung · 2024-09-25T07:09:23Z

@yfyeung Thank you very much for the PR and sharing the model weights. Do you plan to release a paper also ?

Maybe, but currently I am not sure whether it is suitable for a technical report or just a normal research paper.

Your Name and others added 28 commits August 10, 2024 22:44

add librilight ssl recipe

8e296b7

update Update ssl_datamodule.py Update pretrain.py Update pretrain.sh Update pretrain.sh Update hubert_ce.py Update pretrain.py

support multinode multigpu

f26dd3b

update

Merge branch 'k2-fsa:master' into dev/k2ssl

cce86a3

Merge branch 'k2-fsa:master' into dev/k2ssl

8b1402a

Merge branch 'k2-fsa:master' into dev/k2ssl

70a1713

use lr hours in librilight ssl

d025ce1

Update run_multi_node_multi_gpu.sh

d0a96a6

Update pretrain.py

eca8afc

Update pretrain.py

8fe6713

Update pretrain.py

6dbcdba

Update run_multi_node_multi_gpu.sh

f672df2

Merge branch 'k2-fsa:master' into dev/k2ssl

6357d42

Update pretrain.py

ad61d72

Merge branch 'k2-fsa:master' into dev/k2ssl

26b2a57

Update finetune_ctc.py

ef5cf02

Delete egs/librispeech/SSL/pretrain.sh

13c6e81

Delete egs/librilight/SSL/zipformer/finetune.py

673ca14

Delete egs/librilight/SSL/zipformer/decode.py

affc43b

Delete egs/librilight/SSL/zipformer/asr_datamodule.py

d4a5c40

Update finetune_ctc.py

2e52cbf

Update finetune.py

2d3452f

Update finetune_ce.py

19cc5ba

Update finetune.py

f05b3b1

Merge branch 'k2-fsa:master' into dev/k2ssl

450d05d

small fix

b35924f

fix isort

8c257a3

Merge branch 'k2-fsa:master' into dev/k2ssl

6a30568

Merge branch 'k2-fsa:master' into dev/k2ssl

25b6dd2

yfyeung and others added 8 commits October 21, 2024 13:27

Merge branch 'k2-fsa:master' into dev/k2ssl

e80b9dc

Merge branch 'k2-fsa:master' into dev/k2ssl

c920735

update prepare.sh

84f8adf

add sliding window

a6a8089

update

ce72b34

Merge branch 'k2-fsa:master' into dev/k2ssl

34957a6

Merge branch 'k2-fsa:master' into dev/k2ssl

d4b6cb0

skipping batch counts hurts performance

1b89c6d

yfyeung force-pushed the dev/k2ssl branch 2 times, most recently from 35fd0cf to 1b89c6d Compare October 30, 2024 11:59

yfyeung added 2 commits October 31, 2024 11:18

Merge branch 'k2-fsa:master' into dev/k2ssl

277b261

Merge branch 'k2-fsa:master' into dev/k2ssl

893fee4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning #1745

k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning #1745

yfyeung commented Sep 7, 2024 •

edited

Loading

yfyeung commented Sep 25, 2024

ezerhouni commented Sep 25, 2024

yfyeung commented Sep 25, 2024

k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning #1745

Are you sure you want to change the base?

k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning #1745

Conversation

yfyeung commented Sep 7, 2024 • edited Loading

yfyeung commented Sep 25, 2024

ezerhouni commented Sep 25, 2024

yfyeung commented Sep 25, 2024

yfyeung commented Sep 7, 2024 •

edited

Loading