Building & Updating Pipeline

Install python 3.12 virtual environment: python3 -m venv myenv and source myenv/bin/activate.
Run the build.sh script except for the last cog run pythong line.
Run: sudo cog push r8.im/vishjain/whisperx-replicate-speaklabs-v2 to build and push to the Replicate repository.

whisperX on Replicate

This repo is the codebase behind the following Replicate models, which we use at Upmeet:

victor-upmeet/whisperx : if you don't know which model to use, use this one. It uses a low-cost hardware, which suits most cases
victor-upmeet/whisperx-a40-large : if you encounter some memory issues with previous models, consider this one. It can happen when dealing with long audio files and performing alignment and/or diarization
victor-upmeet/whisperx-a100-80gb : if you encounter some memory issues with previous models, consider this one. It can happen when dealing with long audio files and performing alignment and/or diarization

Model Information

WhisperX provides fast automatic speech recognition (70x realtime with large-v3) with word-level timestamps and speaker diarization.

Whisper is an ASR model developed by OpenAI, trained on a large dataset of diverse audio. Whilst it does produces highly accurate transcriptions, the corresponding timestamps are at the utterance-level, not per word, and can be inaccurate by several seconds. OpenAI’s whisper does not natively support batching, but WhisperX does.

Model used is for transcription is large-v3 from faster-whisper.

For more information about WhisperX, including implementation details, see the WhisperX github repo.

Citation

@misc{bain2023whisperx,
      title={WhisperX: Time-Accurate Speech Transcription of Long-Form Audio}, 
      author={Max Bain and Jaesung Huh and Tengda Han and Andrew Zisserman},
      year={2023},
      eprint={2303.00747},
      archivePrefix={arXiv},
      primaryClass={cs.SD}
}

Name		Name	Last commit message	Last commit date
Latest commit History 94 Commits
.gitignore		.gitignore
README.md		README.md
build.sh		build.sh
cog.yaml		cog.yaml
get_vad_model_url.py		get_vad_model_url.py
predict.py		predict.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Building & Updating Pipeline

whisperX on Replicate

Model Information

Citation

About

Releases

Packages

Languages

Zeus-Labs/whisperx-replicate-speaklabs

Folders and files

Latest commit

History

Repository files navigation

Building & Updating Pipeline

whisperX on Replicate

Model Information

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages