Releases: m-bain/whisperX
3.1.1
- translate functionality added
- fix word timestamp bug (words no longer have consecutive timestamps)
v3.1.0
- 70x real-time transcription, <8GB GPU memory requirement ⚡️⚡️
- each transcript segment is a sentence (using nltk.sent_tokenize)
- diarization now assigned per sentence (and outputted to srt)
- clean up alignment logic
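As a rough illustration of per-sentence diarization, the sketch below labels each sentence with the speaker whose turns overlap it the most. This is an assumed strategy for illustration only, not necessarily how whisperX assigns speakers; the function name `assign_speakers` and the data layouts are hypothetical.

```python
def assign_speakers(sentences, turns):
    """Label each sentence with the speaker whose turns overlap it most.

    sentences: list of dicts with "start", "end", "text" (times in seconds).
    turns: list of (start, end, speaker) tuples from a diarization step.
    Both layouts are hypothetical, chosen for this sketch.
    """
    labelled = []
    for sent in sentences:
        overlap_by_speaker = {}
        for t_start, t_end, speaker in turns:
            # Length of the intersection between the sentence and this turn.
            overlap = min(sent["end"], t_end) - max(sent["start"], t_start)
            if overlap > 0:
                overlap_by_speaker[speaker] = (
                    overlap_by_speaker.get(speaker, 0.0) + overlap
                )
        # No overlapping turn at all leaves the sentence unlabelled (None).
        best = (
            max(overlap_by_speaker, key=overlap_by_speaker.get)
            if overlap_by_speaker
            else None
        )
        labelled.append({**sent, "speaker": best})
    return labelled
```

Each labelled sentence could then be written out as one subtitle entry with its speaker name prepended.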
v3.0.1
- fix pickling error (set num_workers=0 to perform data loading in the main process)
- add basic diarization
- pad audio for language detection if shorter than 30s
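The padding item can be pictured with a minimal sketch, assuming the usual Whisper convention of 16 kHz audio in 30-second windows. The helper name `pad_or_trim_samples` is hypothetical, and plain Python lists stand in for the audio array; the actual whisperX code may differ.

```python
SAMPLE_RATE = 16_000      # Whisper models expect 16 kHz mono audio
WINDOW_SECONDS = 30       # Whisper works on fixed 30-second windows
N_SAMPLES = SAMPLE_RATE * WINDOW_SECONDS


def pad_or_trim_samples(samples, length=N_SAMPLES):
    """Return exactly `length` samples: trim long clips, zero-pad short ones."""
    if len(samples) >= length:
        return list(samples[:length])
    # Append silence (zeros) so a short clip fills the whole window.
    return list(samples) + [0.0] * (length - len(samples))
```

With this, a clip shorter than 30 seconds is extended with silence before being passed to language detection, rather than being rejected for having the wrong length.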
v3.0.0
batched inference with faster-whisper backend
v2.0.1
allow custom model_dir for torchaudio models
alpha
Merge pull request #96 from smly/fix-batch-processing
FIX: Assertion error in batch processing