Replies: 1 comment
-
By default it show word-level and segment-level timing. So you just need to disable one to show only the other. result = model.transcribe('audio.mp3')
result.to_srt_vtt('output.srt', segment_level=False) |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
First of all, thank you very much for this amazing version of whisper.
I just wanted to ask you for the word_level function, resulting to a srt file with timecodes for each word.
If i want to format the output, it's the same logic as original whisper?
Thank you very much for your time
Beta Was this translation helpful? Give feedback.
All reactions