Commit

Tweaked model loading text
nikita-savelyevv committed Jan 25, 2024
1 parent a98ee8d commit 06c54b5
Showing 1 changed file with 3 additions and 1 deletion.
@@ -70,7 +70,9 @@
"## Load PyTorch model\n",
"[back to top ⬆️](#Table-of-contents:)\n",
"\n",
"The `AutoModelForSpeechSeq2Seq.from_pretrained` method is used for the initialization of PyTorch Whisper model using the transformers library. We will use the `distil-whisper/distil-large-v2` model as an example in this tutorial. The model will be downloaded once during first run and this process may require some time. More details about this model can be found in [model_card](https://huggingface.co/distil-whisper/distil-large-v2).\n",
"The `AutoModelForSpeechSeq2Seq.from_pretrained` method is used for the initialization of PyTorch Whisper model using the transformers library. By default, we will use the `distil-whisper/distil-large-v2` model as an example in this tutorial. The model will be downloaded once during first run and this process may require some time.\n",
"\n",
"You may also choose other models from [Distil-Whisper hugging face collection](https://huggingface.co/collections/distil-whisper/distil-whisper-models-65411987e6727569748d2eb6) such as `distil-whisper/distil-medium.en` or `distil-whisper/distil-small.en`. Models of the original Whisper architecture are also available, more on them [here](https://huggingface.co/openai).\n",
"\n",
"Preprocessing and post-processing are important in this model use. `AutoProcessor` class used for initialization `WhisperProcessor` is responsible for preparing audio input data for the model, converting it to Mel-spectrogram and decoding predicted output token_ids into string using tokenizer."
]
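
For context, the loading step that the changed cell describes can be sketched roughly as follows. This is an illustration and not code from the commit: the helper names `load_model_and_processor` and `transcribe` and the constant `DISTIL_WHISPER_MODEL_IDS` are hypothetical, and the 16 kHz sampling rate is an assumption about the input audio. Only the three model IDs and the `AutoModelForSpeechSeq2Seq` / `AutoProcessor` classes come from the notebook text.

```python
# Model IDs named in the updated notebook text; the first one is the tutorial default.
DISTIL_WHISPER_MODEL_IDS = [
    "distil-whisper/distil-large-v2",
    "distil-whisper/distil-medium.en",
    "distil-whisper/distil-small.en",
]


def load_model_and_processor(model_id=DISTIL_WHISPER_MODEL_IDS[0]):
    """Hypothetical helper: load a PyTorch Whisper-style model and its processor."""
    # Imported lazily so that defining this helper does not require transformers.
    from transformers import AutoModelForSpeechSeq2Seq, AutoProcessor

    # The processor bundles the feature extractor (audio -> log-Mel spectrogram)
    # and the tokenizer (token ids -> text).
    processor = AutoProcessor.from_pretrained(model_id)
    # Weights are downloaded on first use and cached afterwards.
    model = AutoModelForSpeechSeq2Seq.from_pretrained(model_id)
    model.eval()
    return model, processor


def transcribe(model, processor, audio, sampling_rate=16000):
    """Hypothetical helper: transcribe a 1-D float waveform (assumed 16 kHz)."""
    # Pre-processing: convert the raw waveform into model input features.
    inputs = processor(audio, sampling_rate=sampling_rate, return_tensors="pt")
    # Inference: generate the predicted token ids.
    predicted_ids = model.generate(inputs.input_features)
    # Post-processing: decode the token ids into a string.
    return processor.batch_decode(predicted_ids, skip_special_tokens=True)[0]
```

Calling `load_model_and_processor("distil-whisper/distil-small.en")` would select one of the smaller checkpoints mentioned in the added paragraph; the first call for any model ID triggers the download.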