Update convert_and_optimize_asr.py #1659
Conversation
Update convert_and_optimize_asr.py with quantization code for the Whisper model
Thanks, @zhuo-yoyowz. Did you test it in the app? Will it work with Optimum Intel (app uses HF interface)?
Is it possible to quantize it using OVQuantizer from Optimum Intel?
Hi Adrian, I've tested it in app.py. Without changing any code in app.py, the current pipeline can also load and compile the quantized model successfully. I haven't tested OVQuantizer from Optimum Intel yet; I'm still a bit uncertain about how to set the configuration for the calibration dataset and how to define the preprocess function.
Does it work for you? If I use it in the app I don't get a meaningful transcription. Just a random word.
    decoder_calibration_data)

calibration_dataset = load_dataset("librispeech_asr", "clean", split="validation", streaming=True)
for sample in tqdm(islice(calibration_dataset, calibration_dataset_size), desc="Collecting calibration data",
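The pattern in the diff above bounds how much of the streaming dataset is consumed. A minimal stdlib sketch of the same idea, with a dummy generator standing in for `datasets.load_dataset(..., streaming=True)` (the generator and its fields are illustrative assumptions, not the real LibriSpeech schema):

```python
from itertools import islice

# Stand-in for the streaming dataset; in the real script this iterable
# comes from datasets.load_dataset("librispeech_asr", ..., streaming=True).
def fake_streaming_dataset():
    for i in range(1000):
        yield {"audio": {"array": [0.0] * 16, "sampling_rate": 16000}, "id": i}

calibration_dataset_size = 50
calibration_data = []
# islice stops after calibration_dataset_size samples, so the (potentially
# huge) streaming dataset is never fully downloaded or iterated.
for sample in islice(fake_streaming_dataset(), calibration_dataset_size):
    calibration_data.append(sample)

print(len(calibration_data))  # 50
```

The point of `streaming=True` plus `islice` is that only the first `calibration_dataset_size` samples are ever fetched.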
tqdm is causing some errors for me (expecting a notebook?)
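The "expecting a notebook" symptom typically appears when `tqdm.notebook` is imported outside Jupyter. A hedged sketch of one common workaround: use `tqdm.auto` (which picks a console or notebook frontend automatically) and degrade to a no-op wrapper if tqdm is not installed at all:

```python
# Fall back to a pass-through wrapper when tqdm is unavailable, so the
# calibration loop behaves the same in plain scripts and in notebooks.
try:
    from tqdm.auto import tqdm  # selects console or notebook bar automatically
except ImportError:
    def tqdm(iterable, **kwargs):
        return iterable  # no progress bar, but the loop still runs

processed = [x * 2 for x in tqdm(range(5), desc="Collecting calibration data")]
print(processed)  # [0, 2, 4, 6, 8]
```

This keeps the loop body unchanged whichever branch is taken.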
if not output_dir.exists():
    ov_model = OVModelForSpeechSeq2Seq.from_pretrained(
        MODEL_NAME, ov_config=ov_config, export=True, compile=False, load_in_8bit=False
    )
    ov_model.half()
    ov_model.save_pretrained(output_dir)
else:
    ov_model = OVModelForSpeechSeq2Seq.from_pretrained(
        output_dir, ov_config=ov_config, compile=False
    )
I wouldn't check if the model is converted. I don't assume one will convert to FP16 first and then to INT8.
CALIBRATION_DATASET_SIZE = 50
quantized_distil_model_path = model_dir / (MODEL_NAME.rsplit("/")[-1] + "-INT8")
ov_model.to("AUTO")
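The output-path construction in the diff above is pure string and `pathlib` logic, so it can be sketched and checked in isolation. The `MODEL_NAME` value below is an illustrative assumption, not necessarily the one the script ships with:

```python
from pathlib import Path

# Illustrative model id; any "org/repo" Hugging Face id works the same way.
MODEL_NAME = "distil-whisper/distil-large-v2"
model_dir = Path("model")

# rsplit("/")[-1] keeps only the repo name, dropping the org prefix;
# the "-INT8" suffix marks the quantized variant of the model directory.
quantized_distil_model_path = model_dir / (MODEL_NAME.rsplit("/")[-1] + "-INT8")
print(quantized_distil_model_path.name)  # distil-large-v2-INT8
```

Since `rsplit("/")[-1]` and `split("/")[-1]` are equivalent here, either spelling yields the same directory name.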
Why AUTO device here? Shouldn't be CPU or nothing?
Is compilation needed?
Hi Adrian, I've replaced the code with an updated version that uses Optimum Intel for weight compression directly. Please help review. Thanks!
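For reference, Optimum Intel can apply 8-bit weight compression during export via the `load_in_8bit` flag on `from_pretrained`. A hedged sketch (model id and save path are illustrative assumptions; the export call is gated behind a flag since it requires `optimum[openvino]` and a model download):

```python
# Sketch of direct weight compression via Optimum Intel.
RUN_EXPORT = False  # set True in an environment with optimum[openvino] installed

export_kwargs = {
    "export": True,        # convert the PyTorch checkpoint to OpenVINO IR
    "compile": False,      # defer compilation until a target device is chosen
    "load_in_8bit": True,  # compress weights to INT8 during export
}

if RUN_EXPORT:
    from optimum.intel import OVModelForSpeechSeq2Seq
    ov_model = OVModelForSpeechSeq2Seq.from_pretrained(
        "distil-whisper/distil-large-v2", **export_kwargs  # illustrative id
    )
    ov_model.save_pretrained("model/distil-large-v2-INT8")
```

This is weight-only compression, which needs no calibration dataset, in contrast to the activation quantization discussed earlier in the thread.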
Good job!