From 7b3ad7e243a5cc2e1d49321646544fb6e6b6cc0e Mon Sep 17 00:00:00 2001 From: Liubov Talamanova Date: Fri, 18 Oct 2024 10:56:28 +0100 Subject: [PATCH] Update docs/articles_en/openvino-workflow/model-optimization-guide/weight-compression.rst Co-authored-by: Alexander Kozlov --- .../model-optimization-guide/weight-compression.rst | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/articles_en/openvino-workflow/model-optimization-guide/weight-compression.rst b/docs/articles_en/openvino-workflow/model-optimization-guide/weight-compression.rst index 132127ef71de28..b7e7b0aa307434 100644 --- a/docs/articles_en/openvino-workflow/model-optimization-guide/weight-compression.rst +++ b/docs/articles_en/openvino-workflow/model-optimization-guide/weight-compression.rst @@ -161,7 +161,7 @@ trade-offs after optimization: `Larger Group Size`: Results in faster inference and a smaller model, but might compromise accuracy. -* ``ratio`` controls the ratio between INT4 and layers compressed to ``backup_mode`` in the model. +* ``ratio`` controls the ratio between the layers compressed to the precision defined by ``mode`` and the rest of the layers that will be kept in the ``backup_mode`` in the optimized model. Ratio is a decimal between 0 and 1. For example, 0.8 means that 80% of layers will be compressed to INT4, while the rest will be compressed to ``backup_mode`` precision. The default value for ratio is 1.