Skip to content

Commit

Permalink
Update docs/articles_en/openvino-workflow/model-optimization-guide/we…
Browse files Browse the repository at this point in the history
…ight-compression.rst

Co-authored-by: Alexander Kozlov <[email protected]>
  • Loading branch information
l-bat and AlexKoff88 authored Oct 18, 2024
1 parent 7de9725 commit 7b3ad7e
Showing 1 changed file with 1 addition and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -161,7 +161,7 @@ trade-offs after optimization:
`Larger Group Size`: Results in faster inference and a smaller model, but might
compromise accuracy.

* ``ratio`` controls the ratio between INT4 and layers compressed to ``backup_mode`` in the model.
* ``ratio`` controls the ratio between the layers compressed to the precision defined by ``mode`` and the rest of the layers that will be kept in the ``backup_mode`` in the optimized model.
Ratio is a decimal between 0 and 1. For example, 0.8 means that 80% of layers will be
compressed to INT4, while the rest will be compressed to ``backup_mode`` precision. The default
value for ratio is 1.
Expand Down

0 comments on commit 7b3ad7e

Please sign in to comment.