
Add precision conversion and quantization filters #3059

Open · wants to merge 7 commits into main

Conversation

ZiyueXu77 (Collaborator):

Fixes # .

Description

Add 16- and 8-bit quantization filters and apply them to the LLM-HF example and experiments.
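For context, a minimal sketch of the 8-bit path, assuming model weights arrive as a dict of float32 numpy arrays (quantize_int8 and its metadata layout are illustrative, not the filter implemented in this PR):

import numpy as np


def quantize_int8(params):
    """Per-tensor affine quantization of float32 arrays to int8.

    Returns the quantized tensors plus the scale/offset needed to
    reconstruct approximate float values on the receiving side.
    """
    quantized, meta = {}, {}
    for name, arr in params.items():
        lo, hi = float(arr.min()), float(arr.max())
        scale = (hi - lo) / 255.0
        if scale == 0.0:  # constant tensor; any nonzero scale works
            scale = 1.0
        # map values to [0, 255], then shift to the int8 range [-128, 127]
        q = np.round((arr - lo) / scale) - 128.0
        quantized[name] = q.astype(np.int8)
        meta[name] = {"scale": scale, "offset": lo}
    return quantized, meta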

Types of changes

  • Non-breaking change (fix or new feature that would not break existing functionality).
  • Breaking change (fix or new feature that would cause existing functionality to change).
  • New tests added to cover the changes.
  • Quick tests passed locally by running ./runtest.sh.
  • In-line docstrings updated.
  • Documentation updated.

Collaborator:

Why not just add an option for compressions to sft_job?

Collaborator (author):

good point! updated

Collaborator:

Need some unit tests for model compressor & extractor
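For example, a round-trip test along these lines would cover both directions; quantize_int8/dequantize_int8 stand in for whatever the quantizer and dequantizer filters actually expose (see the illustrative sketches elsewhere in this thread):

import numpy as np


def test_int8_round_trip():
    rng = np.random.default_rng(0)
    weights = {"layer.weight": rng.normal(size=(4, 8)).astype(np.float32)}

    quantized, meta = quantize_int8(weights)
    restored = dequantize_int8(quantized, meta)

    for name, original in weights.items():
        assert quantized[name].dtype == np.int8
        assert restored[name].dtype == np.float32
        # int8 quantization is lossy; rounding error is at most one quantization step
        assert np.allclose(original, restored[name], atol=meta[name]["scale"])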

        else:
            self.compression_type = compression_type
        # compression constants
        self.FP16_MIN = np.finfo(np.float16).min
Collaborator:

Shouldn't this depend on source_data_type?

Collaborator (author):

yes, initialized here to avoid multiple finfo calls
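For reference, a minimal sketch of the pattern being discussed (class and method names are illustrative, not this PR's filter): compute the float16 range once in the constructor, then clip each tensor to that range before casting so out-of-range values saturate instead of overflowing to inf.

import numpy as np


class Fp16Converter:
    """Illustrative only; not the filter class in this PR."""

    def __init__(self):
        # compute the float16 range once instead of calling np.finfo per tensor
        self.fp16_min = np.finfo(np.float16).min
        self.fp16_max = np.finfo(np.float16).max

    def convert(self, params):
        # clip before casting so out-of-range values saturate rather than become +/-inf
        return {
            name: np.clip(arr, self.fp16_min, self.fp16_max).astype(np.float16)
            for name, arr in params.items()
        }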

class ModelCompressor(DXOFilter):
    def __init__(
        self,
        source_data_type="float32",
Collaborator:

in general, why do we need source_data_type? Can we not automatically determine it?

Collaborator (author):

removed, along with its counterpart in the decompressor
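One possible way to infer the incoming precision from the tensors themselves rather than a config argument (a sketch, assuming parameters arrive as a dict of numpy arrays; the helper name is hypothetical):

import numpy as np


def infer_source_dtype(params):
    """Return the common dtype of the incoming tensors, e.g. dtype('float32')."""
    dtypes = {np.asarray(arr).dtype for arr in params.values()}
    if len(dtypes) != 1:
        raise ValueError(f"mixed parameter dtypes: {dtypes}")
    return dtypes.pop()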

Collaborator:

I would call this "Decompressor". Extraction is more often used for archives with multiple files. Here, we just use data compression/decompression.

Collaborator (author):

good idea, changed to "quantizer" and "dequantizer"
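To illustrate why the quantizer/dequantizer naming fits, here is the inverse of the quantize_int8 sketch above (again illustrative, not this PR's code): the dequantizer undoes the int8 mapping using the stored scale and offset.

import numpy as np


def dequantize_int8(quantized, meta):
    """Inverse of the quantize_int8 sketch: reconstruct approximate float32 tensors."""
    return {
        name: (q.astype(np.float32) + 128.0) * meta[name]["scale"] + meta[name]["offset"]
        for name, q in quantized.items()
    }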

holgerroth (Collaborator) left a comment:

Added some suggestions.
