Feat (offload/fx): better buffer/params + call_functional #816

Giuseppe5 · 2024-01-30T16:24:17Z

No description provided.

* optimum: initial optimum integration * Refined solution for offloading * Fix (optimum): clean-up (#802) * Fix (optimum): dataloader and forward cleanup (#807) * Fix (optimum): forward pass + fx (#808) * FX forward, GPTQ, Export (#809) * Forward pass with fx and pkv * Restore eval * Restore quantization * Experimental export * Fix GPTQ + Export * Fix 2GB ONNX export error * Fix gptq + speedup * Feat (offload/fx): better buffer/params + call_functional (#816) * Fix: typo to setting weight handlers * Feat (optimum): better call_function FX offload (#817) * Refactored per row quantization. JIT not working (#818) * Better structure for QDQ weights (#822) * Fix (export): flag for torch qcdq export (#823) * Setup: remove optimum folder (#825) * Add/fix comments * Fix llm example * Misc: pre-commit fix * Fix (graph/equalize): new transpose interface * Fix (examples/llm): no constant folding for group quant --------- Co-authored-by: Nick Fraser <[email protected]>

Feat (offload/fx): better buffer/params + call_functional

b008e18

Giuseppe5 merged commit 034168b into Xilinx:optimum Jan 30, 2024
22 checks passed

nickfraser pushed a commit that referenced this pull request Feb 1, 2024

Feat (offload/fx): better buffer/params + call_functional (#816)

09df0df

Giuseppe5 added a commit that referenced this pull request Feb 6, 2024

Feat (offload/fx): better buffer/params + call_functional (#816)

38dc8b3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat (offload/fx): better buffer/params + call_functional #816

Feat (offload/fx): better buffer/params + call_functional #816

Giuseppe5 commented Jan 30, 2024

Feat (offload/fx): better buffer/params + call_functional #816

Feat (offload/fx): better buffer/params + call_functional #816

Conversation

Giuseppe5 commented Jan 30, 2024