Skip to content

Intel® auto-round v0.3 Release

Compare
Choose a tag to compare
@wenhuach21 wenhuach21 released this 14 Aug 11:33
· 153 commits to main since this release
  • Highlights:

    • Broader Device Support:
      • Expanded support for CPU, HPU, and CUDA inference in the AutoRound format, resolving the 2-bit accuracy issue.
    • New Recipes and Model Releases:
    • Experimental Features:
      • Introduced several experimental features, including activation quantization and mx_fp, with promising outcomes with AutoRound.
    • Multimodal Model Support:
      • Extended capabilities for tuning and inference across several multimodal models.

    Lowlights:

    • Implemented support for low_cpu_mem_usage, auto_awq format, calibration dataset concatenation, and calibration datasets with chat templates.