Skip to content

[ Kernel ] FP8 Dynamic-Per-Token Quant Kernel#6511

Merged
robertgshaw2-redhat merged 14 commits intovllm-project:mainfrom neuralmagic:varun/dynamic-per-token-fp8Jul 18, 2024

Commits

Commits on Jul 16, 2024

Commits on Jul 17, 2024