diff --git a/index.html b/index.html index 845527ce..ba4efe1a 100644 --- a/index.html +++ b/index.html @@ -79,10 +79,20 @@
This is enabled by LLM model compression technique: SmoothQuant and AWQ (Activation-aware Weight Quantization), co-designed with TinyChatEngine that implements the compressed low-precision model.
+
LLaMA Chat | Code LLaMA |
+
LLaMA Chat | Code LLaMA |
Feel free to check out our slides for more details!