diff --git a/README.md b/README.md index 9e6f7339..2ee04d4e 100644 --- a/README.md +++ b/README.md @@ -8,45 +8,11 @@ This is enabled by LLM model compression technique: [SmoothQuant](https://github Feel free to check out our [slides](assets/slides.pdf) for more details! -### Demo on an NVIDIA GeForce RTX 4070 laptop: - - - - - - - - - -
- chat_demo_gpu - - coding_demo_gpu -
- LLaMA Chat - - Code LLaMA -
+### Code LLaMA Demo on an NVIDIA GeForce RTX 4070 laptop: +![coding_demo_gpu](assets/figures/coding_demo_gpu.gif) -### Demo on an Apple MacBook Pro (M1, 2021): - - - - - - - - - -
- chat_demo_m1 - - coding_demo_m1 -
- LLaMA Chat - - Code LLaMA -
+### LLaMA Chat Demo on an Apple MacBook Pro (M1, 2021): +![chat_demo_m1](assets/figures/chat_demo_m1.gif) ## Overview