Change the repository type filter
All
Repositories list
40 repositories
- LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
quant_horizon
Publicllmc
Public[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".mtc-token-healing
Publicgeneral-sam-py
Publicgeneral-sam
PublicEasyLLM
PublicDeepSpeed
Publicopencompass
PublicInternVL
PublicOmniBal
PublicTFMQ-DM
Public[CVPR 2024 Highlight] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models".L2_Compression
Publicmsbench
PublicMQBench
PublicFCPTS
Public templatestatecs
Publicgreedy-tokenizer
PublicQLLM
Public[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models"Dipoorlet
Publicawesome-lm-system
PublicLPCV_2023_solution
PublicOutlier_Suppression_Plus
PublicUP_LPCV2023_Plugin
PublicChatGLM-6B
Publicpyvlova
Publicsystemnoise_web
PublicNART
PublicUnited-Perception
Public