-
Southern University of Science and Technology
- Shenzhen Guangdong, China
-
15:29
(UTC +08:00) - https://www.sustech.edu.cn/
Highlights
- Pro
Popular repositories Loading
-
-
tensorrtx
tensorrtx PublicForked from wang-xinyu/tensorrtx
Implementation of popular deep learning networks with TensorRT network definition API
C++
-
InfiniGen
InfiniGen PublicForked from snu-comparch/InfiniGen
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)
Python
-
H2O
H2O PublicForked from FMInference/H2O
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
Python
-
prompt-cache
prompt-cache PublicForked from yale-sys/prompt-cache
Modular and structured prompt caching for low-latency LLM inference
Python
-
If the problem persists, check the GitHub status page or contact support.