A fork from FasterTransformer Backend. See this repo for documentation on usage.
-
Notifications
You must be signed in to change notification settings - Fork 0
LLM model serving
License
melli-labs/fastertransformer_backend
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
LLM model serving
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published
Languages
- Python 71.8%
- C++ 12.8%
- Shell 11.9%
- CMake 3.1%
- Dockerfile 0.4%