Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Examples] Add TensorRT-LLM example (end-to-end) #1753

Open
Tracked by #1735 ...
peterschmidt85 opened this issue Oct 1, 2024 · 5 comments
Open
Tracked by #1735 ...

[Examples] Add TensorRT-LLM example (end-to-end) #1753

peterschmidt85 opened this issue Oct 1, 2024 · 5 comments

Comments

@peterschmidt85
Copy link
Contributor

No description provided.

@bikash119
Copy link

@peterschmidt85 : May I take this up?

@peterschmidt85
Copy link
Contributor Author

@peterschmidt85 : May I take this up?

Only if you know how. It's not an easy one. TensorRT-LLM is one of the most complicated stacks.
The example should show it end-to-end: how to build a model, and serve it.

@bikash119
Copy link

Thank you @peterschmidt85 for giving me the heads up. Will give it a try and keep you posted on how it goes.
May I request you to point to any example on dstack which has similar set of task like

  • build the model
  • serve it

@peterschmidt85
Copy link
Contributor Author

Thank you @peterschmidt85 for giving me the heads up. Will give it a try and keep you posted on how it goes. May I request you to point to any example on dstack which has similar set of task like

  • build the model
  • serve it

I would invite you to explore more what dstack is, how it works, and of course what TensorRT-LLM is and how it works.

@bikash119
Copy link

Thank you @peterschmidt85.

@peterschmidt85 peterschmidt85 mentioned this issue Oct 3, 2024
46 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants