Add direct client implementation #15
Conversation
oh man this is beautiful <3
from llama_stack.distribution.datatypes import StackRunConfig
from llama_stack.distribution.distribution import get_provider_registry
from llama_stack.distribution.resolver import resolve_impls
Should we add `llama-stack` as a dependency for the `llama-stack-client` package?
Nope, it should be the reverse, as we talked about: this code should always be exercised when the person already has `llama-stack` in their environment (as a library or via pip).
Hmm, should this class `LlamaStackDirectClient` be inside the `llama-stack` repo instead of the `llama-stack-client-python` repo?
- Users who want to use llama-stack as a library install the `llama-stack` package (which depends on the `llama-stack-client` package). They are able to use `LlamaStackDirectClient`.
- Users who just install the `llama-stack-client` package cannot use `LlamaStackDirectClient` without also installing `llama-stack` (a minimal guard sketch follows this list).
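For illustration, a minimal sketch of how that second scenario could fail gracefully; the guard function below is hypothetical and only shows the idea of treating `llama-stack` as an optional dependency inside `llama-stack-client`:

```python
import importlib.util


def _require_llama_stack() -> None:
    """Hypothetical guard: LlamaStackDirectClient would call this before
    touching any llama_stack internals, so a plain llama-stack-client
    install fails with a clear message instead of a bare ImportError."""
    if importlib.util.find_spec("llama_stack") is None:
        raise ImportError(
            "LlamaStackDirectClient requires the `llama-stack` package; "
            "install it with `pip install llama-stack`."
        )
```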
@yanxi0830 yeah I think that makes sense to me actually.
The Llama Stack primarily operates as a client/server model. However, there are scenarios where hosting a distribution is cumbersome (e.g., testing, Jupyter notebooks), making it more desirable to use the Llama Stack as a library.
This PR introduces a clever hack that extends the Stainless Python client: it intercepts GET/POST requests that would otherwise go over HTTP and uses reflection to deserialize and route them directly to their in-process implementations.
Is this roundabout serialization the most efficient method? Certainly not. But the convenience of having a drop-in solution is significant, and the overhead is negligible compared to GPU latency.
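To make the intercept-and-route idea concrete, here is a self-contained sketch of the pattern; the class names, route table, and toy implementation below are stand-ins for illustration only, not the actual PR code, which builds its routing from the resolved `llama_stack` implementations (`StackRunConfig`, `get_provider_registry`, `resolve_impls` in the diff above):

```python
from typing import Any, Callable, Dict


class ToyInference:
    """Stand-in for a resolved in-process provider implementation."""

    def chat_completion(self, model: str, messages: list) -> Dict[str, Any]:
        return {"model": model, "content": f"echo: {messages[-1]['content']}"}


class DirectTransport:
    """Instead of serializing a request onto the wire, map the route to a
    method on an in-memory implementation and invoke it via reflection."""

    def __init__(self, impls: Dict[str, Any]):
        # Route table: HTTP path -> (implementation object, method name).
        self.routes: Dict[str, tuple] = {
            "/inference/chat_completion": (impls["inference"], "chat_completion"),
        }

    def post(self, path: str, body: Dict[str, Any]) -> Dict[str, Any]:
        impl, method_name = self.routes[path]
        fn: Callable = getattr(impl, method_name)  # reflection instead of HTTP
        return fn(**body)


if __name__ == "__main__":
    transport = DirectTransport({"inference": ToyInference()})
    print(
        transport.post(
            "/inference/chat_completion",
            {"model": "llama-3", "messages": [{"role": "user", "content": "hi"}]},
        )
    )
```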