[ENH] Add support for Ollama assistants #376

smokestacklightnin · 2024-03-23T00:00:14Z

This PR adds support for Ollama assistants.

ragna/assistants/_ollama.py

ragna/assistants/_api.py

…refactor

Other models will be added in a later step

Also fix typo in `ragna.assistants.__init__`

pmeier · 2024-05-28T08:01:11Z

ragna/assistants/_ollama.py

+                    if "error" in json_data:
+                        raise RagnaException(json_data["error"])
+                    if not json_data["done"]:
+                        yield cast(str, json_data["message"]["content"])


@nenb This violates "same response schema" part of #425 (comment) 😖

pmeier · 2024-05-30T18:36:25Z

@nenb As follow-up to #376 (comment), I've refactored the logic we merged in #425 a little to make it even more flexible.

We now have an HttpApiCaller base class that handles the different streaming protocols for us. It is instantiated by the base class of the builtin assistants and parametrized by the streaming protocol, which is set as class constant.
This new HttpApiCaller also handles JSON streaming for Google assistants now and thus abstracts the complexity away.
I've renamed the OpenaiCompliantHttpApiAssistant to OpenaiLikeHttpApiAssistant, because the term "compliant" is even less meaningless when adding Ollama into the mix
Each subclass of OpenaiLikeHttpApiAssistant now has to implement answer itself given that their is no standard way of handling it. The only common functionality is the call to the API, which so far seems to be rather constant across all models.

nenb

This looks fine to me.

I've pulled the branch locally and confirmed that it works for OpenAI models and for a Llamafile that I had locally.

Co-authored-by: Philip Meier <[email protected]>

smokestacklightnin requested a review from pmeier March 30, 2024 08:40

smokestacklightnin force-pushed the assistants/ollama/basic-functionality branch 2 times, most recently from 1c7ffa6 to 85030be Compare April 2, 2024 03:00

This was referenced Apr 2, 2024

Update Anthropic assistants #380

Merged

Refactor ApiAssistant to AuthenticatedApiAssistant and UnauthenticatedApiAssistant #381

Closed

pmeier reviewed Apr 2, 2024

View reviewed changes

smokestacklightnin added 17 commits April 6, 2024 21:08

Added almost empty OllamaApiAssistant

868febb

Add _make_system_content method

a0c8499

Add preliminary (untested) _call_api method

ae0720a

Using JSONL for responses

eeba8a1

Add kwargs for compatibility and TODO messages to remove in a future …

420c9e8

…refactor

Add Ollama gemma:2b model

20fb764

Other models will be added in a later step

Fix OllamaApiAssistant._call_api signature by adding types

906f2a1

Add temperature option

50f19a1

Add _assert_api_call_is_success()

0ce77d8

Add answer()

7bbbafb

Add __init__()

301c815

Set url through initializer or environment variable

a4a2608

Add is_available()

14e14c5

Rename Gemma2B to OllamaGemma2B

0cae498

Remove unnecessary else clause

d02b501

Handle error in http response

1ce1982

Remove unnecessary _call_api() abstraction

fd5c34b

smokestacklightnin force-pushed the assistants/ollama/basic-functionality branch from 85030be to fd5c34b Compare April 10, 2024 06:41

Fix typing errors

6f2055c

smokestacklightnin self-assigned this Apr 10, 2024

smokestacklightnin added pr-status: in-progress 🏗️ type: enhancement 💅 New feature or request labels Apr 10, 2024

smokestacklightnin added 2 commits April 10, 2024 00:11

Add docstring

e5e8e30

Add OllamaPhi2

72161a0

smokestacklightnin added 6 commits April 10, 2024 16:38

Remove unnecessary exclusion from test

c5e79e0

Simplify check for availability of Ollama model

f6edb19

Simplify call to superclass is_available()

6460d1e

Correct incorrect grammar on system instruction

c9b2e01

Add several Ollama models

086ce23

Order alphabetically

9724dd6

smokestacklightnin requested a review from pmeier April 12, 2024 00:41

smokestacklightnin marked this pull request as ready for review April 12, 2024 00:41

Add Ollama to listings in docs

9bebbb0

Also fix typo in `ragna.assistants.__init__`

smokestacklightnin added pr-status: needs review 👀 pr-status: merge ready 💪 and removed pr-status: in-progress 🏗️ labels Apr 12, 2024

This was referenced May 27, 2024

Add OpenAI API compatible assistant #424

Closed

refactor assistant streaming and create OpenAI compliant base class #425

Merged

Merge branch 'main' into assistants/ollama/basic-functionality

bc211d3

pmeier reviewed May 28, 2024

View reviewed changes

pmeier added 3 commits May 28, 2024 10:45

refactor streaming again

4a737e0

more

3e2a682

fix

5a4d89d

pmeier requested a review from nenb May 30, 2024 18:30

cleanup

9de0920

pmeier linked an issue May 30, 2024 that may be closed by this pull request

ApiAssistant abstract base class for assistants that don't require an API key #375

Closed

pmeier added dev: components and removed pr-status: needs review 👀 pr-status: merge ready 💪 labels May 30, 2024

nenb approved these changes Jun 7, 2024

View reviewed changes

pmeier merged commit da1bcc2 into Quansight:main Jun 10, 2024
10 checks passed

pmeier deleted the assistants/ollama/basic-functionality branch June 10, 2024 09:25

blakerosenthal pushed a commit that referenced this pull request Jul 17, 2024

[ENH] Add support for Ollama assistants (#376)

eb486f0

Co-authored-by: Philip Meier <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ENH] Add support for Ollama assistants #376

[ENH] Add support for Ollama assistants #376

smokestacklightnin commented Mar 23, 2024 •

edited

Loading

pmeier May 28, 2024 •

edited

Loading

pmeier commented May 30, 2024

nenb left a comment

[ENH] Add support for Ollama assistants #376

[ENH] Add support for Ollama assistants #376

Conversation

smokestacklightnin commented Mar 23, 2024 • edited Loading

pmeier May 28, 2024 • edited Loading

Choose a reason for hiding this comment

pmeier commented May 30, 2024

nenb left a comment

Choose a reason for hiding this comment

smokestacklightnin commented Mar 23, 2024 •

edited

Loading

pmeier May 28, 2024 •

edited

Loading