feat(gemini): trace google gemini Integration #10503

Yun-Kim · 2024-09-03T22:13:28Z

Adds support for tracing Google's Gemini Python SDK generate_content and generate_content_async methods (also indirectly traces send_message()).

This PR also moves _get_attr() convenience helper from the anthropic integration to the shared ddtrace.llmobs._utils.py module.

Features:

Traces generate_content/generate_content_async() and indirectly send_message/send_message_async() which uses generate_content under the hood
Handles streamed responses
Captures input/outputs (truncation included, function call/response messages included), token usage, input configuration options, and model name.

Testing:

The Gemini Python SDK does not use requests/httpx to submit requests but instead via GRPC. The vcrpy testing framework does not capture this so I wrote a mock client to store/return mock responses instead of sending actual requests to Google. This testing framework was inspired by the Gemini Python SDK's testing strategy.

Next steps (future PRs)

Wrap send_message() not to trace, but to attach a hash/session_id for all subsequent send_message calls
Integrate with LLMObs

Checklist

PR author has checked that all the criteria below are met
The PR description includes an overview of the change
The PR description articulates the motivation for the change
The change includes tests OR the PR description describes a testing strategy
The PR description notes risks associated with the change, if any
Newly-added code is easy to change
The change follows the library release note guidelines
The change includes or references documentation updates if necessary
Backport labels are set (if applicable)

Reviewer Checklist

Reviewer has checked that all the criteria below are met
Title is accurate
All changes are related to the pull request's stated goal
Avoids breaking API changes
Testing strategy adequately addresses listed risks
Newly-added code is easy to change
Release note makes sense to a user of the library
If necessary, author has acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment
Backport labels are set in a manner that is consistent with the release branch maintenance policy

github-actions · 2024-09-03T22:13:55Z

CODEOWNERS have been resolved as:

.riot/requirements/1e15a25.txt                                          @DataDog/apm-python
.riot/requirements/1f54e6b.txt                                          @DataDog/apm-python
.riot/requirements/e8247d6.txt                                          @DataDog/apm-python
.riot/requirements/ebe4ea5.txt                                          @DataDog/apm-python
ddtrace/contrib/google_generativeai/__init__.py                         @DataDog/ml-observability
ddtrace/contrib/internal/google_generativeai/_utils.py                  @DataDog/ml-observability
ddtrace/contrib/internal/google_generativeai/patch.py                   @DataDog/ml-observability
ddtrace/llmobs/_integrations/gemini.py                                  @DataDog/ml-observability
releasenotes/notes/feat-google-gemini-d5ee30b1d711bc08.yaml             @DataDog/apm-python
tests/contrib/google_generativeai/__init__.py                           @DataDog/ml-observability
tests/contrib/google_generativeai/conftest.py                           @DataDog/ml-observability
tests/contrib/google_generativeai/test_data/apple.jpg                   @DataDog/ml-observability
tests/contrib/google_generativeai/test_google_generativeai.py           @DataDog/ml-observability
tests/contrib/google_generativeai/test_google_generativeai_patch.py     @DataDog/ml-observability
tests/contrib/google_generativeai/utils.py                              @DataDog/ml-observability
tests/snapshots/tests.contrib.google_generativeai.test_google_generativeai.test_gemini_completion.json  @DataDog/apm-python
tests/snapshots/tests.contrib.google_generativeai.test_google_generativeai.test_gemini_completion_error.json  @DataDog/apm-python
tests/snapshots/tests.contrib.google_generativeai.test_google_generativeai.test_gemini_completion_image.json  @DataDog/apm-python
tests/snapshots/tests.contrib.google_generativeai.test_google_generativeai.test_gemini_completion_multiple_messages.json  @DataDog/apm-python
tests/snapshots/tests.contrib.google_generativeai.test_google_generativeai.test_gemini_completion_stream.json  @DataDog/apm-python
tests/snapshots/tests.contrib.google_generativeai.test_google_generativeai.test_gemini_completion_system_prompt.json  @DataDog/apm-python
tests/snapshots/tests.contrib.google_generativeai.test_google_generativeai.test_gemini_completion_tool_stream.json  @DataDog/apm-python
tests/snapshots/tests.contrib.google_generativeai.test_google_generativeai.test_gemini_tool_chat_completion.json  @DataDog/apm-python
tests/snapshots/tests.contrib.google_generativeai.test_google_generativeai.test_gemini_tool_completion.json  @DataDog/apm-python
.github/CODEOWNERS                                                      @DataDog/python-guild @DataDog/apm-core-python
.gitlab/tests/llmobs.yml                                                @DataDog/ml-observability
ddtrace/_monkey.py                                                      @DataDog/apm-core-python
ddtrace/contrib/internal/anthropic/_streaming.py                        @DataDog/ml-observability
ddtrace/contrib/internal/anthropic/patch.py                             @DataDog/ml-observability
ddtrace/contrib/internal/anthropic/utils.py                             @DataDog/ml-observability
ddtrace/llmobs/_integrations/__init__.py                                @DataDog/ml-observability
ddtrace/llmobs/_integrations/anthropic.py                               @DataDog/ml-observability
ddtrace/llmobs/_utils.py                                                @DataDog/ml-observability
docs/index.rst                                                          @DataDog/python-guild
docs/integrations.rst                                                   @DataDog/python-guild
docs/spelling_wordlist.txt                                              @DataDog/python-guild
riotfile.py                                                             @DataDog/apm-python
tests/.suitespec.json                                                   @DataDog/python-guild @DataDog/apm-core-python

ddtrace/contrib/internal/google_generativeai/patch.py

pr-commenter · 2024-09-03T22:48:07Z

Benchmarks

Benchmark execution time: 2024-09-11 20:02:53

Comparing candidate commit eecda47 in PR branch yunkim/gemini-integration with baseline commit 150beb7 in branch main.

Found 0 performance improvements and 0 performance regressions! Performance is the same for 353 metrics, 47 unstable metrics.

ddtrace/contrib/internal/google_generativeai/patch.py

ddtrace/contrib/internal/google_generativeai/_utils.py

datadog-dd-trace-py-rkomorn · 2024-09-09T22:05:15Z

Datadog Report

Branch report: yunkim/gemini-integration
Commit report: eecda47
Test service: dd-trace-py

✅ 0 Failed, 9717 Passed, 355 Skipped, 2h 9m 47.19s Total duration (1m 35.08s time saved)

erikayasuda

Just some nits and clarifying questions

releasenotes/notes/feat-google-gemini-d5ee30b1d711bc08.yaml

riotfile.py

docs/spelling_wordlist.txt

ddtrace/contrib/google_generativeai/__init__.py

ddtrace/contrib/internal/google_generativeai/patch.py

ddtrace/contrib/google_generativeai/__init__.py

ddtrace/contrib/internal/google_generativeai/_utils.py

ddtrace/contrib/internal/google_generativeai/patch.py

ddtrace/contrib/internal/google_generativeai/_utils.py

Adds support for tracing Google's Gemini Python SDK `generate_content` and `generate_content_async` methods (also indirectly traces `send_message()`). This PR also moves `_get_attr()` convenience helper from the anthropic integration to the shared `ddtrace.llmobs._utils.py` module. ### Features: - Traces `generate_content/generate_content_async()` and indirectly `send_message/send_message_async()` which uses `generate_content` under the hood - Handles streamed responses - Captures input/outputs (truncation included, function call/response messages included), token usage, input configuration options, and model name. ### Testing: The Gemini Python SDK does not use requests/httpx to submit requests but instead via GRPC. The vcrpy testing framework does not capture this so I wrote a mock client to store/return mock responses instead of sending actual requests to Google. This testing framework was inspired by the [Gemini Python SDK's testing strategy](https://github.com/google-gemini/generative-ai-python/blob/9407dcde5666ba58831227d3acf8bd0e3f3b4f81/tests/test_generative_models.py#L41). ### Next steps (future PRs) - Wrap `send_message()` not to trace, but to attach a hash/session_id for all subsequent send_message calls - Integrate with LLMObs ## Checklist - [x] PR author has checked that all the criteria below are met - The PR description includes an overview of the change - The PR description articulates the motivation for the change - The change includes tests OR the PR description describes a testing strategy - The PR description notes risks associated with the change, if any - Newly-added code is easy to change - The change follows the [library release note guidelines](https://ddtrace.readthedocs.io/en/stable/releasenotes.html) - The change includes or references documentation updates if necessary - Backport labels are set (if [applicable](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting)) ## Reviewer Checklist - [x] Reviewer has checked that all the criteria below are met - Title is accurate - All changes are related to the pull request's stated goal - Avoids breaking [API](https://ddtrace.readthedocs.io/en/stable/versioning.html#interfaces) changes - Testing strategy adequately addresses listed risks - Newly-added code is easy to change - Release note makes sense to a user of the library - If necessary, author has acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment - Backport labels are set in a manner that is consistent with the [release branch maintenance policy](https://ddtrace.readthedocs.io/en/latest/contributing.html#backporting) (cherry picked from commit c0fd013)

WIP Gemini Integration

7591b5d

datadog-datadog-prod-us1 bot reviewed Sep 3, 2024

View reviewed changes

Support tracing streamed responses

f4c62a5

datadog-datadog-prod-us1 bot reviewed Sep 4, 2024

View reviewed changes

Yun-Kim and others added 5 commits September 4, 2024 17:59

fmt, refactor nesting

7aed48b

fmt, release note draft

c8e8c57

Merge branch 'main' into yunkim/gemini-integration

77ee9f5

Extract api key, model name

a95f2a6

Add tests

364bc73

datadog-datadog-prod-us1 bot reviewed Sep 9, 2024

View reviewed changes

ddtrace/contrib/internal/google_generativeai/_utils.py Outdated Show resolved Hide resolved

Yun-Kim added 6 commits September 9, 2024 16:49

fmt

ccc3349

Docs

4344a6e

More docs

8531633

fmt

d449f01

Suitespec

4c6003a

Remove from gitlab, add to circleci

902611a

Yun-Kim and others added 6 commits September 9, 2024 18:07

spellcheck

648a3ae

Migrate tests back to gitlab

4b79cbf

fix spelling

e622d35

Codeowners

1791545

Merge branch 'main' into yunkim/gemini-integration

c1ba316

Move suite to llmobs gitlab

a6ab1dd

Yun-Kim marked this pull request as ready for review September 10, 2024 15:42

Yun-Kim requested review from a team as code owners September 10, 2024 15:42

Yun-Kim requested a review from a team as a code owner September 10, 2024 15:42

Yun-Kim requested review from tabgok and erikayasuda September 10, 2024 15:42

erikayasuda approved these changes Sep 10, 2024

View reviewed changes

Yun-Kim added 2 commits September 10, 2024 18:00

Address comments, fix snapshots

91f8ee3

fix snapshots

f2c3331

lievan reviewed Sep 10, 2024

View reviewed changes

yahya-mouman reviewed Sep 11, 2024

View reviewed changes

ddtrace/contrib/internal/google_generativeai/_utils.py Outdated Show resolved Hide resolved

yahya-mouman reviewed Sep 11, 2024

View reviewed changes

ddtrace/contrib/internal/google_generativeai/patch.py Show resolved Hide resolved

Yun-Kim added the backport 2.13 label Sep 11, 2024

sabrenner reviewed Sep 11, 2024

View reviewed changes

ddtrace/contrib/internal/google_generativeai/_utils.py Show resolved Hide resolved

fmt, address PR comments

d7b73bb

datadog-datadog-prod-us1 bot reviewed Sep 11, 2024

View reviewed changes

ddtrace/contrib/internal/google_generativeai/_utils.py Outdated Show resolved Hide resolved

Yun-Kim added 3 commits September 11, 2024 14:45

avoid silent error

ca44592

fmt

7c57b59

typing

eecda47

lievan approved these changes Sep 11, 2024

View reviewed changes

Yun-Kim merged commit c0fd013 into main Sep 11, 2024
506 of 507 checks passed

Yun-Kim deleted the yunkim/gemini-integration branch September 11, 2024 20:45

github-actions bot mentioned this pull request Sep 11, 2024

feat(gemini): trace google gemini Integration [backport 2.13] #10641

Closed

2 tasks

Yun-Kim mentioned this pull request Sep 16, 2024

Need the option to mask the input and output of the LLM API in Datadog LLM observability. #10517

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(gemini): trace google gemini Integration #10503

feat(gemini): trace google gemini Integration #10503

Yun-Kim commented Sep 3, 2024 •

edited

Loading

github-actions bot commented Sep 3, 2024 •

edited

Loading

pr-commenter bot commented Sep 3, 2024 •

edited

Loading

datadog-dd-trace-py-rkomorn bot commented Sep 9, 2024 •

edited

Loading

erikayasuda left a comment

feat(gemini): trace google gemini Integration #10503

feat(gemini): trace google gemini Integration #10503

Conversation

Yun-Kim commented Sep 3, 2024 • edited Loading

Features:

Testing:

Next steps (future PRs)

Checklist

Reviewer Checklist

github-actions bot commented Sep 3, 2024 • edited Loading

pr-commenter bot commented Sep 3, 2024 • edited Loading

Benchmarks

datadog-dd-trace-py-rkomorn bot commented Sep 9, 2024 • edited Loading

Datadog Report

erikayasuda left a comment

Choose a reason for hiding this comment

Yun-Kim commented Sep 3, 2024 •

edited

Loading

github-actions bot commented Sep 3, 2024 •

edited

Loading

pr-commenter bot commented Sep 3, 2024 •

edited

Loading

datadog-dd-trace-py-rkomorn bot commented Sep 9, 2024 •

edited

Loading