assistant: Fix safety settings for google_ai #20178

Open
FedorBuben wants to merge 1 commit into main

Conversation

FedorBuben commented:

The Gemini API's safety filter sometimes blocks code, classifying it as dangerous content, and this happens quite often. The bug has been described in several comments under #18561. Here is an example prompt for Gemini Flash.
In Zed:
[screenshot: error_prompt_zed]
In Google AI Studio:
[screenshot: error_prompt_google]

Disabling the DangerousContent category in the filter solves the problem.
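For context, here is a rough sketch of the response shape that signals this kind of safety block, based on the public Gemini REST API reference. The struct names below are illustrative only and are not Zed's google_ai crate.

```rust
// A rough sketch (not Zed's google_ai crate) of the response fields that signal
// a safety block, based on the public Gemini REST API reference.
// Requires the `serde` (with derive) and `serde_json` crates.
use serde::Deserialize;

#[derive(Debug, Deserialize)]
#[serde(rename_all = "camelCase")]
struct SafetyRating {
    category: String,    // e.g. "HARM_CATEGORY_DANGEROUS_CONTENT"
    probability: String, // e.g. "NEGLIGIBLE", "LOW", "MEDIUM", "HIGH"
}

#[derive(Debug, Deserialize)]
#[serde(rename_all = "camelCase")]
struct Candidate {
    finish_reason: Option<String>, // "SAFETY" when the candidate was filtered out
    safety_ratings: Option<Vec<SafetyRating>>,
}

#[derive(Debug, Deserialize)]
#[serde(rename_all = "camelCase")]
struct GenerateContentResponse {
    candidates: Option<Vec<Candidate>>,
}

fn main() {
    // The kind of payload a client sees when the filter trips on a prompt like
    // the one in the screenshots above (illustrative, hand-written JSON).
    let raw = r#"{
        "candidates": [{
            "finishReason": "SAFETY",
            "safetyRatings": [
                { "category": "HARM_CATEGORY_DANGEROUS_CONTENT", "probability": "MEDIUM" }
            ]
        }]
    }"#;
    let response: GenerateContentResponse = serde_json::from_str(raw).unwrap();
    println!("{response:?}");
}
```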

Release Notes:

  • N/A


cla-bot bot commented Nov 4, 2024

We require contributors to sign our Contributor License Agreement, and we don't have @FedorBuben on file. You can sign our CLA at https://zed.dev/cla. Once you've signed, post a comment here that says '@cla-bot check'.

```diff
-                safety_settings: None,
+                safety_settings: Some(vec![google_ai::SafetySetting {
+                    category: google_ai::HarmCategory::DangerousContent,
+                    threshold: google_ai::HarmBlockThreshold::BlockNone,
+                }]),
```
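As a sanity check, this is a minimal sketch of the JSON this setting should produce in the generateContent request body, per the public Gemini REST API. Struct and field names are illustrative, not the actual google_ai crate definitions.

```rust
// Minimal sketch of the wire format this change should produce in the
// generateContent request, per the public Gemini REST API. Struct and field
// names here are illustrative, not the actual google_ai crate definitions.
// Requires the `serde` (with derive) and `serde_json` crates.
use serde::Serialize;

#[derive(Serialize)]
struct SafetySetting {
    category: &'static str,
    threshold: &'static str,
}

fn main() {
    let safety_settings = vec![SafetySetting {
        category: "HARM_CATEGORY_DANGEROUS_CONTENT",
        threshold: "BLOCK_NONE",
    }];
    // Prints: [{"category":"HARM_CATEGORY_DANGEROUS_CONTENT","threshold":"BLOCK_NONE"}]
    println!("{}", serde_json::to_string(&safety_settings).unwrap());
}
```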
A Zed team member commented:

Not blocking any dangerous content seems like the wrong choice for a default.

FedorBuben (author) replied:

Should there be an option for adjusting the filter settings?

HarmCategory::DangerousContent is one of the five adjustable categories; with this change, the filter still applies to content in all of the other categories.

From Gemini API Docs:

The Gemini API's adjustable safety filters cover the following categories:
Category            Description
Harassment          Negative or harmful comments targeting identity and/or protected attributes.
Hate speech         Content that is rude, disrespectful, or profane.
Sexually explicit   Contains references to sexual acts or other lewd content.
Dangerous           Promotes, facilitates, or encourages harmful acts.
Civic integrity     Election-related queries.
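To make the mapping concrete, here is a hypothetical sketch of how those five categories and the block thresholds correspond to the Gemini REST API enum values. The Rust names are illustrative and may not match the actual google_ai crate.

```rust
// Hypothetical sketch of how the five adjustable categories and the block
// thresholds map to Gemini REST API enum values. The Rust names are
// illustrative and may not match the actual google_ai crate.
// Requires the `serde` (with derive) and `serde_json` crates.
use serde::Serialize;

#[allow(dead_code)]
#[derive(Serialize)]
enum HarmCategory {
    #[serde(rename = "HARM_CATEGORY_HARASSMENT")]
    Harassment,
    #[serde(rename = "HARM_CATEGORY_HATE_SPEECH")]
    HateSpeech,
    #[serde(rename = "HARM_CATEGORY_SEXUALLY_EXPLICIT")]
    SexuallyExplicit,
    #[serde(rename = "HARM_CATEGORY_DANGEROUS_CONTENT")]
    DangerousContent,
    #[serde(rename = "HARM_CATEGORY_CIVIC_INTEGRITY")]
    CivicIntegrity,
}

#[allow(dead_code)]
#[derive(Serialize)]
enum HarmBlockThreshold {
    #[serde(rename = "BLOCK_NONE")]
    BlockNone,
    #[serde(rename = "BLOCK_ONLY_HIGH")]
    BlockOnlyHigh,
    #[serde(rename = "BLOCK_MEDIUM_AND_ABOVE")]
    BlockMediumAndAbove,
    #[serde(rename = "BLOCK_LOW_AND_ABOVE")]
    BlockLowAndAbove,
}

#[derive(Serialize)]
struct SafetySetting {
    category: HarmCategory,
    threshold: HarmBlockThreshold,
}

fn main() {
    // If the default were made configurable, a milder choice than BLOCK_NONE
    // could be to relax only the Dangerous category to BLOCK_ONLY_HIGH.
    let settings = vec![SafetySetting {
        category: HarmCategory::DangerousContent,
        threshold: HarmBlockThreshold::BlockOnlyHigh,
    }];
    // Prints: [{"category":"HARM_CATEGORY_DANGEROUS_CONTENT","threshold":"BLOCK_ONLY_HIGH"}]
    println!("{}", serde_json::to_string(&settings).unwrap());
}
```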

FedorBuben (author) commented:

@cla-bot check

cla-bot added the cla-signed label (The user has signed the Contributor License Agreement) on Nov 4, 2024

cla-bot bot commented Nov 4, 2024

The cla-bot has been summoned, and re-checked this pull request!
