[Bugfix] Fix the default value for temperature in ChatCompletionRequest #11219
Conversation
👋 Hi! Thank you for contributing to the vLLM project. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. 🚀
This modifies the default behavior of the vLLM OpenAI Compatible Server. Considering that temperature is very likely to be customized by users, I believe the impact is minimal.
Some tests failed, but they don't seem to be caused by my PR. @simon-mo
Out of curiosity, why was this change made? 0.7 has been the default temperature for a long time, and changing from 0.7 to 1.0 is not a small difference in terms of behavior. In particular, it may impact the quality of tool calls that use auto tool choice without a manually specified temperature.
The default temperature for offline inference (the LLM class) is 1.0, the same as OpenAI's official implementation. It's also the default in most frameworks. I didn't check past PRs to see why the OpenAI Compatible Server uses 0.7 as the default; it's a bit odd. Also, as I mentioned before, in temperature-sensitive cases it's common for users to set their own temperature, so this shouldn't affect too many use cases. Maybe we should note this change in the OpenAI Compatible Server docs? @simon-mo
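For workflows that must not drift when server-side defaults change, clients can pin the temperature explicitly on every request. A minimal sketch using the official `openai` Python client against a vLLM OpenAI Compatible Server (the base URL, API key, and model name below are placeholders, not values from this PR):

```python
from openai import OpenAI

# Point the client at a locally running vLLM OpenAI Compatible Server.
# Base URL, API key, and model name are placeholder assumptions.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

response = client.chat.completions.create(
    model="meta-llama/Llama-3.1-8B-Instruct",
    messages=[{"role": "user", "content": "Hello!"}],
    # Pin temperature explicitly so behavior does not depend on the
    # server-side default (0.7 before this PR, 1.0 after).
    temperature=0.7,
)
print(response.choices[0].message.content)
```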
I actually don't recall why it is 0.7, given that both Hugging Face and OpenAI have it at 1.0. Are we aware of any breakage?
This value has been 0.7 since it was introduced; see #116. What's strange is that ChatCompletionRequest uses 0.7 while CompletionRequest uses 1.0. I searched Google for the source of 0.7, and it seems to be because OpenAI chatbots like ChatGPT use it (though there's no solid proof). The default for OpenAI's Chat Completion API is 1.0, and I think aligning with that makes sense.
Aligning with the default behavior 100% makes sense, but we also want to avoid breaking workflows that depend on the existing defaults. Under semver this should be at least a minor version bump, if not a major one given the potentially breaking behavior change, although it's definitely fuzzy: the change is syntactically backwards-compatible, but arguably not semantically.
FIX #10930
Set the default value for `temperature` in `ChatCompletionRequest` to 1.0.

References:
- vllm/vllm/sampling_params.py, line 176 at 571da8f
- https://platform.openai.com/docs/api-reference/chat
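For context, the change itself amounts to updating a Pydantic field default on the request model. A minimal sketch of the resulting shape (class and field names match vLLM's request models in `vllm/entrypoints/openai/protocol.py`; all surrounding fields are omitted):

```python
from typing import Optional

from pydantic import BaseModel


class ChatCompletionRequest(BaseModel):
    # Before this PR: temperature: Optional[float] = 0.7
    # Now aligned with CompletionRequest and the OpenAI Chat Completion API.
    temperature: Optional[float] = 1.0


class CompletionRequest(BaseModel):
    temperature: Optional[float] = 1.0  # unchanged; was already 1.0
```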