Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: implement inference server by using vllm #624

Merged
merged 7 commits into from
Oct 24, 2024
Merged

Commits on Oct 22, 2024

  1. feat: implement inference server by using vllm

    provide openai-compatible serving api
    
    Signed-off-by: zhuangqh <[email protected]>
    zhuangqh committed Oct 22, 2024
    Configuration menu
    Copy the full SHA
    d594182 View commit details
    Browse the repository at this point in the history
  2. fix test

    zhuangqh committed Oct 22, 2024
    Configuration menu
    Copy the full SHA
    68f19d3 View commit details
    Browse the repository at this point in the history
  3. fix

    zhuangqh committed Oct 22, 2024
    Configuration menu
    Copy the full SHA
    38bf077 View commit details
    Browse the repository at this point in the history

Commits on Oct 23, 2024

  1. add env to pipeline

    Signed-off-by: jerryzhuang <[email protected]>
    zhuangqh committed Oct 23, 2024
    Configuration menu
    Copy the full SHA
    3c39983 View commit details
    Browse the repository at this point in the history
  2. Merge branch 'main' into zhuangqh/support-vllm

    Signed-off-by: jerryzhuang <[email protected]>
    zhuangqh authored Oct 23, 2024
    Configuration menu
    Copy the full SHA
    1c376ee View commit details
    Browse the repository at this point in the history
  3. run test in env

    Signed-off-by: jerryzhuang <[email protected]>
    zhuangqh committed Oct 23, 2024
    Configuration menu
    Copy the full SHA
    a228847 View commit details
    Browse the repository at this point in the history
  4. fix

    Signed-off-by: jerryzhuang <[email protected]>
    zhuangqh committed Oct 23, 2024
    Configuration menu
    Copy the full SHA
    960e18a View commit details
    Browse the repository at this point in the history