Commit

chore: revert llama.cpp to b3029
sangjanai committed Jun 25, 2024
1 parent 5124a55 commit 0f645e8
Showing 3 changed files with 3 additions and 6 deletions.
1 change: 1 addition & 0 deletions .github/workflows/nightly-build.yml
@@ -58,6 +58,7 @@ jobs:
 needs: [create-draft-release]
 timeout-minutes: 60
 strategy:
+fail-fast: false
 matrix:
 include:
 - os: "linux"
2 changes: 1 addition & 1 deletion llama.cpp
Submodule llama.cpp updated 475 files
6 changes: 1 addition & 5 deletions src/llama_server_context.cc
@@ -264,11 +264,7 @@ json LlamaServerContext::GetModelProps() {
 }

 int LlamaServerContext::RequestCompletion(json data, bool infill,
-bool embedding, int multitask_id) {
-// From this commit: 'llama : allow pooled embeddings on any model (#7477)'
-// we need to explicitly set embedding flad for each request
-llama_set_embeddings(ctx, embedding);
-
+bool embedding, int multitask_id) {
 TaskServer task;
 task.id = id_gen++;
 task.target_id = 0;