Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DO NOT MERGE Verify b3499 #154

Closed
wants to merge 1 commit into from
Closed
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
41 changes: 1 addition & 40 deletions .github/workflows/llama.yml
Original file line number Diff line number Diff line change
Expand Up @@ -26,7 +26,7 @@ jobs:
matrix:
runner: [ubuntu-20.04, macos-m1]
wasmedge: ["0.13.5", "0.14.0"]
plugin: [wasi_nn-ggml]
plugin: [wasi_nn-ggml-b3499]
job:
- name: "Tiny Llama"
run: |
Expand Down Expand Up @@ -288,45 +288,6 @@ jobs:
default \
$'[INST] <<SYS>>\nYou are a helpful, respectful and honest assistant. Always answer as helpfully as possible, while being safe. Your answers should not include any harmful, unethical, racist, sexist, toxic, dangerous, or illegal content. Please ensure that your responses are socially unbiased and positive in nature. If a question does not make any sense, or is not factually coherent, explain why instead of answering something not correct. If you do not know the answer to a question, please do not share false information.\n<</SYS>>\nWhat is the capital of Japan?[/INST]'

- name: Phi 3 Mini 4k
run: |
test -f ~/.wasmedge/env && source ~/.wasmedge/env
cd wasmedge-ggml/test/phi-3
curl -LO https://huggingface.co/microsoft/Phi-3-mini-4k-instruct-gguf/resolve/main/Phi-3-mini-4k-instruct-q4.gguf
cargo build --target wasm32-wasi --release
time wasmedge --dir .:. \
--env n_gpu_layers="$NGL" \
--nn-preload default:GGML:AUTO:Phi-3-mini-4k-instruct-q4.gguf \
target/wasm32-wasi/release/wasmedge-ggml-phi-3.wasm \
default \
$'<|user|>\nWhat is the capital of Japan?<|end|>\n<|assistant|>'

- name: Phi 3 Mini 128k
run: |
test -f ~/.wasmedge/env && source ~/.wasmedge/env
cd wasmedge-ggml/test/phi-3
curl -LO https://huggingface.co/second-state/Phi-3-mini-128k-instruct-GGUF/resolve/main/Phi-3-mini-128k-instruct-Q5_K_M.gguf
cargo build --target wasm32-wasi --release
time wasmedge --dir .:. \
--env n_gpu_layers="$NGL" \
--nn-preload default:GGML:AUTO:Phi-3-mini-128k-instruct-Q5_K_M.gguf \
target/wasm32-wasi/release/wasmedge-ggml-phi-3.wasm \
default \
$'<|user|>\nWhat is the capital of Japan?<|end|>\n<|assistant|>'

- name: Phi 3 Medium 4k
run: |
test -f ~/.wasmedge/env && source ~/.wasmedge/env
cd wasmedge-ggml/test/phi-3
curl -LO https://huggingface.co/second-state/Phi-3-medium-4k-instruct-GGUF/resolve/main/Phi-3-medium-4k-instruct-Q5_K_M.gguf
cargo build --target wasm32-wasi --release
time wasmedge --dir .:. \
--env n_gpu_layers="$NGL" \
--nn-preload default:GGML:AUTO:Phi-3-medium-4k-instruct-Q5_K_M.gguf \
target/wasm32-wasi/release/wasmedge-ggml-phi-3.wasm \
default \
$'<|user|>\nWhat is the capital of Japan?<|end|>\n<|assistant|>'

- name: JSON Schema
run: |
test -f ~/.wasmedge/env && source ~/.wasmedge/env
Expand Down
Loading