
Update llama2.c converter to read vocab and write models in GGUF format #2751

Merged (8 commits) on Aug 27, 2023

Conversation

@ochafik (Collaborator) commented Aug 23, 2023

This updates examples/convert-llama2c-to-ggml (@byte-6174) to be fully GGUF-friendly (cf. #2398), as a follow-up to #2685:

  • Reinstate vocabulary import from a llama.cpp model (now in GGUF format)
  • Directly output GGUF instead of GGJTv3
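For reference, GGUF files open with a small fixed preamble, which is what "directly output GGUF" targets. Below is a minimal sketch of validating that preamble in Python, based on the published GGUF spec (4-byte magic `GGUF`, uint32 version, uint64 tensor count, uint64 metadata key/value count); the sample bytes are synthetic, not taken from a real model:

```python
import struct

def read_gguf_header(data: bytes):
    """Parse the fixed-size GGUF preamble: 4-byte magic, uint32 version,
    uint64 tensor count, uint64 metadata KV count (24 bytes total)."""
    magic, version, n_tensors, n_kv = struct.unpack("<4sIQQ", data[:24])
    if magic != b"GGUF":
        raise ValueError("not a GGUF file")
    return version, n_tensors, n_kv

# Illustrative header bytes (not read from a real model file).
sample = struct.pack("<4sIQQ", b"GGUF", 2, 3, 19)
version, n_tensors, n_kv = read_gguf_header(sample)
print(version, n_tensors, n_kv)  # 2 3 19
```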

Tested with the following commands:

make clean && LLAMA_DEBUG=1 make -j main convert-llama2c-to-ggml

# Read & convert vocab from llama2.c/tokenizer.bin
./convert-llama2c-to-ggml \
    --copy-vocab-from-model ../llama2.c/tokenizer.bin \
    --llama2c-model stories42M.bin \
    --llama2c-output-model stories42M.gguf.converted-vocab.bin && \
  ./main -m stories42M.gguf.converted-vocab.bin -p "One day, Lily met a Shoggoth" -n 500 -c 256 --ignore-eos

# Copy vocab from an existing llama GGUF model
./convert-llama2c-to-ggml \
    --copy-vocab-from-model llama-2-7b-chat.gguf.q2_K.bin \
    --llama2c-model stories42M.bin \
    --llama2c-output-model stories42M.gguf.copied-vocab.bin && \
  ./main -m stories42M.gguf.copied-vocab.bin -p "One day, Lily met a Shoggoth" -n 500 -c 256 --ignore-eos
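As background on the input side of those commands, llama2.c `.bin` files begin with seven int32 config fields (the `Config` struct in Karpathy's llama2.c: `dim`, `hidden_dim`, `n_layers`, `n_heads`, `n_kv_heads`, `vocab_size`, `seq_len`). A minimal sketch of reading that header; the sample values below are illustrative, not read from stories42M.bin:

```python
import struct

# Field order matches the Config struct in llama2.c's run.c.
FIELDS = ("dim", "hidden_dim", "n_layers", "n_heads",
          "n_kv_heads", "vocab_size", "seq_len")

def read_llama2c_header(data: bytes) -> dict:
    """Unpack the 7 little-endian int32 config fields (28 bytes)."""
    values = struct.unpack("<7i", data[:28])
    return dict(zip(FIELDS, values))

# Synthetic header bytes for illustration (not from a real file).
sample = struct.pack("<7i", 512, 1376, 8, 8, 8, 32000, 1024)
config = read_llama2c_header(sample)
print(config["dim"], config["n_layers"], config["vocab_size"])  # 512 8 32000
```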

@ochafik ochafik changed the title from "[Draft] llama2.c: direct gguf output (WIP)" to "Update llama2.c converter to read vocab and write models in GGUF format" on Aug 26, 2023
@ochafik ochafik marked this pull request as ready for review August 26, 2023 22:11
@ggerganov (Owner) left a comment:

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice work - tested it and it works 🦙

@ggerganov ggerganov merged commit 230d46c into ggerganov:master Aug 27, 2023
24 checks passed
@byte-6174 (Contributor) commented:

indeed, @ochafik nice quick turnaround!

@ochafik (Collaborator, Author) commented Aug 28, 2023:

Thanks guys!!

akawrykow pushed a commit to akawrykow/llama.cpp that referenced this pull request Aug 29, 2023
…n GGUF format (ggerganov#2751)

* llama2.c: direct gguf output (WIP)

* Simplify vector building logic

* llama2.c gguf conversion: fix token types in converter

* llama2.c: support copying vocab from a llama gguf model file

* llama2.c: update default path for vocab model + readme

* llama2.c: use defines for gguf keys

* llama2.c: escape whitespaces w/ U+2581 in vocab converter the llama.cpp way

* llama2.c converter: cleanups + take n_ff from config
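The whitespace-escaping commit in the list above refers to the SentencePiece convention that llama.cpp follows: spaces inside vocab tokens are replaced with U+2581 ("LOWER ONE EIGHTH BLOCK", ▁). A minimal sketch of that mapping, not the converter's actual code:

```python
def escape_whitespace(token: str) -> str:
    # Replace each space with U+2581, the SentencePiece
    # word-boundary marker used in llama.cpp vocabularies.
    return token.replace(" ", "\u2581")

def unescape_whitespace(token: str) -> str:
    # Inverse mapping, as applied when detokenizing back to text.
    return token.replace("\u2581", " ")

print(escape_whitespace(" hello world"))  # ▁hello▁world
```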