GGUF #2398

Merged
merged 253 commits into from
Aug 21, 2023
This pull request is big! We’re only showing the most recent 250 commits.

Commits on Jul 26, 2023

  1. gguf : first API pass

    ggerganov committed Jul 26, 2023
    6873148
  2. 8d6acfe
  3. gguf : read tensor info

    ggerganov committed Jul 26, 2023
    d91b985
  4. 78b226a
  5. 860c9c6
  6. cb871fa
  7. d313c0f
  8. e46870f
  9. 5628ec7
  10. gguf : add comments

    ggerganov committed Jul 26, 2023
    d8491fc

Commits on Jul 27, 2023

  1. c85d317
  2. d89533d
  3. gguf : add array support

    ggerganov committed Jul 27, 2023
    d2b6ca1
  4. 158be8f
  5. 68f5348
  6. d2bb3ac

Commits on Jul 28, 2023

  1. GGUF : write tensor (#2426)

    * WIP: Write tensor
    
    * GGUF : Support writing tensors in Python
    
    * refactor : rm unused import and upd todos
    
    * fix : fix errors upd writing example
    
    * rm example.gguf
    
    * gitignore *.gguf
    
    * undo formatting
    monatis authored Jul 28, 2023
    11ef380
  2. gguf : add gguf_find_key (#2438)

    * gguf.cpp : find key example
    
    * ggml.h : add gguf_find_key
    
    * ggml.c : add gguf_find_key
    klosax authored Jul 28, 2023
    3492f84
  3. gguf : fix writing tensors

    monatis committed Jul 28, 2023
    1495735
  4. 9475cdb

Commits on Jul 29, 2023

  1. 08dc8fd
  2. 06f423a
  3. d54f53c
  4. 999431c
  5. ea5f9ad
  6. aa99562
  7. 93f7f7a
  8. 0c219fb
  9. c861e23
  10. 8a76dd8
  11. gguf : write tokenizer data

    monatis committed Jul 29, 2023
    cc3dd7f
  12. 0317c41
  13. 8ad7cd4
  14. 0f5e57f
  15. 34469b9
  16. ggml.c : get arr str and f32

    klosax authored Jul 29, 2023
    2c22e3b
  17. gguf.py : support any type

    klosax authored Jul 29, 2023
    9577821
  18. 06c3e4a
  19. 32e037f
  20. 87c34e4

Commits on Jul 30, 2023

  1. 0790c12
  2. ccd81a7
  3. b4676ee
  4. ggml.c : add gguf_get_arr_n

    klosax authored Jul 30, 2023
    b19c117
  5. 4ed98bf
  6. add gptneox gguf example

    klosax authored Jul 30, 2023
    e9192b0
  7. f175b05
  8. 2fabc17
  9. add gptneox gguf example

    klosax authored Jul 30, 2023
    30c4ea4
  10. 068a8e0
  11. 2a09146

Commits on Jul 31, 2023

  1. 4f5b622
  2. 6b3a7b9
  3. 7aa0a0e
  4. b26f5b2
  5. bb42aef
  6. f3de876
  7. da4900e

Commits on Aug 1, 2023

  1. e7a7416
  2. c77fabb
  3. Update gptneox-main.cpp

    klosax authored Aug 1, 2023
    36a36c3
  4. ff1cb02
  5. 49380a2
  6. 1b4f9c8

Commits on Aug 2, 2023

  1. cf365fb
  2. gguf-util.h : update note

    monatis committed Aug 2, 2023
    c3a65c4
  3. e1e9b28
  4. c5ba5ef

Commits on Aug 4, 2023

  1. Delete gptneox-common.cpp

    klosax authored Aug 4, 2023
    23abbe8
  2. Delete gptneox-common.h

    klosax authored Aug 4, 2023
    6691aa8
  3. 2922280
  4. e6f19ba
  5. 5d98989
  6. fb0b243
  7. 278ada9
  8. db5618a

Commits on Aug 7, 2023

  1. 4357e69
  2. 1da82c5
  3. gguf : minor stuff

    ggerganov committed Aug 7, 2023
    8083ae3
  4. Update gptneox-main.cpp

    klosax authored Aug 7, 2023
    65559a2

Commits on Aug 8, 2023

  1. map tensor names

    klosax authored Aug 8, 2023
    ece4fc1
  2. f4d137d
  3. 7d5f452
  4. 0246d0d

Commits on Aug 10, 2023

  1. 1c4d8bf
  2. 4f86518
  3. rm binary committed by mistake

    monatis committed Aug 10, 2023
    4c0f64e
  4. upd .gitignore

    monatis committed Aug 10, 2023
    22de6c5
  5. gguf : calculate n_mult

    monatis committed Aug 10, 2023
    42cc04d
  6. cfb8e35
  7. gguf : rm deprecated function

    monatis committed Aug 10, 2023
    f316b94

Commits on Aug 11, 2023

  1. e7d346c
  2. a356b0e
  3. b2440f1
  4. gguf : add gguf_get_kv_type

    monatis committed Aug 11, 2023
    eb8ca69
  5. gguf : add gguf_get_kv_type

    monatis committed Aug 11, 2023
    e3a4960
  6. 28abfc9
  7. 781b9ec
  8. d09fd10
  9. 61919c1
  10. 7009cf5
  11. gguf : rm redundant method

    monatis committed Aug 11, 2023
    f44bbd3
  12. e732423
  13. Update gguf_tensor_map.py

    klosax authored Aug 11, 2023
    2a5ac7a
  14. Update gptneox-main.cpp

    klosax authored Aug 11, 2023
    e76c59d

Commits on Aug 12, 2023

  1. 2f52008
  2. 186c496
  3. 4fa017a
  4. 0e1a3c7
  5. c4f02b4
  6. b2571af
  7. fa7c395
  8. 1fc3d30
  9. 202eab0
  10. gguf : proper closing of file

    monatis committed Aug 12, 2023
    60d5408
  11. 5d81a71
  12. 8f09157
  13. 4cef57c
  14. f821847
  15. e606ffe
  16. 5e58ffa
  17. 8b5f0c5
  18. gguf.py : n_layer --> n_block

    klosax authored Aug 12, 2023
    d2ce9cf
  19. 489616e
  20. e91a222
  21. c7bd8c1
  22. Update gguf_tensor_map.py

    klosax authored Aug 12, 2023
    9bf5a7e

Commits on Aug 13, 2023

  1. e3d1f07
  2. 17800cd
  3. 91d4bfd
  4. fix conflicts

    monatis committed Aug 13, 2023
    1d60468
  5. bf2dad3
  6. 2827b84
  7. 6beebf3
  8. fix conflicts

    monatis committed Aug 13, 2023
    24f4883

Commits on Aug 14, 2023

  1. gguf : add todos and comments

    monatis committed Aug 14, 2023
    196b50f
  2. 56a1f32
  3. 5d22a9d
  4. 51939d7
  5. Delete gguf_tensor_map.py

    klosax authored Aug 14, 2023
    806a157
  6. d753dfb
  7. a7d226f
  8. 5c5a95b
  9. 0c19ae7
  10. gguf : use UNIX line ending

    ggerganov committed Aug 14, 2023
    62490f1
  11. 6f64b6c
  12. llama : sync gguf-llama.cpp with latest llama.cpp (#2608)

    * llama : sync gguf-llama.cpp with latest llama.cpp
    
    * minor : indentation + assert
    
    * llama : refactor gguf_buffer and gguf_ctx_buffer
    
    * llama : minor
    ggerganov authored Aug 14, 2023
    f00780b
  13. 6f14854
  14. 8af3a99
  15. llama : tokenizer fixes (#2549)

    * Merge tokenizer fixes into the gguf branch.
    
    * Add test vocabularies
    goerch authored Aug 14, 2023
    ec1b100
  16. convert : update convert-new.py with tokenizer fixes (#2614)

    * Merge tokenizer fixes into the gguf branch.
    
    * Add test vocabularies
    
    * Adapt convert-new.py (and fix a clang-cl compiler error on windows)
    goerch authored Aug 14, 2023
    afc4ca2
  17. llama : sync gguf-llama with llama (#2613)

    * llama : sync gguf-llama with llama
    
    * tests : fix build + warnings (test-tokenizer-1 still fails)
    
    * tests : fix wstring_convert
    
    * convert : fix layer names
    
    * llama : sync gguf-llama.cpp
    
    * convert : update HF converter to new tokenizer voodoo magics
    ggerganov authored Aug 14, 2023
    7494c78
  18. 6c63550
  19. 7ec125b
  20. 5d518d4
  21. gguf.py : add token types

    klosax authored Aug 14, 2023
    cedb487
  22. ab2cbd0
  23. ca47582
  24. 2dd5d2c

Commits on Aug 15, 2023

  1. b6056c3
  2. 66756c8
  3. 2ae0e98
  4. 4a1741a

Commits on Aug 16, 2023

  1. ea5615a
  2. llama : refactor model loading code (#2620)

    * llama : style formatting + remove helper methods
    
    * llama : fix quantization using gguf tool
    
    * llama : simplify gguf_file_saver
    
    * llama : fix method names
    
    * llama : simplify write_header()
    
    * llama : no need to pass full file loader to the file saver
    
    just gguf_ctx
    
    * llama : gguf_file_saver write I32
    
    * llama : refactor tensor names (#2622)
    
    * gguf: update tensor names searched in quantization
    
    * gguf : define tensor names as constants
    
    * gguf : initial write API (not tested yet)
    
    * gguf : write to file API (not tested)
    
    * gguf : initial write API ready + example
    
    * gguf : fix header write
    
    * gguf : fixes + simplify example + add ggml_nbytes_pad()
    
    * gguf : minor
    
    * llama : replace gguf_file_saver with new gguf write API
    
    * gguf : streaming support when writing files
    
    * gguf : remove obsolete write methods
    
    * gguf : remove obsolete gguf_get_arr_xxx API
    
    * llama : simplify gguf_file_loader
    
    * llama : move hparams and vocab from gguf_file_loader to llama_model_loader
    
    * llama : merge gguf-util.h in llama.cpp
    
    * llama : reorder definitions in .cpp to match .h
    
    * llama : minor simplifications
    
    * llama : refactor llama_model_loader (WIP)
    
    wip : remove ggml_ctx from llama_model_loader
    
    wip : merge gguf_file_loader in llama_model_loader
    
    * llama : fix shape prints
    
    * llama : fix Windows build + fix norm_rms_eps key
    
    * llama : throw error on missing KV pairs in model meta data
    
    * llama : improve printing + log meta data
    
    * llama : switch print order of meta data
    
    ---------
    
    Co-authored-by: M. Yusuf Sarıgöz <[email protected]>
    ggerganov and monatis authored Aug 16, 2023
    758ff1b
  3. gguf : deduplicate (#2629)

    * gguf : better type names
    
    * dedup : CPU + Metal is working
    
    * ggml : fix warnings about unused results
    
    * llama.cpp : fix line feed and compiler warning
    
    * llama : fix strncpy warning + note token_to_str does not write null
    
    * llama : restore the original load/save session implementation
    
    Will migrate this to GGUF in the future
    
    * convert-llama-h5-to-gguf.py : support alt ctx param name
    
    * ggml : assert when using ggml_mul with non-F32 src1
    
    * examples : dedup simple
    
    ---------
    
    Co-authored-by: klosax <[email protected]>
    ggerganov and klosax authored Aug 16, 2023
    88b5769
  4. c8ee87f
  5. 5ec1893

Commits on Aug 17, 2023

  1. 42f8fe1
  2. 5a0a2c5
  3. d6fd53a
  4. convert-new.py : output gguf (#2635)

    * convert-new.py : output gguf (WIP)
    
    * convert-new.py : add gguf key-value pairs
    
    * llama : add hparams.ctx_train + no longer print ftype
    
    * convert-new.py : minor fixes
    
    * convert-new.py : vocab-only option should work now
    
    * llama : fix tokenizer to use llama_char_to_byte
    
    * tests : add new ggml-vocab-llama.gguf
    
    * convert-new.py : tensor name mapping
    
    * convert-new.py : add map for skipping tensor serialization
    
    * convert-new.py : convert script now works
    
    * gguf.py : pick some of the refactoring from #2644
    
    * convert-new.py : minor fixes
    ggerganov authored Aug 17, 2023
    e0429d3
  5. 2ddd968
  6. Revert "ci : disable CI temporary to not waste energy"

    This reverts commit 7e82d25.
    ggerganov committed Aug 17, 2023
    dd016cc
  7. d646c4e
  8. 8ace03a
  9. 11bf436
  10. 6d66ef9
  11. editorconfig : ignore models folder

    ggml-ci
    ggerganov committed Aug 17, 2023
    c3b7393
  12. ci : update ".bin" to ".gguf" extension

    ggml-ci
    ggerganov committed Aug 17, 2023
    dd9e2fc
  13. 81a2c2a
  14. 93f285b
  15. llama : fix lambda capture

    ggml-ci
    ggerganov committed Aug 17, 2023
    899f9a5
  16. ggml : fix bug in gguf_set_kv

    ggml-ci
    ggerganov committed Aug 17, 2023
    e72c8c2
  17. common.h : .bin --> .gguf

    klosax authored Aug 17, 2023
    fb11dd3
  18. 78e1e57
  19. acaa982
  20. llama.cpp : typo

    klosax authored Aug 17, 2023
    b3cc182
  21. 57eaadb
  22. 5484737
  23. gguf.py : write tensors in a single pass (#2644)

    * gguf : single pass for writing tensors + refactoring writer
    
    * gguf : single pass for writing tensors + refactoring writer
    
    * gguf : single pass for writing tensors + refactoring writer
    
    * gguf : style fixes in simple conversion script
    
    * gguf : refactor gptneox conversion script
    
    * gguf : rename h5 to hf (for HuggingFace)
    
    * gguf : refactor pth to gguf conversion script
    
    * gguf : rm file_type key and method
    
    * gguf.py : fix vertical alignment
    
    * gguf.py : indentation
    
    ---------
    
    Co-authored-by: Georgi Gerganov
    monatis and ggerganov authored Aug 17, 2023
    fc3a523
  24. b668cd3
  25. gguf.py : gptneox mapping

    klosax authored Aug 17, 2023
    640ddc4
  26. 9e2d4dd
  27. 3c1b721
  28. ggml.h : reverse GGUF_MAGIC

    klosax authored Aug 17, 2023
    c20ae49
  29. gguf.py : reverse GGUF_MAGIC

    klosax authored Aug 17, 2023
    147a99b
  30. d9e6890
  31. 306070c

Commits on Aug 18, 2023

  1. b275de7
  2. aa3efe8
  3. 856afff
  4. e35f8c7
  5. dea5be6
  6. llama : re-order functions

    ggerganov committed Aug 18, 2023
    660ca9b
  7. 38016ed
  8. 2d6c2c7
  9. llama : minor API updates

    ggerganov committed Aug 18, 2023
    035d511
  10. 5d2656d
  11. llama : fix MPI build

    ggml-ci
    ggerganov committed Aug 18, 2023
    a4ad2bf
  12. 25b8a89
  13. fb7c883
  14. d5e976c
  15. 16ab9ba
  16. c0e4ca6
  17. 593b04f
  18. 281d6d1
  19. gguf.py : fix for falcon 40b

    klosax authored Aug 18, 2023
    bd5a579
  20. 1d80eea
  21. 2c8055b

Commits on Aug 19, 2023

  1. b3a7a2b
  2. cmpnct_gpt2bpe.hpp : fixes

    klosax authored Aug 19, 2023
    dadf098
  3. falcon-main.cpp : fixes

    klosax authored Aug 19, 2023
    781bf24
  4. gptneox-main.cpp : fixes

    klosax authored Aug 19, 2023
    8945d47
  5. 6a2e520
  6. Update examples/server/README.md

    Co-authored-by: slaren
    klosax and slaren authored Aug 19, 2023
    c0a1269
  7. cmpnct_gpt2bpe.hpp : cleanup

    klosax authored Aug 19, 2023
    28b8c26

Commits on Aug 20, 2023

  1. 76b4662
  2. f838faa

Commits on Aug 21, 2023

  1. 5a02b96
  2. 4f92488
  3. 7de7cb4
  4. d5c8fcf
  5. 287db51
  6. 58bde5c
  7. c818c40
  8. gguf.py : fix rope scale kv

    klosax authored Aug 21, 2023
    6a69a69
  9. 5f6ff38
  10. dc1f051
  11. llama.cpp : use rope scale kv

    klosax authored Aug 21, 2023
    c082b9f
  12. 9070e33
  13. 7a7d1ba
  14. Merge branch 'master' into gguf

    ggml-ci
    ggerganov committed Aug 21, 2023
    1e7a009
  15. py : fix whitespace

    ggerganov committed Aug 21, 2023
    6490ff7
  16. gguf : add Python script to convert GGMLv3 LLaMA models to GGUF (#2682)

    * First pass at converting GGMLv3 LLaMA models to GGUF
    
    * Cleanups, better output during conversion
    
    * Fix vocab space conversion logic
    
    * More vocab conversion fixes
    
    * Add description to converted GGUF files
    
    * Improve help text, expand warning
    
    * Allow specifying name and description for output GGUF
    
    * Allow overriding vocab and hyperparams from original model metadata
    
    * Use correct params override var name
    
    * Fix wrong type size for Q8_K
    
    Better handling of original style metadata
    
    * Set default value for gguf add_tensor raw_shape KW arg
    KerfuffleV2 authored Aug 21, 2023
    e06cbce
  17. llama : improve token type support (#2668)

    * Merge tokenizer fixes into the gguf branch.
    
    * Add test vocabularies
    
    * Adapt convert-new.py (and fix a clang-cl compiler error on windows)
    
    * Improved tokenizer test
    
    But does it work on MacOS?
    
    * Improve token type support
    
    - Added @klosax code to convert.py
    - Improved token type support in vocabulary
    
    * Exclude platform dependent tests
    
    * More sentencepiece compatibility by eliminating magic numbers
    
    * Restored accidentally removed comment
    goerch authored Aug 21, 2023
    8d177ed
  18. llama : add API for token type

    ggml-ci
    ggerganov committed Aug 21, 2023
    0b53b8b
  19. tests : use new tokenizer type API (#2692)

    * Merge tokenizer fixes into the gguf branch.
    
    * Add test vocabularies
    
    * Adapt convert-new.py (and fix a clang-cl compiler error on windows)
    
    * Improved tokenizer test
    
    But does it work on MacOS?
    
    * Improve token type support
    
    - Added @klosax code to convert.py
    - Improved token type support in vocabulary
    
    * Exclude platform dependent tests
    
    * More sentencepiece compatibility by eliminating magic numbers
    
    * Restored accidentally removed comment
    
    * Improve commentary
    
    * Use token type API in test-tokenizer-1.cpp
    goerch authored Aug 21, 2023
    49c25cc
  20. py : cosmetics

    ggerganov committed Aug 21, 2023
    811f653
  21. readme : add notice about new file format

    ggml-ci
    ggerganov committed Aug 21, 2023
    66a66a0