llama : refactor model loading code #1991

ggerganov · 2023-06-25T10:30:31Z

In llama.cpp we have logic for supporting some very old model formats and features such as sharded models which is making the code unnecessary complicated and difficult to maintain. We should simplify it and remove support for old stuff that is no longer used.

Additionally, with the upcoming unified file format (ggerganov/ggml#220) we will have to look into reimplementing the code to use it and add support for loading non-LLaMA models as well. This will be an important step towards adding inference of new models such as MPT and Falcon. Therefore, simplifying the logic as much as possible will help to easily adopt the new unified file format when it is ready

The text was updated successfully, but these errors were encountered:

hetiejun · 2023-06-25T11:04:33Z

A gental solution might be to provide a tool that can convert old formats and even other formats into the new format, I suppose.

howard0su · 2023-06-26T11:37:15Z

Remove shards support. #2000

ggerganov · 2023-08-21T20:22:19Z

Closed via #2398

ggerganov added good first issue Good for newcomers refactoring Refactoring labels Jun 25, 2023

howard0su self-assigned this Jun 26, 2023

ggerganov mentioned this issue Jun 26, 2023

Remove shards weight file support #2000

Merged

slaren mentioned this issue Jun 27, 2023

llama : add Falcon LLM support #1602

Closed

ggerganov mentioned this issue Aug 16, 2023

GGUF #2398

Merged

34 tasks

ggerganov closed this as completed Aug 21, 2023

This was referenced Oct 28, 2023

StableLM support #3586

Merged

llama : refactor graph build code #3837

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llama : refactor model loading code #1991

llama : refactor model loading code #1991

ggerganov commented Jun 25, 2023

hetiejun commented Jun 25, 2023

howard0su commented Jun 26, 2023

ggerganov commented Aug 21, 2023

llama : refactor model loading code #1991

llama : refactor model loading code #1991

Comments

ggerganov commented Jun 25, 2023

hetiejun commented Jun 25, 2023

howard0su commented Jun 26, 2023

ggerganov commented Aug 21, 2023