Skip to content
This repository has been archived by the owner on Jun 24, 2024. It is now read-only.

Add Falcon Support #313

Merged
merged 12 commits into from
Jun 28, 2023
Merged

Add Falcon Support #313

merged 12 commits into from
Jun 28, 2023

Conversation

LLukas22
Copy link
Contributor

@LLukas22 LLukas22 commented Jun 17, 2023

Partially closes #293.

Implementation of ggerganov/ggml#231 into llm

Currently the Falcon GGML file format is not finalized see cmp-nct/ggllm.cpp#5 and cmp-nct/ggllm.cpp#1. We'll leave this as a draft as long as the format isn't fixed.

This currently only supports the 7B variants as the 40B models need some additional changes.

This also needs the K/V-Store to be f32 as the op_repeat operation is only implemented for f32. Simply add --no-float16 to the CLI call. Ideally these repeat calls should be replaced later on.

Models can be found here

crates/models/falcon/Cargo.toml Outdated Show resolved Hide resolved
crates/models/falcon/src/lib.rs Outdated Show resolved Hide resolved
crates/models/falcon/src/lib.rs Outdated Show resolved Hide resolved
crates/models/falcon/src/lib.rs Outdated Show resolved Hide resolved
@philpax philpax marked this pull request as ready for review June 28, 2023 22:52
@philpax philpax merged commit 3a63142 into main Jun 28, 2023
4 checks passed
@philpax philpax deleted the feat/falcon-support branch July 16, 2023 19:27
@hhamud hhamud mentioned this pull request Aug 7, 2023
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
topic:model-support Support for new models
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Support Falcon
3 participants