-
Notifications
You must be signed in to change notification settings - Fork 58
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
converter does not work with the current ggml #23
Comments
I ran into the same issue, but after making these changes it works fine. 007f063 |
Thanks! I think latest ggml with your increasing memory size code can be used to convert the models.
|
I see that this has been updated: https://github.com/skeskinen/bert.cpp/blob/master/bert.cpp#L461 |
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Tried to convert
https://huggingface.co/intfloat/e5-large-v2
to ggml with the currentd9f04e609fb7f7e5fb3b20a77d4d685219971009
commit. However, execution of the converted f32, f16, q4_0, and q4_1 models shows thenot enough space in the context's memory pool
message. Maybe it is related to ggerganov/ggml#158 ?The text was updated successfully, but these errors were encountered: