Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The model you provide in modelscope and huggingface is not complete #3

Open
lynquantumman opened this issue Feb 19, 2024 · 1 comment

Comments

@lynquantumman
Copy link

The model you provide in modelscope and huggingface only include the LLM model. The cross attention part and visual part is missing. Ergo, based on the ckpt, we cannot re-implement your exps. Hope you can make it complete. Plus, if it is what it expected to be on the png image, it should be a greate work.

@Zhazhan
Copy link
Collaborator

Zhazhan commented Mar 1, 2024

Thank you for the reminder! We have supplemented the cross-attention part and the vision encoder part on both modelscope and huggingface. We have also added a script in the code that merges them into a single model and illustrated it in the README. We greatly appreciate your support for CodeFuse-VLM!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants