Fix llama3 generation #278

Closed
wants to merge 2 commits
Conversation

@satyaog (Member) commented Sep 10, 2024

  • force bf16 instead of f32, so the model has the same size as the pretrained weights (see the sketch below)
  • fix the no_pretrained arg
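A minimal sketch of what forcing bf16 can look like when loading the model; the HuggingFace transformers call and model id below are illustrative assumptions, not this benchmark's actual code:

```python
# Sketch (assumption): loading weights in bf16 keeps the in-memory model
# the same size as the published pretrained checkpoint, instead of the
# ~2x footprint you get when the weights are upcast to f32.
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3-8B",  # hypothetical model id, for illustration
    torch_dtype=torch.bfloat16,    # force bf16 instead of an f32 default
)
```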

```diff
@@ -154,7 +162,7 @@ def main():
     #
     huggingface_format = config.get("safetensors", False)
-    pretrained = not args.no_pretrained
+    pretrained = not config.get("no_pretrained", False)
```
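For reference, a small sketch of the new line's semantics: with a defaulted config lookup, benchmarks that never set the key still load pretrained weights, and a config/*.yaml entry can opt out explicitly. The `config` values below are made up for illustration:

```python
# Sketch (assumption): semantics of `not config.get("no_pretrained", False)`.
config = {}                                          # key absent from the bench config
pretrained = not config.get("no_pretrained", False)
assert pretrained                                    # default: use pretrained weights

config = {"no_pretrained": True}                     # explicit opt-out in the config
pretrained = not config.get("no_pretrained", False)
assert not pretrained                                # use randomly initialized weights
```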
@Delaunay (Collaborator) commented Sep 11, 2024
I am not a fan of "no_xxx" configs, but nothing else comes to mind. Do you have suggestions?

@satyaog (Member, Author) commented Sep 23, 2024

Is pretrained already used somewhere? Otherwise, I'm not sure we have access to the bench config (from config/*.yaml) there, but that would be cleaner, I suppose.

@satyaog (Member, Author) commented
Or the arg could be untrained

@Delaunay Delaunay deleted the branch mila-iqia:staging October 2, 2024 17:00
@Delaunay Delaunay closed this Oct 2, 2024