Issues: ServiceNow/Fast-LLM
Issues list
[doc] Improve the Checkpoint Conversion Guide for Practical Usability
documentation
Improvements or additions to documentation
#111
opened Jan 9, 2025 by
tscholak
[feat] Implement Loss Masking to Exclude Predefined Token Spans from LM Loss
enhancement
New feature or request
[doc] Benchmark Fast-LLM Pretraining Throughput Against Contemporary Frameworks
documentation
Improvements or additions to documentation
#108
opened Jan 8, 2025 by
tscholak
[doc] Automate MkDocs Documentation for Fast-LLM Configuration Options
documentation
Improvements or additions to documentation
[feat] Public Non-CLI API for training and data preparation
enhancement
New feature or request
#105
opened Jan 7, 2025 by
tscholak
[feat] Track Exact Fast-LLM Version in Training Outputs and wandb Logs
enhancement
New feature or request
#101
opened Dec 31, 2024 by
tscholak
[feat] Improve the weight conversion interface
enhancement
New feature or request
#99
opened Dec 21, 2024 by
jlamypoirier
[feat] Dump full configuration in experiment-dir
enhancement
New feature or request
#91
opened Dec 10, 2024 by
RaymondLi0
[bug] Inconsistent init_method_std in test_load_distributed_checkpoint_dp2
bug
Something isn't working
#88
opened Dec 9, 2024 by
jlamypoirier
[bug] Conversion fails when using layers_per_step with some input formats
bug
Something isn't working
#87
opened Dec 4, 2024 by
RaymondLi0
[feat] Option to disable top-k routing weights normalization
enhancement
New feature or request
#83
opened Dec 3, 2024 by
sohamparikh
[bug] Increasing training loss likely due to desynchronization
bug
Something isn't working
#77
opened Dec 2, 2024 by
tscholak
[docs] Llama 3.1 8B continual pretraining recipe
documentation
Improvements or additions to documentation
#70
opened Nov 26, 2024 by
tscholak
[feat] Support Mamba 2 blocks
enhancement
New feature or request
#68
opened Nov 25, 2024 by
tscholak
[feat] Data lineage tracking in metadata.yaml
enhancement
New feature or request
#67
opened Nov 25, 2024 by
tscholak
[feat] Streamline evaluations: Add integrated Evaluator framework
enhancement
New feature or request
#65
opened Nov 24, 2024 by
tscholak
[feat] Optional Attention Mask to Prevent Cross-Document Attention in Sequences
enhancement
New feature or request
#62
opened Nov 22, 2024 by
tscholak
[bug] Sparse copy runs out of shared memory with many experts
bug
Something isn't working
#56
opened Nov 20, 2024 by
sohamparikh