Skip to content

Pull requests: karpathy/llm.c

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Mapping "py" gpt2 functionalities to match "c"
#783 opened Oct 31, 2024 by omarswelam Loading…
Verify vocab is padded before reshaping
#782 opened Oct 23, 2024 by austinleedavis Loading…
FP32 FlashAttention
#781 opened Oct 20, 2024 by ssiu Loading…
Activation Checkpointing for Llama3 branch
#773 opened Oct 2, 2024 by ademeure Loading…
-pm -> -pi: typo in error_usage
#765 opened Sep 22, 2024 by thundergolfer Loading…
Micro optimization for softmax_forward_kernel5
#762 opened Sep 20, 2024 by insop Loading…
FP8 with Tensor Reorg
#760 opened Sep 19, 2024 by ademeure Draft
Update download_starter_pack.sh
#758 opened Sep 18, 2024 by dongrixinyu Loading…
Add SwiGLU support - llama3 feature branch
#755 opened Sep 13, 2024 by gordicaleksa Loading…
add llama 3 support to llm.c
#754 opened Sep 13, 2024 by karpathy Draft
Adamw thread coarsening kernel
#753 opened Sep 3, 2024 by saladpalad Loading…
Fix sizing typo in train_gpt2_fp32.cu
#748 opened Aug 25, 2024 by gajanan-choudhary Loading…
log with LINE and FILE for better addressing.
#746 opened Aug 22, 2024 by NEWPLAN Loading…
check libnccl instead of nccl to be more reliable
#742 opened Aug 14, 2024 by dengl11 Loading…
[WIP] initial curand implementation for model init
#741 opened Aug 13, 2024 by ngc92 Loading…
multi-threaded model initialization
#737 opened Aug 12, 2024 by ngc92 Loading…
Add external KV to LLaMA 3
#734 opened Aug 10, 2024 by gordicaleksa Loading…
Add SwiGLU support
#718 opened Jul 29, 2024 by gordicaleksa Loading…
Add RoPE positional encoding
#714 opened Jul 28, 2024 by gordicaleksa Loading…
ProTip! What’s not been updated in a month: updated:<2024-10-19.