
Commit

note about dropout
rasbt committed Jun 19, 2024
1 parent bbb2a0c commit 85827e0
Showing 1 changed file with 1 addition and 1 deletion.
@@ -180,7 +180,7 @@ def train_model_simple(model, optimizer, device, n_epochs,
 "emb_dim": 12, # Embedding dimension
 "n_heads": 2, # Number of attention heads
 "n_layers": 2, # Number of layers
-"drop_rate": 0.0, # Dropout rate
+"drop_rate": 0.0, # Dropout rate, deactivated via 0.0 as dropout in LLMs is not recommended anymore
 "qkv_bias": False # Query-key-value bias
 }
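
Why a `drop_rate` of 0.0 deactivates dropout can be shown with a small sketch. The `dropout` helper below is hypothetical and written in plain Python for illustration; it is not the repository's code. It assumes the common "inverted dropout" formulation, in which each value is zeroed with probability `drop_rate` and the survivors are scaled by `1 / (1 - drop_rate)`.

```python
import random


def dropout(values, drop_rate, training=True, rng=None):
    """Inverted dropout on a list of floats (illustrative helper, not from the repository)."""
    if not 0.0 <= drop_rate < 1.0:
        raise ValueError("drop_rate must be in [0, 1)")
    # With drop_rate 0.0 (or outside of training) the input passes through unchanged.
    if not training or drop_rate == 0.0:
        return list(values)
    rng = rng or random.Random()
    keep_prob = 1.0 - drop_rate
    # Keep each value with probability keep_prob and rescale it; zero it otherwise.
    return [v / keep_prob if rng.random() < keep_prob else 0.0 for v in values]


activations = [0.5, -1.2, 3.0, 0.7]

# drop_rate=0.0, as in the config above: the layer acts as an identity.
unchanged = dropout(activations, drop_rate=0.0)

# A non-zero rate zeroes some entries and rescales the rest.
dropped = dropout(activations, drop_rate=0.5, rng=random.Random(0))
```

Under this assumption, setting `"drop_rate": 0.0` leaves the dropout layers in the model definition in place but turns them into no-ops, so the rest of the configuration does not have to change.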
