-
Notifications
You must be signed in to change notification settings - Fork 525
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Chore: prevent from printing additional header in lcurve.out
on resuming
#4511
Conversation
…esuming training
for more information, see https://pre-commit.ci
📝 WalkthroughWalkthroughThe changes in the Changes
Sequence DiagramsequenceDiagram
participant Trainer
participant DataLoader
participant Iterator
Trainer->>Trainer: Initialize training
alt Is Restart Training
Trainer->>Trainer: Set lcurve_should_print_header to False
else New Training
Trainer->>Trainer: Set lcurve_should_print_header to True
end
Trainer->>DataLoader: Get data
alt Data Exhausted
DataLoader->>Iterator: Reinitialize
Iterator-->>DataLoader: Reset iterator
end
DataLoader-->>Trainer: Return batch data
The sequence diagram illustrates the modified training initialization and data loading process, highlighting the conditional header printing and data loader handling when training is resumed or data is exhausted. 📜 Recent review detailsConfiguration used: CodeRabbit UI 📒 Files selected for processing (1)
🔇 Additional comments (2)deepmd/pt/train/training.py (2)
The initialization of Line range hint The data handling logic in
This implementation ensures seamless training continuation for both single-task and multi-task scenarios. Also applies to: 604-631 Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media? 🪧 TipsChatThere are 3 ways to chat with CodeRabbit:
Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments. CodeRabbit Commands (Invoked using PR comments)
Other keywords and placeholders
CodeRabbit Configuration File (
|
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## devel #4511 +/- ##
=======================================
Coverage 84.59% 84.59%
=======================================
Files 675 675
Lines 63575 63574 -1
Branches 3486 3486
=======================================
Hits 53779 53779
+ Misses 8671 8670 -1
Partials 1125 1125 ☔ View full report in Codecov by Sentry. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
-
I don't suggest so, as the checkpoint is not saved every steps. You may see 10000 10100 (restart) 10000 10100. It's not clear if the title is removed.
-
All backends should have the same behavior.
Summary by CodeRabbit