
feat(pt): train with energy Hessian #4169

Merged

223 commits merged into deepmodeling:devel on Dec 27, 2024

Conversation

@1azyking 1azyking commented Sep 29, 2024

Summary by CodeRabbit

  • New Features

    • Introduced support for Hessian calculations across various components, enhancing the model's capabilities.
    • Added a new loss function for Hessian, allowing for more comprehensive training scenarios.
    • New JSON configuration files for multi-task and single-task learning models.
    • Enhanced output handling to include Hessian data in model evaluations.
    • Added new methods and properties to support Hessian in several classes and modules.
  • Bug Fixes

    • Improved handling of output shapes and results related to Hessian data.
  • Documentation

    • Updated documentation to include new Hessian properties and training guidelines.
    • Added sections detailing Hessian configurations and requirements in the training documentation.
  • Tests

    • Added unit tests for the new Hessian-related functionalities to ensure consistency and correctness.
    • Enhanced existing test cases to incorporate Hessian data handling and validation.

1azyking added 30 commits August 6, 2024 14:28
Contributor

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (3)
doc/model/train-energy-hessian.md (1)

48-54: Add validation guidance for Hessian matrices.

Consider adding guidance on how users can validate their Hessian matrices format, including:

  1. A script or command to verify the shape and dimensions
  2. Common pitfalls when preparing the data
  3. Example of a minimal valid Hessian matrix

Would you like me to help draft the validation guidance section?
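As a starting point, here is a hypothetical validation sketch (not part of DeePMD-kit; it assumes the layout described in this documentation page, with one flattened (3*natoms) x (3*natoms) matrix per frame stored in hessian.npy):

# Hypothetical helper -- a sketch under the assumptions stated above.
import numpy as np

def check_hessian(path, natoms):
    h = np.load(path)
    dof = 3 * natoms
    if h.ndim != 2 or h.shape[1] != dof * dof:
        raise ValueError(f"expected shape (nframes, {dof * dof}), got {h.shape}")
    for i, frame in enumerate(h):
        mat = frame.reshape(dof, dof)
        # Common pitfall: analytic Hessians should be numerically symmetric.
        if not np.allclose(mat, mat.T, atol=1e-6):
            print(f"warning: frame {i} Hessian is not symmetric")

check_hessian("set.000/hessian.npy", natoms=5)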

deepmd/pt/model/model/ener_model.py (2)

42-44: Add docstring to document the enable_hessian method

The method implementation is correct, but it would benefit from documentation explaining its purpose, effects, and any prerequisites.

 def enable_hessian(self):
+    """Enable Hessian computation for energy calculations.
+    
+    This method enables the computation of energy Hessian matrices by:
+    1. Setting up the required computational graph for energy Hessian
+    2. Enabling the internal Hessian flag
+    """
     self.requires_hessian("energy")
     self._hessian_enabled = True

62-63: Document Hessian tensor structure and add shape validation

While the Hessian output is correctly added, it would be beneficial to document the expected tensor structure and add shape validation.

 if self._hessian_enabled:
+    # Validate that the Hessian's trailing dimensions form a square matrix,
+    # i.e. [..., natoms * 3, natoms * 3]
+    hessian = out_def_data["energy_derv_r_derv_r"]
+    last_two = hessian.shape[-2:]
+    if not (len(last_two) == 2 and last_two[0] == last_two[1]):
+        raise ValueError(f"Invalid Hessian shape: {hessian.shape}")
     output_def["hessian"] = out_def_data["energy_derv_r_derv_r"]
📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between d1361ca and 19d0cd3.

📒 Files selected for processing (7)
  • deepmd/pt/infer/deep_eval.py (5 hunks)
  • deepmd/pt/model/model/__init__.py (1 hunks)
  • deepmd/pt/model/model/ener_model.py (4 hunks)
  • deepmd/pt/model/model/make_hessian_model.py (1 hunks)
  • deepmd/utils/argcheck.py (2 hunks)
  • doc/model/train-energy-hessian.md (1 hunks)
  • source/tests/pt/model/test_dp_hessian_model.py (1 hunks)
🚧 Files skipped from review as they are similar to previous changes (4)
  • deepmd/pt/model/model/__init__.py
  • source/tests/pt/model/test_dp_hessian_model.py
  • deepmd/pt/infer/deep_eval.py
  • deepmd/pt/model/model/make_hessian_model.py
🧰 Additional context used
📓 Learnings (1)
deepmd/utils/argcheck.py (1)
Learnt from: 1azyking
PR: deepmodeling/deepmd-kit#4169
File: deepmd/utils/argcheck.py:1982-2117
Timestamp: 2024-11-12T05:47:21.643Z
Learning: The `loss_ener_hess` and `loss_ener` functions should remain separate to avoid confusion, despite code duplication.
🪛 Markdownlint (0.37.0)
doc/model/train-energy-hessian.md

39-39: null
Fenced code blocks should have a language specified

(MD040, fenced-code-language)

🔇 Additional comments (10)
doc/model/train-energy-hessian.md (4)

1-5: Add standard frontmatter and TOC entry.

Please follow the project's documentation standards:

  1. Add frontmatter section following the format from dpa2.md
  2. Add this document to the table of contents

7-9: LGTM: Clear warning about model freezing limitations.

The warning effectively communicates the limitations and fallback behavior when freezing Hessian models.


18-29: Update loss type to match implementation.

The loss type should be "ener_hess" instead of "ener" when training with Hessian calculations.


39-46: Specify language for fenced code block.

To improve consistency and adhere to Markdown best practices, specify a language for the fenced code block.

🧰 Tools
🪛 Markdownlint (0.37.0)

39-39: null
Fenced code blocks should have a language specified

(MD040, fenced-code-language)

deepmd/utils/argcheck.py (2)

2179-2180: LGTM! Documentation is clear and consistent.

The documentation for the new Hessian parameters follows the project's style and clearly explains their purpose.


2235-2248: Implementation looks good, verify loss calculation handling.

The new Hessian parameters are well-implemented with appropriate types, defaults, and documentation. The implementation follows the project's patterns consistently.

Run the following script to verify the loss calculation code handles these new parameters:

✅ Verification successful

Implementation is correctly handled in loss calculation code

The verification shows that the new Hessian parameters are properly integrated into the loss calculation system:

  • The parameters are correctly used in deepmd/pt/loss/ener.py where:
    • They control the Hessian loss activation via has_h = (start_pref_h != 0.0 and limit_pref_h != 0.0)
    • The loss calculation properly scales between the start and limit prefactors (sketched after this list)
    • The Hessian loss is correctly computed and added to the total loss
  • The implementation is tested in source/tests/pt/test_loss.py
  • Example configurations in examples/hessian/ demonstrate proper usage
  • The training system correctly enables Hessian mode when these parameters are set, via the whether_hessian() function
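To make the prefactor scaling concrete, here is an illustrative sketch of the start-to-limit interpolation; the authoritative expression lives in deepmd/pt/loss/ener.py, and the function and variable names here are assumptions:

# Illustrative sketch only; check deepmd/pt/loss/ener.py for the real form.
def hessian_prefactor(cur_lr, start_lr, start_pref_h, limit_pref_h):
    # Mirrors the activation rule quoted above.
    has_h = start_pref_h != 0.0 and limit_pref_h != 0.0
    if not has_h:
        return 0.0
    # Equals start_pref_h when cur_lr == start_lr and approaches
    # limit_pref_h as the learning rate decays toward zero.
    return limit_pref_h + (start_pref_h - limit_pref_h) * cur_lr / start_lr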
🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: Verify that the loss calculation code properly handles the new Hessian parameters

# Check for loss calculation implementation
rg -A 10 "start_pref_h|limit_pref_h" --type python

# Check for test coverage
rg -A 10 "test.*hessian.*loss" --type python

Length of output: 161


Script:

#!/bin/bash
# Retry without type filtering: ripgrep's type name for Python is `py`,
# not `python`, so the previous filter failed
# Check for loss calculation implementation
rg -A 10 "start_pref_h|limit_pref_h"

# Check for test coverage
rg -A 10 "test.*hessian.*loss"

# Additionally search for Hessian-related loss calculations
rg -A 10 "hessian.*loss|loss.*hessian"

Length of output: 16280

deepmd/pt/model/model/ener_model.py (4)

18-20: LGTM: Import follows project conventions

The new import for Hessian model functionality follows the project's import conventions and is properly placed with other model-related imports.


40-40: LGTM: Proper initialization of Hessian flag

The _hessian_enabled flag is properly initialized as a private attribute with an appropriate default value of False, following Python conventions.


26-26: Verify the Hessian model wrapper implementation

The Hessian wrapper is correctly applied after the base model creation. Let's verify the wrapper implementation:

✅ Verification successful

The previous ast-grep patterns didn't work because of syntax issues. Let's try with ripgrep to find the implementation and usage patterns, along with checking the actual file content to understand the context better.


Based on the search results, I can now generate the final response:

Hessian model wrapper is correctly implemented and follows established patterns

The verification shows that:

  • The Hessian wrapper is properly applied after the base model creation (DPEnergyModel_ = make_hessian_model(DPEnergyModel_))
  • The wrapper follows the project's model decoration pattern, similar to other model wrappers like make_model
  • The implementation includes proper Hessian functionality handling through enable_hessian() and output translation in translated_output_def()
  • Test coverage exists in test_make_hessian_model.py validating the wrapper's functionality
🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Check the make_hessian_model implementation and its usage pattern
ast-grep --pattern 'def make_hessian_model($_) {
  $$$
}'

# Check for similar wrapper patterns in the codebase
ast-grep --pattern '$_ = make_$_($_)'

Length of output: 99


Script:

#!/bin/bash
# Search for make_hessian_model definition and usage with ripgrep
rg "make_hessian_model" -A 5

# Search for similar model wrapper patterns
rg "make_.+_model" -A 5

# Look for the actual file content
cat deepmd/pt/model/model/ener_model.py

Length of output: 23253


99-100: Consider adding Hessian support to forward_lower method

While Hessian handling is correctly implemented in the forward pass, consider adding corresponding support in the forward_lower method for consistency.

Author

@1azyking replied to a question asked in the thread:

I'm very curious about this Hessian DP model:

  • Where do the Hessian labels come from? I've heard they are generated with Phonopy from ABACUS/VASP output; is this true?
  • Is this Hessian model compatible with DPA-1/DPA-2?
  • The most applicable scenario for the Hessian model is TS search. Can the Hessian from the DP model be used by a TS search method such as Sella (which is used by NewtonNet and developed by the same developers)?

Thanks!

Thank you for your comment! Here's a response to your questions:

  1. The Hessian labels can come from various sources. To the best of my knowledge:
  • In VASP, you can calculate the second-derivative matrix by setting the IBRION parameter.

  • In Gaussian, adding the freq keyword enables the calculation of the Hessian matrix.

  • In ABACUS, the Hessian matrix can be output by setting the FULL_FORCE_CONSTANTS parameter.

  • In CP2K, the Hessian matrix can also be obtained by configuring the input file appropriately.

    Note that the Hessian-related output files from these programs often need to be processed with tools such as Phonopy or cclib to extract the Hessian matrix.

  2. The Hessian model is compatible with both DPA-1 and DPA-2. However, since we compute the Hessian using the torch.autograd.functional.hessian method, the Hessian model currently supports only the PyTorch backend (a minimal sketch of this mechanism follows this list).

  3. The Hessian matrix has many applications, such as simulating vibrational-rotational spectra and performing transition-state (TS) searches. The branch under development focuses on enabling the model to fit and predict Hessian matrices; most of our adjustments are concentrated on the training and testing functionality (please refer to the documentation for more details). Methods such as Sella may require further development before they can be integrated with the Hessian model.
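A minimal sketch of the autograd mechanism mentioned in point 2, using a toy energy function rather than the actual DeePMD model (illustrative only; the real implementation wraps the model via make_hessian_model):

import torch

# Toy scalar energy as a function of flattened coordinates; autograd
# differentiates it twice to produce a (3*natoms) x (3*natoms) Hessian.
def toy_energy(coords):
    pos = coords.reshape(-1, 3)
    dists = torch.pdist(pos)           # pairwise interatomic distances
    return ((dists - 1.0) ** 2).sum()  # simple harmonic toy potential

natoms = 4
coords = torch.randn(3 * natoms, dtype=torch.float64)
hess = torch.autograd.functional.hessian(toy_energy, coords)
print(hess.shape)  # torch.Size([12, 12]), i.e. (3 * natoms, 3 * natoms)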

If you have any additional questions, feel free to leave another comment!

Contributor

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 1

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 19d0cd3 and 7dead9c.

📒 Files selected for processing (1)
  • deepmd/pt/model/model/ener_model.py (4 hunks)
🔇 Additional comments (4)
deepmd/pt/model/model/ener_model.py (4)

18-20: LGTM: Clean import addition

The new import for make_hessian_model is properly organized with other model-related imports.


39-39: LGTM: Clean initialization of Hessian flag

The _hessian_enabled flag is properly initialized with a sensible default value.


63-64: LGTM: Clean addition of Hessian output definition

The Hessian output is properly guarded by the enabled flag and follows the existing pattern.


100-101: Verify Hessian support in forward_lower method

While the Hessian support is properly added to the forward method, please verify if similar changes are needed in the forward_lower method for consistency.

Let's check if there are any references to Hessian computation in lower-level implementations:

Inline review comment on deepmd/pt/model/model/ener_model.py (resolved)
Contributor

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 0

🧹 Nitpick comments (2)
source/tests/pt/test_change_bias.py (2)

126-129: Consider refactoring common verification logic

The verification logic is duplicated across test methods. Consider extracting the common model initialization and verification code into a helper method.

+ def _verify_bias_change(self, state_dict, expected_bias):
+     model_params = state_dict["model"]["_extra_state"]["model_params"]
+     model_for_wrapper = get_model_for_wrapper(
+         model_params,
+         _loss_params=self.loss_params,
+     )
+     wrapper = ModelWrapper(model_for_wrapper)
+     wrapper.load_state_dict(state_dict["model"])
+     updated_bias = wrapper.model["Default"].get_out_bias()
+     torch.testing.assert_close(updated_bias, expected_bias)

150-153: Consider parameterizing the test with more test cases

The test could be enhanced by using @pytest.mark.parametrize to test various user-defined bias values, including edge cases and invalid inputs.

Example parameterization:

@pytest.mark.parametrize("user_bias", [
    [0.1, 3.2, -0.5],  # current case
    [0.0, 0.0, 0.0],   # zero bias
    [-1e6, 1e6, 0],    # extreme values
])
def test_change_bias_with_user_defined(self, user_bias):
    # existing test code
📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 7dead9c and 6a31b02.

📒 Files selected for processing (1)
  • source/tests/pt/test_change_bias.py (4 hunks)
🔇 Additional comments (2)
source/tests/pt/test_change_bias.py (2)

90-90: LGTM: Loss parameters initialization in setUp

The initialization of loss parameters from the config is correctly placed in the setUp method, ensuring fresh configuration for each test case.


100-103: Verify test compatibility with Hessian mode

The test now correctly passes loss parameters to the model wrapper. However, we should verify that this test works correctly when Hessian mode is enabled.

Contributor

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 1

🧹 Nitpick comments (1)
deepmd/pt/train/training.py (1)

1228-1231: Consider improving function name and simplifying logic

While the function logic is correct, consider these improvements for better readability:

  1. Rename to is_hessian_required or should_use_hessian to better reflect its purpose
  2. Simplify to a single return statement
-def whether_hessian(loss_params):
-    loss_type = loss_params.get("type", "ener")
-    return loss_type == "ener" and loss_params.get("start_pref_h", 0.0) > 0.0
+def is_hessian_required(loss_params):
+    return loss_params.get("type", "ener") == "ener" and loss_params.get("start_pref_h", 0.0) > 0.0
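As a quick standalone sanity check of this activation logic (re-implementing the function above for illustration):

def whether_hessian(loss_params):
    loss_type = loss_params.get("type", "ener")
    return loss_type == "ener" and loss_params.get("start_pref_h", 0.0) > 0.0

assert whether_hessian({"type": "ener", "start_pref_h": 10.0})
assert not whether_hessian({"type": "ener"})            # default pref is 0.0
assert not whether_hessian({"start_pref_h": -1.0})      # non-positive pref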
📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 6a31b02 and 550178e.

📒 Files selected for processing (2)
  • deepmd/pt/train/training.py (5 hunks)
  • deepmd/utils/argcheck.py (2 hunks)
🧰 Additional context used
📓 Learnings (1)
deepmd/utils/argcheck.py (1)
Learnt from: 1azyking
PR: deepmodeling/deepmd-kit#4169
File: deepmd/utils/argcheck.py:1982-2117
Timestamp: 2024-11-12T05:47:21.643Z
Learning: The `loss_ener_hess` and `loss_ener` functions should remain separate to avoid confusion, despite code duplication.
🔇 Additional comments (4)
deepmd/utils/argcheck.py (2)

2179-2180: LGTM! Documentation strings are clear and consistent.

The documentation strings for the new Hessian prefactor parameters follow the established pattern and clearly explain their purpose.


2235-2248: LGTM! Implementation of Hessian prefactor parameters is correct.

The implementation:

  • Uses consistent types ([float, int]) with other prefactor parameters
  • Sets appropriate default values (0.0) to make Hessian loss opt-in
  • Follows the same argument structure as other loss prefactors
deepmd/pt/train/training.py (2)

28-28: LGTM!

The import of EnergyHessianStdLoss is correctly placed with other loss imports.


1235-1238: LGTM!

The Hessian loss integration follows the established pattern and correctly:

  1. Uses whether_hessian for conditional activation
  2. Sets required parameters
  3. Returns appropriate loss type

Inline review comment on deepmd/pt/train/training.py (resolved)
Contributor

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 0

🧹 Nitpick comments (4)
doc/model/train-energy-hessian.md (4)

44-50: Enhance data format explanation.

Consider adding more details about the Hessian matrix structure, as sketched below:

  1. Example of how the matrix elements are ordered
  2. Relationship between atomic positions and matrix indices
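One way to make the ordering concrete (a sketch; the indexing convention is assumed from the "3Na*3Na matrix in row-major order" description used elsewhere in this documentation):

# Assumed index mapping for a 3N x 3N Hessian stored row-major, with atoms
# ordered as in coord.npy and each atom contributing x, y, z rows/columns:
#   H[3*i + a, 3*j + b] == d2E / (d r_i[a] d r_j[b]),  a, b in {0, 1, 2}
def hessian_element(h_flat, natoms, i, a, j, b):
    dof = 3 * natoms
    return h_flat.reshape(dof, dof)[3 * i + a, 3 * j + b]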

52-92: Add practical training examples.

Consider adding:

  1. Example of training from scratch with a complete input.json
  2. Example of fine-tuning from a pretrained model with specific parameter adjustments

Would you like me to generate these example configurations?

🧰 Tools
🪛 Markdownlint (0.37.0)

84-84: null
Fenced code blocks should have a language specified

(MD040, fenced-code-language)


100-100: Fix grammar in conditional statement.

Change "If one do freeze" to "If one does freeze"

🧰 Tools
🪛 LanguageTool

[grammar] ~100-~100: Possible subject-verb agreement error.
Context: ...output Hessian predictions. ::: If one do freeze and test a Hessian model using t...

(ONE_AGREEMENT)


139-139: Fix subject-verb agreement.

Change "the predicted Hessian for each frame are output" to "the predicted Hessian for each frame is output"

🧰 Tools
🪛 LanguageTool

[uncategorized] ~139-~139: This verb does not appear to agree with the subject. Consider using a different form.
Context: ...d, the predicted Hessian for each frame are output in an additional file in the wor...

(AI_EN_LECTOR_REPLACEMENT_VERB_AGREEMENT)

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 550178e and 5f07d2b.

📒 Files selected for processing (1)
  • doc/model/train-energy-hessian.md (1 hunks)
🧰 Additional context used
🪛 LanguageTool
doc/model/train-energy-hessian.md

[grammar] ~100-~100: Possible subject-verb agreement error.
Context: ...output Hessian predictions. ::: If one do freeze and test a Hessian model using t...

(ONE_AGREEMENT)


[uncategorized] ~139-~139: This verb does not appear to agree with the subject. Consider using a different form.
Context: ...d, the predicted Hessian for each frame are output in an additional file in the wor...

(AI_EN_LECTOR_REPLACEMENT_VERB_AGREEMENT)

🪛 Markdownlint (0.37.0)
doc/model/train-energy-hessian.md

35-35: null
Fenced code blocks should have a language specified

(MD040, fenced-code-language)


84-84: null
Fenced code blocks should have a language specified

(MD040, fenced-code-language)


119-119: null
Fenced code blocks should have a language specified

(MD040, fenced-code-language)


141-141: null
Fenced code blocks should have a language specified

(MD040, fenced-code-language)


147-147: null
Fenced code blocks should have a language specified

(MD040, fenced-code-language)

🔇 Additional comments (4)
doc/model/train-energy-hessian.md (4)

1-5: Add frontmatter and TOC entry.

Based on the project's documentation standards, this document needs:

  1. A frontmatter section
  2. An entry in the table of contents

Apply this change:

+---
+title: Train with energy Hessian
+---
+
 # Fit energy Hessian {{ pytorch_icon }}

14-25: Update loss type to match implementation.

The loss type should be "ener_hess" for training with Hessian calculations.

Apply this change:

   "loss": {
-      "type": "ener",
+      "type": "ener_hess",
       "start_pref_e": 0.02,

35-42: Specify language for file structure code block.

Add the text language identifier to the code block for better formatting.

Apply this change:

-```
+```text
 type.raw
🧰 Tools
🪛 Markdownlint (0.37.0)

35-35: null
Fenced code blocks should have a language specified

(MD040, fenced-code-language)


119-122: Add language identifiers to code blocks.

Add appropriate language identifiers to the code blocks:

Apply these changes:

-```
+```text
 ${output_prefix}.e.out   ${output_prefix}.e_peratom.out  ${output_prefix}.f.out
-```
+```text
 ${output_prefix}.h.out
-```
+```text
 # frame - 0: data_h pred_h (3Na*3Na matrix in row-major order)

Also applies to: 141-143, 147-158

🧰 Tools
🪛 Markdownlint (0.37.0)

119-119: null
Fenced code blocks should have a language specified

(MD040, fenced-code-language)

Contributor

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 0

🧹 Nitpick comments (6)
deepmd/entrypoints/test.py (3)

306-307: Consider revising the Hessian data dimension.

The Hessian matrix has dimensions 3N×3N (where N is the number of atoms) as it represents second derivatives with respect to atomic coordinates. Setting ndof=1 might not be the most intuitive choice.

Consider one of these alternatives:

  1. Set ndof=9 to represent the 9 components per atom pair
  2. Add a comment explaining why ndof=1 was chosen

357-359: Consider using named indices for better maintainability.

Using a hardcoded index (3) for retrieving the Hessian makes the code fragile to future changes. Consider using named constants or an enum to make the code more maintainable.

+    # Define constants for return value indices
+    ENERGY_IDX, FORCE_IDX, VIRIAL_IDX, HESSIAN_IDX = 0, 1, 2, 3
     if dp.has_hessian:
-        hessian = ret[3]
+        hessian = ret[HESSIAN_IDX]
         hessian = hessian.reshape([numb_test, -1])

545-560: Improve the Hessian output header description.

The header could be more descriptive to help users understand the structure of the Hessian matrix data.

             save_txt_file(
                 detail_path.with_suffix(".h.out"),
                 h,
-                header=f"{system}: data_h pred_h (3Na*3Na matrix in row-major order)",
+                header=f"{system}: data_h pred_h (3N×3N Hessian matrix in row-major order, where N is the number of atoms)",
                 append=append_detail,
             )
deepmd/pt/train/training.py (3)

1228-1231: Consider improving the function name and implementation.

The function correctly determines if Hessian calculations are required, but could be improved:

  1. Rename to is_hessian_required for better clarity
  2. Add docstring explaining the function's purpose
-def whether_hessian(loss_params):
+def is_hessian_required(loss_params):
+    """Check if Hessian calculations are required based on loss parameters.
+    
+    Args:
+        loss_params: Dictionary containing loss configuration
+        
+    Returns:
+        bool: True if Hessian calculations should be performed
+    """
     loss_type = loss_params.get("type", "ener")
     return loss_type == "ener" and loss_params.get("start_pref_h", 0.0) > 0.0

1235-1237: LGTM: Hessian loss handling is correctly implemented.

The code properly checks for Hessian requirements and returns the appropriate loss function. Consider adding a comment explaining when Hessian loss is used.

+    # Use Hessian loss when energy type model has Hessian preference enabled
     if whether_hessian(loss_params):
         loss_params["starter_learning_rate"] = start_lr
         return EnergyHessianStdLoss(**loss_params)

Line range hint 1287-1304: Consider improving the architectural design.

The current implementation determines the model type (Hessian vs. non-Hessian) based on loss parameters, which creates tight coupling between model and loss configurations. This could make the system harder to maintain and extend.

Consider these alternatives:

  1. Introduce a dedicated model type (e.g., "ener_hess")
  2. Move Hessian configuration to model parameters
  3. Create a separate configuration section for Hessian settings

Example:

model_params = {
    "type": "ener_hess",  # Explicit model type
    # ... other parameters
}

This would:

  • Improve separation of concerns
  • Make configuration more explicit
  • Reduce coupling between components
📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 5f07d2b and e95c140.

📒 Files selected for processing (6)
  • deepmd/entrypoints/test.py (8 hunks)
  • deepmd/infer/deep_eval.py (3 hunks)
  • deepmd/pt/infer/deep_eval.py (5 hunks)
  • deepmd/pt/train/training.py (5 hunks)
  • deepmd/utils/argcheck.py (2 hunks)
  • doc/model/index.rst (1 hunks)
🚧 Files skipped from review as they are similar to previous changes (2)
  • doc/model/index.rst
  • deepmd/pt/infer/deep_eval.py
🧰 Additional context used
📓 Learnings (1)
deepmd/utils/argcheck.py (1)
Learnt from: 1azyking
PR: deepmodeling/deepmd-kit#4169
File: deepmd/utils/argcheck.py:1982-2117
Timestamp: 2024-11-12T05:47:21.643Z
Learning: The `loss_ener_hess` and `loss_ener` functions should remain separate to avoid confusion, despite code duplication.
🪛 Ruff (0.8.2)
deepmd/entrypoints/test.py

619-619: Use `key in dict` instead of `key in dict.keys()`

Remove `.keys()`

(SIM118)

🔇 Additional comments (7)
deepmd/utils/argcheck.py (2)

2182-2183: Documentation looks good!

The documentation for the new Hessian parameters follows the established pattern and clearly describes their purpose.


2238-2251: Implementation looks good, verify usage in loss calculation.

The new Hessian parameters are properly implemented with appropriate defaults and optional flags. The implementation maintains backward compatibility and follows the project's conventions.

Run the following script to verify the usage of these parameters in the loss calculation code:

✅ Verification successful

Parameters are properly integrated into loss calculation

The verification shows that the new Hessian parameters are well integrated into the codebase:

  • In deepmd/pt/loss/ener.py:

    • Parameters are properly documented with clear descriptions
    • Used to control Hessian loss computation via has_h flag
    • Correctly interpolated between start and limit values using pref_h
    • Applied to the Hessian loss calculation when Hessian data is available
  • In deepmd/pt/train/training.py:

    • The whether_hessian() function properly checks for Hessian activation using start_pref_h
  • Test coverage exists in source/tests/pt/test_loss.py validating the parameters' usage

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: Verify that the new Hessian parameters are properly used in loss calculation code
# Test: Search for usage of start_pref_h and limit_pref_h in loss calculation files

# Look for Hessian loss calculation in Python files
rg -A 5 "start_pref_h|limit_pref_h" --type py

# Look for Hessian tensor calculation
ast-grep --pattern 'hessian($_, $_)'

Length of output: 4500

deepmd/infer/deep_eval.py (3)

78-78: LGTM: Backend mapping for Hessian output is correctly defined.

The addition of "energy_derv_r_derv_r": "hessian" follows the established pattern and correctly maps the energy second derivative to the Hessian backend implementation.


278-280: LGTM: Base method for Hessian support check is properly implemented.

The method follows the established pattern of other feature check methods in the class, providing a default implementation that can be overridden by derived classes.


551-554: LGTM: Property for Hessian support check is properly implemented.

The property correctly delegates to the backend's get_has_hessian() method, following the established pattern of other similar properties in the class.

deepmd/entrypoints/test.py (1)

589-591: LGTM: Hessian metrics are correctly added to the return dictionary.

The Hessian MAE and RMSE metrics are properly included in the return dictionary with their corresponding sizes.
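For reference, a minimal sketch of the MAE/RMSE reductions over the flattened Hessian arrays (illustrative; the actual implementation lives in deepmd/entrypoints/test.py):

import numpy as np

def mae(pred, label):
    return float(np.mean(np.abs(pred - label)))

def rmse(pred, label):
    return float(np.sqrt(np.mean((pred - label) ** 2)))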

deepmd/pt/train/training.py (1)

268-277: LGTM: Loss parameter initialization correctly handles Hessian activation.

The code properly initializes loss parameters for both single-task and multi-task scenarios, ensuring Hessian activation is correctly configured.

Contributor

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 0

🧹 Nitpick comments (4)
doc/model/train-energy-hessian.md (4)

84-92: Add language specification to the loss output code block.

Specify 'text' as the language for the fenced code block to improve documentation consistency.

🧰 Tools
🪛 Markdownlint (0.37.0)

84-84: null
Fenced code blocks should have a language specified

(MD040, fenced-code-language)


100-100: Fix grammar in conditional statement.

Change "If one do freeze" to "If one does freeze" to correct the subject-verb agreement.

-If one do freeze and test a Hessian model using the commands:
+If one does freeze and test a Hessian model using the commands:
🧰 Tools
🪛 LanguageTool

[grammar] ~100-~100: Possible subject-verb agreement error.
Context: ...output Hessian predictions. ::: If one do freeze and test a Hessian model using t...

(ONE_AGREEMENT)


119-122: Add language specification to output format code blocks.

Specify 'text' as the language for these fenced code blocks to maintain consistency throughout the documentation.

Also applies to: 141-143, 147-158

🧰 Tools
🪛 Markdownlint (0.37.0)

119-119: null
Fenced code blocks should have a language specified

(MD040, fenced-code-language)


96-98: Enhance warning about model freezing limitations.

Consider expanding the warning to explain:

  1. Why freezing is not supported for Hessian models
  2. What functionality is lost when forcing freezing
  3. Alternative approaches if freezing is required
📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between e95c140 and d9602a1.

📒 Files selected for processing (1)
  • doc/model/train-energy-hessian.md (1 hunks)
🧰 Additional context used
🪛 LanguageTool
doc/model/train-energy-hessian.md

[grammar] ~100-~100: Possible subject-verb agreement error.
Context: ...output Hessian predictions. ::: If one do freeze and test a Hessian model using t...

(ONE_AGREEMENT)

🪛 Markdownlint (0.37.0)
doc/model/train-energy-hessian.md

35-35: null
Fenced code blocks should have a language specified

(MD040, fenced-code-language)


84-84: null
Fenced code blocks should have a language specified

(MD040, fenced-code-language)


119-119: null
Fenced code blocks should have a language specified

(MD040, fenced-code-language)


141-141: null
Fenced code blocks should have a language specified

(MD040, fenced-code-language)


147-147: null
Fenced code blocks should have a language specified

(MD040, fenced-code-language)

🔇 Additional comments (3)
doc/model/train-energy-hessian.md (3)

1-5: Add standard frontmatter and TOC entry.

Following the project's documentation standards, please add the frontmatter section and include this document in the table of contents.


14-25: Update loss type to match implementation.

The loss type should be "ener_hess" instead of "ener" when training with Hessian calculations.


35-42: Specify language for fenced code block and add example data.

  1. Add language specification to the code block.

  2. Consider adding a small example snippet showing the actual content format of hessian.npy to help users understand the data structure better.

🧰 Tools
🪛 Markdownlint (0.37.0)

35-35: null
Fenced code blocks should have a language specified

(MD040, fenced-code-language)

Contributor

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 0

♻️ Duplicate comments (1)
examples/hessian/single_task/input.json (1)

79-90: ⚠️ Potential issue

Add Hessian-specific loss configuration.

The current loss configuration uses type "ener" without any Hessian-related terms, which is inconsistent with the PR's objective of training with energy Hessian.

Apply this diff to add Hessian-specific parameters:

   "loss": {
-    "type": "ener",
+    "type": "ener_hess",
     "start_pref_e": 0.02,
     "limit_pref_e": 1,
     "start_pref_f": 1000,
     "limit_pref_f": 1,
     "start_pref_v": 0,
     "limit_pref_v": 0,
     "start_pref_h": 10,
     "limit_pref_h": 1,
+    "start_pref_hess": 0.1,
+    "limit_pref_hess": 1,
     "_comment": " that's all"
   }
🧹 Nitpick comments (2)
examples/hessian/single_task/input.json (2)

97-97: Consider increasing the batch size.

A batch size of 1 might lead to:

  1. Unstable training due to high variance in gradients
  2. Inefficient GPU utilization
  3. Slower convergence

Consider increasing the batch size to improve training stability and efficiency.

Also applies to: 104-104


94-96: Use absolute paths or environment variables for data paths.

The relative paths to training and validation data might cause issues if the working directory changes. Consider:

  1. Using absolute paths
  2. Using environment variables
  3. Adding a base path configuration option

Also applies to: 101-103

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between cf7b6b4 and 56ff530.

📒 Files selected for processing (1)
  • examples/hessian/single_task/input.json (1 hunks)
🔇 Additional comments (3)
examples/hessian/single_task/input.json (3)

11-58: Update DPA2 configuration as per latest example.

As noted in previous comments, the DPA2 configuration appears to be outdated. Please update it according to examples/water/dpa2, preferably using the small configuration for faster testing.

Let's verify the latest DPA2 configuration:

#!/bin/bash
# Description: Compare with the latest DPA2 configuration
# Check if the water example exists and compare configurations
fd -t f "input.json" -p "examples/water/dpa2"

72-78: LGTM: Learning rate configuration is well-structured.

The exponential decay setup with appropriate start and stop learning rates should work well for training.


92-92: Verify Hessian data in statistics file.

Please ensure that the HDF5 statistics file contains the necessary Hessian-related statistics for proper scaling during training.

@wanghan-iapcm wanghan-iapcm added this pull request to the merge queue Dec 27, 2024
Merged via the queue into deepmodeling:devel with commit c5ad841 Dec 27, 2024
61 checks passed