Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Do not configure BlueField NICs in DPU mode #816

Open
wants to merge 2 commits into
base: master
Choose a base branch
from

Conversation

e0ne
Copy link
Collaborator

@e0ne e0ne commented Dec 13, 2024

Fixes #662

Copy link

Thanks for your PR,
To run vendors CIs, Maintainers can use one of:

  • /test-all: To run all tests for all vendors.
  • /test-e2e-all: To run all E2E tests for all vendors.
  • /test-e2e-nvidia-all: To run all E2E tests for NVIDIA vendor.

To skip the vendors CIs, Maintainers can use one of:

  • /skip-all: To skip all tests for all vendors.
  • /skip-e2e-all: To skip all E2E tests for all vendors.
  • /skip-e2e-nvidia-all: To skip all E2E tests for NVIDIA vendor.
    Best regards.

@coveralls
Copy link

coveralls commented Dec 13, 2024

Pull Request Test Coverage Report for Build 12629341153

Warning: This coverage report may be inaccurate.

This pull request's base commit is no longer the HEAD commit of its target branch. This means it includes changes from outside the original pull request, including, potentially, unrelated coverage changes.

Details

  • 0 of 14 (0.0%) changed or added relevant lines in 1 file are covered.
  • 180 unchanged lines in 3 files lost coverage.
  • Overall coverage increased (+0.1%) to 47.365%

Changes Missing Coverage Covered Lines Changed/Added Lines %
pkg/vendors/mellanox/mellanox.go 0 14 0.0%
Files with Coverage Reduction New Missed Lines %
pkg/consts/constants.go 4 50.0%
api/v1/helper.go 56 76.1%
controllers/sriovnetworknodepolicy_controller.go 120 58.66%
Totals Coverage Status
Change from base Build 12298677979: 0.1%
Covered Lines: 7254
Relevant Lines: 15315

💛 - Coveralls

log.Log.V(2).Info("mellanox-plugin: configFW(): can't get DPU mode for NIC", "pciAddress", pciAddr)
}
if bfMode == BluefieldDpu {
return errors.Errorf("NIC %s is in DPU mode", pciAddr)
Copy link
Collaborator

@adrianchiris adrianchiris Dec 16, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can you elaborate that firmware configuration changes are not supported in this case as part of the error message ?
also add a comment saying why (because reboot will not re-load firmware) and that a possible solution is to use mstfwreset b4 reboot

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we need to document that carefully as we have the fwreset feature we implement in the operator but that is not to switch the BF card mode

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

log.Log.V(2).Info("mellanox-plugin: configFW(): can't get DPU mode for NIC", "pciAddress", pciAddr)
}
if bfMode == BluefieldDpu {
// Host reboot won't re-load NIC firmware in DPU mode. To apply FW changes power cycle is required or mstfwreset could be used.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

base on this comment should we allow the configuration if the fwreset feature gate is enabled?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

IMO, it should be done as a separate effort. We need to fix a bug now to prevent infinite boot loop

Signed-off-by: Ivan Kolodiazhnyi <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Mellanox plugin should handle Bluefield card in DPU mode correctly
4 participants