Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

rdma/rdma_shared_device_a resources allocation #92

Open
cairong-ai opened this issue Dec 27, 2023 · 2 comments
Open

rdma/rdma_shared_device_a resources allocation #92

cairong-ai opened this issue Dec 27, 2023 · 2 comments

Comments

@cairong-ai
Copy link

rdmaSharedDevicePlugin:
deploy: true
resources:
- name: rdma_shared_device_a
ifNames: [ibs10, ibs11, ibs18, ibs19]

I have deployed the network-operator using the configuration shown above, and I see that on the node it displays: rdma/rdma_shared_device_a: 63.
I want to know if these 63 resources are evenly distributing the bandwidth of these IB cards.
If a pod only requests 1 rdma/rdma_shared_device_a resource, how much bandwidth can this pod use?

@pepesi
Copy link

pepesi commented Dec 28, 2023

Based on my understanding, it is merely a device-plugin that exposes the host's InfiniBand (IB) devices to containers. The kubelet can mount them into the containers, allowing the containers to use these devices internally. However, it does not have any mechanisms for bandwidth partitioning or traffic limiting.

@adrianchiris
Copy link
Collaborator

adrianchiris commented Dec 28, 2023

the device plugin does not handle any BW allocation (nor am i famiiar with a kernel interface that provides that for the same IB device mounted in different containers), if there is one, let me know :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants