Skip to content

Commit

Permalink
Add missing YAML and rephrase full A100 behavior
Browse files Browse the repository at this point in the history
Signed-off-by: davidmirror-ops <[email protected]>
  • Loading branch information
davidmirror-ops committed Jul 25, 2024
1 parent 1be52eb commit 13c5e8f
Showing 1 changed file with 11 additions and 6 deletions.
17 changes: 11 additions & 6 deletions docs/user_guide/productionizing/configuring_access_to_gpus.md
Original file line number Diff line number Diff line change
Expand Up @@ -294,16 +294,21 @@ def gpu_available() -> bool:

#### How it works?

In this case, the task is scheduled with a `2g.10gb` MIG instance profile by default.

`flytepropeller` only injects the node selector that matches nodes with an `A100` device:
flytekit uses a default `2g.10gb`partition size and `flytepropeller` injects the node selector that matches labels on nodes with an `A100` device:

```yaml
spec:
affinity:
nodeAffinity:
requiredDuringSchedulingIgnoredDuringExecution:
nodeSelectorTerms:
- matchExpressions:
- key: nvidia.com/gpu.accelerator
operator: In
values:
- nvidia-tesla-a100
```


### Request an unpartitioned A100 device
The goal is to run the task using the resources of the entire A100 GPU:

Expand Down

0 comments on commit 13c5e8f

Please sign in to comment.