Commit
Signed-off-by: Mike McKiernan <[email protected]>
1 parent 351299e, commit b46f15c
Showing 10 changed files with 643 additions and 14 deletions.
@@ -0,0 +1,12 @@
apiVersion: v1
kind: ConfigMap
metadata:
  name: mps-config-all
data:
  mps-any: |-
    version: v1
    sharing:
      mps:
        resources:
        - name: nvidia.com/gpu
          replicas: 4
@@ -0,0 +1,22 @@
apiVersion: v1
kind: ConfigMap
metadata:
  name: mps-config-fine
data:
  mps-four: |-
    version: v1
    sharing:
      mps:
        renameByDefault: false
        resources:
        - name: nvidia.com/gpu
          replicas: 4
  mps-two: |-
    version: v1
    sharing:
      mps:
        renameByDefault: false
        resources:
        - name: nvidia.com/gpu
          replicas: 2
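
As a rough illustration of what the two `replicas` values in `mps-config-fine` imply, the following Python sketch estimates how many `nvidia.com/gpu` resources a node would advertise under each named configuration. The `MPS_CONFIGS` dict, the `advertised_gpus` helper, and the physical GPU counts are hypothetical and not part of this commit. The sketch assumes that each physical GPU is advertised `replicas` times and that the resource name stays `nvidia.com/gpu` because `renameByDefault` is `false`.

```python
# Hypothetical sketch: this dict mirrors the two entries in the
# mps-config-fine ConfigMap; it is not read from the cluster.
MPS_CONFIGS = {
    "mps-four": {"resource": "nvidia.com/gpu", "replicas": 4},
    "mps-two": {"resource": "nvidia.com/gpu", "replicas": 2},
}


def advertised_gpus(config_name: str, physical_gpus: int) -> int:
    """Return the number of schedulable GPU replicas on one node.

    Assumes each physical GPU is advertised `replicas` times, which is an
    assumption of this sketch rather than something stated in the diff.
    """
    if physical_gpus < 0:
        raise ValueError("physical_gpus must be non-negative")
    return physical_gpus * MPS_CONFIGS[config_name]["replicas"]


if __name__ == "__main__":
    for name in MPS_CONFIGS:
        # One physical GPU per node is an illustrative value only.
        print(name, advertised_gpus(name, physical_gpus=1))
```

Under these assumptions, a node with one physical GPU would offer four schedulable replicas with `mps-four` and two with `mps-two`.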
@@ -0,0 +1,32 @@
apiVersion: apps/v1
kind: Deployment
metadata:
  name: mps-verification
  labels:
    app: mps-verification
spec:
  replicas: 5
  selector:
    matchLabels:
      app: mps-verification
  template:
    metadata:
      labels:
        app: mps-verification
    spec:
      tolerations:
        - key: nvidia.com/gpu
          operator: Exists
          effect: NoSchedule
      hostPID: true
      containers:
        - name: cuda-sample-vector-add
          image: "nvcr.io/nvidia/k8s/cuda-sample:vectoradd-cuda11.7.1-ubuntu20.04"
          command: ["/bin/bash", "-c", "--"]
          args:
            - while true; do /cuda-samples/vectorAdd; done
          resources:
            limits:
              nvidia.com/gpu: 1
      nodeSelector:
        nvidia.com/gpu.sharing-strategy: mps
@@ -28,3 +28,5 @@ spec:
          resources:
            limits:
              nvidia.com/gpu: 1
      nodeSelector:
        nvidia.com/gpu.sharing-strategy: time-slicing
@@ -0,0 +1,11 @@
LAST SEEN   TYPE     REASON             OBJECT                                     MESSAGE
38s         Normal   SuccessfulDelete   daemonset/nvidia-device-plugin-daemonset   Deleted pod: nvidia-device-plugin-daemonset-l86fw
38s         Normal   SuccessfulDelete   daemonset/gpu-feature-discovery            Deleted pod: gpu-feature-discovery-shj2m
38s         Normal   Killing            pod/gpu-feature-discovery-shj2m            Stopping container gpu-feature-discovery
38s         Normal   Killing            pod/nvidia-device-plugin-daemonset-l86fw   Stopping container nvidia-device-plugin
37s         Normal   Scheduled          pod/nvidia-device-plugin-daemonset-lcklx   Successfully assigned gpu-operator/nvidia-device-plugin-daemonset-lcklx to worker-1
37s         Normal   SuccessfulCreate   daemonset/gpu-feature-discovery            Created pod: gpu-feature-discovery-pgx9l
37s         Normal   Scheduled          pod/gpu-feature-discovery-pgx9l            Successfully assigned gpu-operator/gpu-feature-discovery-pgx9l to worker-0
37s         Normal   SuccessfulCreate   daemonset/nvidia-device-plugin-daemonset   Created pod: nvidia-device-plugin-daemonset-lcklx
36s         Normal   Created            pod/nvidia-device-plugin-daemonset-lcklx   Created container config-manager-init
36s         Normal   Pulled             pod/nvidia-device-plugin-daemonset-lcklx   Container image "nvcr.io/nvidia/cloud-native/gpu-operator-validator:v24.3.0" already present on machine
@@ -0,0 +1,6 @@
NAME                                READY   STATUS    RESTARTS   AGE
mps-verification-86c99b5666-hczcn   1/1     Running   0          3s
mps-verification-86c99b5666-sj8z5   1/1     Running   0          3s
mps-verification-86c99b5666-tnjwx   1/1     Running   0          3s
mps-verification-86c99b5666-82hxj   1/1     Running   0          3s
mps-verification-86c99b5666-9lhh6   1/1     Running   0          3s
@@ -0,0 +1,13 @@
Found 5 pods, using pod/mps-verification-86c99b5666-tnjwx
[Vector addition of 50000 elements]
Copy input data from the host memory to the CUDA device
CUDA kernel launch with 196 blocks of 256 threads
Copy output data from the CUDA device to the host memory
Test PASSED
Done
[Vector addition of 50000 elements]
Copy input data from the host memory to the CUDA device
CUDA kernel launch with 196 blocks of 256 threads
Copy output data from the CUDA device to the host memory
Test PASSED
...