-
-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
minikube cannot detect the GPUs #2
Comments
Perhaps you need k8s-device-plugin? |
Thanks, but I have tried this. I start from the section "Enabling GPU Support in Kubernetes". I think this image has done the work before this section, I am not sure if it is right?
Then I run the command:
The pod status is
I also tried to start from the Preparing your GPU Nodes. But there is some difficulty with |
Perhaps https://github.com/NVIDIA/k8s-device-plugin?tab=readme-ov-file#configure-containerd or https://github.com/NVIDIA/k8s-device-plugin?tab=readme-ov-file#configure-docker is the problematic location. |
I am trying to implement these, but |
|
Still not work.
|
I also tried this document: https://github.com/intelligent-machine-learning/dlrover/blob/master/docs/tutorial/gpu_user_guide.md, similar to the Nvidia's document. Still get the same result.
|
I've updated the NVIDIA container toolkit. Please see if this might solve anything. |
Hello @darrenglow \ CC @ehfd How will you be able to run Minikube inside a container? I have been trying for a long time, but I keep getting OCI and Cgroup errors. Can you help me with this? |
I have no idea... Perhaps try KinD? |
Sure I'll try it...Thanks |
https://www.substratus.ai/blog/kind-with-gpus/ Both actually look relevant/applicable here too. |
@ehfd I have tried but it did'nt work :(( |
NVIDIA/k8s-device-plugin#332 (comment) |
Thank you, @ehfd . I’ve already explored that resource, but unfortunately, it didn’t work too. However, I’ve now switched to using virtual machines. |
I used your image to create a container. In the container, I installed minikube. When I run
minikube start
, the nodeminikube
didn't detect any GPU. I am wondering how to fix this. By the way, the commandnvidia-smi
works well.Then the result shows that the GPU num is
<none>
When I run a pod, the pod status is:
The text was updated successfully, but these errors were encountered: