Clarify the GPU model name format for the price_target_gpu_mappings config #172

Open
andy108369 opened this issue Apr 26, 2024 · 0 comments

Comments

@andy108369
Contributor

I noticed that one provider set 4090 instead of rtx4090 in their price_target_gpu_mappings (in provider.yaml).

The docs need to make clear that the GPU model name in price_target_gpu_mappings must match the name that operator-inventory assigns based on the gpus.json file. That name appears in the node labels and in the provider attributes, and can be viewed by running the following commands:

kubectl get nodes --show-labels

Example

$ kubectl get nodes --show-labels
NAME    STATUS   ROLES           AGE   VERSION   LABELS
node1   Ready    control-plane   29h   v1.28.6   akash.network/capabilities.gpu.vendor.nvidia.model.rtx4090.interface.pcie=8,akash.network/capabilities.gpu.vendor.nvidia.model.rtx4090.ram.24Gi=8,akash.network/capabilities.gpu.vendor.nvidia.model.rtx4090=8,akash.network/capabilities.storage.class.beta3=1,akash.network=true,beta.kubernetes.io/arch=amd64,beta.kubernetes.io/os=linux,kubernetes.io/arch=amd64,kubernetes.io/hostname=node1,kubernetes.io/os=linux,node-role.kubernetes.io/control-plane=,node.kubernetes.io/exclude-from-external-load-balancers=,nvidia.com/gpu.present=true
node2   Ready    control-plane   29h   v1.28.6   akash.network/capabilities.gpu.vendor.nvidia.model.rtx4090.interface.pcie=8,akash.network/capabilities.gpu.vendor.nvidia.model.rtx4090.ram.24Gi=8,akash.network/capabilities.gpu.vendor.nvidia.model.rtx4090=8,akash.network/capabilities.storage.class.beta3=1,akash.network=true,beta.kubernetes.io/arch=amd64,beta.kubernetes.io/os=linux,kubernetes.io/arch=amd64,kubernetes.io/hostname=node2,kubernetes.io/os=linux,node-role.kubernetes.io/control-plane=,node.kubernetes.io/exclude-from-external-load-balancers=,nvidia.com/gpu.present=true
node3   Ready    <none>          29h   v1.28.6   akash.network/capabilities.gpu.vendor.nvidia.model.rtx4090.interface.pcie=8,akash.network/capabilities.gpu.vendor.nvidia.model.rtx4090.ram.24Gi=8,akash.network/capabilities.gpu.vendor.nvidia.model.rtx4090=8,akash.network/capabilities.storage.class.beta3=1,akash.network=true,beta.kubernetes.io/arch=amd64,beta.kubernetes.io/os=linux,kubernetes.io/arch=amd64,kubernetes.io/hostname=node3,kubernetes.io/os=linux,nvidia.com/gpu.present=true
$ kubectl -n akash-services get pod akash-provider-0 -o yaml |grep -A1 AKASH_FROM
    - name: AKASH_FROM
      value: akash1cnzkdynwd4u6j7s8z5j0fg76h3g6yhsggmuqta
...

$ provider-services query provider get akash1cnzkdynwd4u6j7s8z5j0fg76h3g6yhsggmuqta -o text
attributes:
...
...
- key: capabilities/gpu/vendor/nvidia/model/rtx4090
  value: "true"
- key: capabilities/gpu/vendor/nvidia/model/rtx4090/ram/24Gi
  value: "true"
- key: capabilities/gpu/vendor/nvidia/model/rtx4090/ram/24Gi/interface/pcie
  value: "true"
- key: capabilities/gpu/vendor/nvidia/model/rtx4090/interface/pcie
  value: "true"
...

As you can see, the model name is rtx4090, not 4090.
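
Since the full label string is hard to scan by eye, here is a rough sketch of how the check could be scripted. It is not part of the provider tooling: the function names `gpu_models_from_labels` and `check_gpu_mappings` are made up for illustration, and the second function assumes that price_target_gpu_mappings is a comma-separated list of model=price pairs in which a `*` key acts as a catch-all. Adjust the parsing if your provider.yaml looks different.

```shell
# Print the unique GPU model names found in the node labels.
# Reads `kubectl get nodes --show-labels` output on stdin.
gpu_models_from_labels() {
  grep -oE 'akash\.network/capabilities\.gpu\.vendor\.[^.]+\.model\.[^.=,]+' \
    | sed -E 's/.*\.model\.//' \
    | sort -u
}

# Report mapping keys that do not match any model name seen on the nodes.
#   $1: the price_target_gpu_mappings value (assumed "model=price,model=price,...")
#   $2: whitespace- or newline-separated model names
check_gpu_mappings() {
  # Normalize newlines to spaces so the membership test below is uniform.
  models=$(printf '%s' "$2" | tr '\n' ' ')
  printf '%s\n' "$1" | tr ',' '\n' | while IFS='=' read -r key _price; do
    [ -z "$key" ] && continue
    # Assumption: "*" is a catch-all entry, not a model name.
    [ "$key" = "*" ] && continue
    case " $models " in
      *" $key "*) ;;
      *) printf 'unknown GPU model in mapping: %s\n' "$key" ;;
    esac
  done
}

# Usage (needs cluster access; the mapping value here is only an example):
#   models=$(kubectl get nodes --show-labels | gpu_models_from_labels)
#   check_gpu_mappings '4090=100,*=130' "$models"
```

With labels like the ones above, the first function would yield rtx4090, and a mapping keyed on 4090 would be reported as unknown.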

HoomanHQ added a commit that referenced this issue May 16, 2024