skypilot-org/skypilot

[Kubernetes] GPU prefix matching should not allow non-existent GPU variants

Open

#9,035 created on March 10, 2026

2 comments, 0 reactions, 0 assignees. Python, 4,859 stars, 311 forks.
good first issue, good starter issues

Description

On a K8s cluster where sky show-gpus --infra k8s shows only H100:

  • sky launch --gpus H100-MEGA succeeds (should it?)
  • sky launch --gpus 'NVIDIA:40GB+' shows H100-NVLINK, H100-SXM, H100-MEGA, H100-NVL as separate options, all resolving to the same physical H100

K8s prefix-based fuzzy matching in _accelerator_name_matches() accepts any name where H100 is a prefix (e.g., H100-MEGA, H100-SXM), whether or not that variant exists on the cluster. The optimizer should filter candidates against the GPU names that are actually available on the cluster.
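
To make the proposed behavior concrete, here is a minimal sketch of the filtering step. It is not SkyPilot's code: `prefix_matches`, `resolve_on_cluster`, and `filter_candidates` are hypothetical helpers, and the real signature of `_accelerator_name_matches()` is not shown in this issue. The sketch assumes the cluster's GPU names are available as a plain set of strings, such as the names listed by sky show-gpus --infra k8s.

```python
from typing import Iterable, List, Optional, Set


def prefix_matches(requested: str, cluster_gpu: str) -> bool:
    """Loose check resembling the behavior described in this issue
    (hypothetical): any requested name that starts with the cluster's
    GPU name is accepted."""
    return requested.upper().startswith(cluster_gpu.upper())


def resolve_on_cluster(requested: str, cluster_gpus: Set[str]) -> Optional[str]:
    """Stricter check (hypothetical): return the cluster's GPU name only
    when the requested name equals it, ignoring case. Otherwise None."""
    by_upper = {name.upper(): name for name in cluster_gpus}
    return by_upper.get(requested.upper())


def filter_candidates(candidates: Iterable[str],
                      cluster_gpus: Set[str]) -> List[str]:
    """Keep only candidates that exist on the cluster, without duplicates,
    preserving the order in which they were first seen."""
    kept: List[str] = []
    for candidate in candidates:
        resolved = resolve_on_cluster(candidate, cluster_gpus)
        if resolved is not None and resolved not in kept:
            kept.append(resolved)
    return kept


if __name__ == "__main__":
    cluster = {"H100"}  # assumed: the only GPU name reported for the cluster
    catalog = ["H100-NVLINK", "H100-SXM", "H100-MEGA", "H100-NVL", "H100"]
    print(prefix_matches("H100-MEGA", "H100"))
    print(resolve_on_cluster("H100-MEGA", cluster))
    print(filter_candidates(catalog, cluster))
```

With a filter like this, the four H100-* variants would no longer appear as separate options on a cluster that only reports H100. Whether exact matching is too strict for clusters whose labels carry suffixes is a design question this sketch does not settle.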
