skypilot-org/skypilot

[Kubernetes] GPU prefix matching should not allow non-existent GPU variants

Open

Opened on Mar 10, 2026

2 comments, 0 reactions, 0 assignees. Python, 4859 stars, 311 forks.
Labels: good first issue, good starter issues

Description

On a K8s cluster where sky show-gpus --infra k8s shows only H100:

  • sky launch --gpus H100-MEGA succeeds (should it?)
  • sky launch --gpus 'NVIDIA:40GB+' shows H100-NVLINK, H100-SXM, H100-MEGA, H100-NVL as separate options, all resolving to the same physical H100

The K8s prefix-based fuzzy matching in _accelerator_name_matches() accepts any requested name that has H100 as a prefix (e.g., H100-MEGA, H100-SXM), even when no such variant exists on the cluster. The optimizer should filter candidates against the GPU types that are actually available on the cluster.
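
A minimal sketch of the difference between the two behaviors, assuming the cluster reports a plain list of GPU names. The helpers `prefix_matches` and `filter_to_available` are hypothetical and written only for illustration; they are not SkyPilot's actual `_accelerator_name_matches()` or optimizer code, whose signatures are not shown in this issue.

```python
from typing import Iterable, List


def prefix_matches(requested: str, node_gpu: str) -> bool:
    """Hypothetical stand-in for the lenient behavior described above:
    accept any requested name that starts with the node's GPU name."""
    return requested.upper().startswith(node_gpu.upper())


def filter_to_available(candidates: Iterable[str],
                        cluster_gpus: Iterable[str]) -> List[str]:
    """Hypothetical stricter filter: keep only candidates whose name is
    exactly one of the GPU names the cluster reports."""
    available = {gpu.upper() for gpu in cluster_gpus}
    return [c for c in candidates if c.upper() in available]


# What the cluster reports (assumed to mirror `sky show-gpus --infra k8s`).
cluster_gpus = ["H100"]
# Catalog variants named in this issue.
candidates = ["H100", "H100-NVLINK", "H100-SXM", "H100-MEGA", "H100-NVL"]

# Lenient prefix matching lets every variant through.
lenient = [c for c in candidates
           if any(prefix_matches(c, gpu) for gpu in cluster_gpus)]
# Filtering against the cluster's reported names leaves only the real GPU.
strict = filter_to_available(candidates, cluster_gpus)

print(lenient)
print(strict)
```

Whether exact matching is the right rule is an open question for this issue. A real fix may still need some normalization between catalog names and node labels, which this sketch does not attempt.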
