skypilot-org/skypilot

[Kubernetes] GPU prefix matching should not allow non-existent GPU variants

Open

#9035 opened on Mar 10, 2026

View on GitHub
 (2 comments) (0 reactions) (0 assignees)Python (4,859 stars) (311 forks)batch import
good first issuegood starter issues

Description

On a K8s cluster where sky show-gpus --infra k8s shows only H100:

  • sky launch --gpus H100-MEGA succeeds (should it?)
  • sky launch --gpus 'NVIDIA:40GB+' shows H100-NVLINK, H100-SXM, H100-MEGA, H100-NVL as separate options, all resolving to the same physical H100

K8s prefix-based fuzzy matching in _accelerator_name_matches() accepts any name where H100 is a prefix (e.g., H100-MEGA, H100-SXM). The optimizer should filter against what's actually available on the cluster.

Contributor guide