AgentKernelArena provides an end-to-end, siloed benchmarking environment in which different LLM-powered agents (such as Cursor Agent, Claude Code, Codex, SWE-agent, and GEAK) can be evaluated side by side on the same GPU kernel tasks using objective, reproducible metrics.
Repositories
Repositories from AMD-AGI
AMD-AGI/AgentKernelArena (Python)
(17 stars) (7 forks) (0 indexed issues) (0 open good first issues)
AMD-AGI/Apex (Python)
Agents and an RL environment for optimizing GPU kernels on AMD ROCm with LLM agents. It benchmarks LLM serving workloads end-to-end, profiles bottleneck kernels, optimizes them via Claude Code or Codex, and scores the results on compilation, correctness, and speedup.
(68 stars) (9 forks) (0 indexed issues) (0 open good first issues)
AMD-AGI/Magpie (Python)
A lightweight, general-purpose framework for evaluating and benchmarking GPU kernels.
(53 stars) (6 forks) (0 indexed issues) (0 open good first issues)