AgentKernelArena provides an end-to-end, siloed benchmarking environment in which different LLM-powered agents (such as Cursor Agent, Claude Code, Codex, SWE-agent, and GEAK) can be evaluated side by side on the same GPU kernel tasks using objective, reproducible metrics.
Repositories
AMD-AGI Repositories
AMD-AGI/AgentKernelArena (Python)
(17 stars) (7 forks) (0 indexed issues) (0 open good first issues)
AMD-AGI/Apex (Python)
Agents and an RL environment for optimizing GPU kernels on AMD ROCm with LLM agents. Benchmarks LLM serving workloads end-to-end, profiles bottleneck kernels, optimizes them via Claude Code or Codex, and scores the results on compilation, correctness, and speedup.
(68 stars) (9 forks) (0 indexed issues) (0 open good first issues)
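The three scoring criteria named in the Apex description (compilation, correctness, speedup) can be read as a gated pipeline: a kernel that fails to compile or produces wrong output earns nothing, and only a correct kernel is credited with its speedup. The sketch below is an illustration of that idea only. The `KernelResult` type, its field names, and the gating rule are assumptions made here and are not taken from the Apex repository, which may weight or combine the criteria differently.

```python
from dataclasses import dataclass


@dataclass
class KernelResult:
    """Hypothetical record of one optimization attempt (not Apex's schema)."""
    compiled: bool        # did the optimized kernel build?
    correct: bool         # did its output match the reference?
    baseline_ms: float    # runtime of the original kernel
    optimized_ms: float   # runtime of the optimized kernel


def speedup(result: KernelResult) -> float:
    """Baseline runtime divided by optimized runtime (greater than 1 is faster)."""
    if result.optimized_ms <= 0:
        raise ValueError("optimized_ms must be positive")
    return result.baseline_ms / result.optimized_ms


def score(result: KernelResult) -> float:
    """Assumed gating rule: no credit unless the kernel compiles and is correct."""
    if not result.compiled or not result.correct:
        return 0.0
    return speedup(result)


if __name__ == "__main__":
    attempt = KernelResult(compiled=True, correct=True,
                           baseline_ms=2.0, optimized_ms=1.0)
    print(score(attempt))
```

Gating on correctness before measuring speedup keeps a fast but wrong kernel from outranking a slower correct one, which is one common reason to order the checks this way.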
AMD-AGI/Magpie (Python)
A lightweight, general-purpose framework for evaluating and benchmarking GPU kernels.
(53 stars) (6 forks) (0 indexed issues) (0 open good first issues)