sMiNT0S/AIBugBench

From prompt to paste: evaluate AI / LLM output under a strict Python sandbox and get actionable scores across 7 categories, including security, correctness and upkeep.

Python | Stars: 1 | Forks: 1 | Watchers: 1 | Open issues: 4 | License: Apache License 2.0

Details

Repository information

Owner: sMiNT0S

Homepage: https://smint0s.github.io/AIBugBench/

GitHub: https://github.com/sMiNT0S/AIBugBench

Last pushed: 2025-12-01

Last updated: 2025-12-14

Community at a glance