stanford-crfm/helm

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparent evaluation of foundation models, including large language models (LLMs) and multimodal models.

PythonStars 2578Forks 346Watchers 2578Open issues 139License Apache License 2.0

Details

仓库信息

Ownerstanford-crfm

Homepagehttps://crfm.stanford.edu/helm

GitHubhttps://github.com/stanford-crfm/helm

Last pushed2025-12-12

Last updated2025-12-13

Issues fetched at—

stanford-crfm/helm

Community at a glance