open-compass repositories

[ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2 & [ICLR 2025] Mask-DPO

Last commit Apr 30, 2025

 (65 stars) (3 forks) (0 indexed issues) (0 open good first issues)

Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate GUI agents in a hierarchical manner across multiple platforms, including Windows, Linux, macOS, iOS, Android, and Web.

Last commit Sep 8, 2025

 (111 stars) (5 forks) (0 indexed issues) (0 open good first issues)

OpenCompass is an LLM evaluation platform that supports a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Last commit May 28, 2026

 (7,047 stars) (780 forks) (1 indexed issue) (1 open good first issue)