Repositórios

Repositórios de open-compass

[ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2 & [ICLR 2025] Mask-DPO

Último commit 30 de abr. de 2025

 (65 stars) (3 forks) (0 issues indexadas) (0 good first issues abertas)

Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent with a hierarchical manner across multiple platforms, including Windows, Linux, macOS, iOS, Android and Web.

Último commit 8 de set. de 2025

 (111 stars) (5 forks) (0 issues indexadas) (0 good first issues abertas)

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Último commit 28 de mai. de 2026

 (7.047 stars) (780 forks) (1 issue indexada) (1 good first issue aberta)