Open Compass Leaderboard
Explore Open Compass Leaderboard, a Large language Model Evaluation System, an open-source hub for efficient model evaluation.
Read MoreExplore Open Compass Leaderboard, a Large language Model Evaluation System, an open-source hub for efficient model evaluation.
Read MoreEvalPlus software provides enhanced testing for LLM code with HumanEval+ and MBPP+.
Read MoreChatbot Arena: Revolutionizing the benchmarking of large language models with community participation and advanced evaluation mechanisms.
Read More