Open Compass Leaderboard
Explore the Open Compass Leaderboard, a large language model evaluation system and open-source hub for efficient model evaluation.
The BIRD dataset leads large-scale text-to-SQL evaluation, setting new standards in semantic parsing.
EvalPlus provides enhanced testing of LLM-generated code with HumanEval+ and MBPP+.