Open Compass Leaderboard
Explore the Open Compass Leaderboard, a large language model evaluation system and open-source hub for efficient model evaluation.
The BIRD dataset supports large-scale text-to-SQL evaluation, setting new standards in semantic parsing.
EvalPlus provides more rigorous testing of LLM-generated code with HumanEval+ and MBPP+.
Chatbot Arena: Revolutionizing the benchmarking of large language models with community participation and advanced evaluation mechanisms.
Mamba is a neural network architecture built on structured state space models and hardware-aware parallel algorithms. It is reported to surpass Transformers in efficiency, with a 5x increase in inference speed and state-of-the-art performance on a variety of tasks.
Discover how StreamingLLM enables LLMs to generalize to infinite sequence length without fine-tuning, with up to a 22.2x speedup over sliding-window recomputation. It keeps models such as Llama-2, MPT, Falcon, and Pythia stable and efficient on inputs of up to 4 million tokens, and a dedicated placeholder token further improves streaming deployment.