Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark
The MMMU benchmark evaluates how well multimodal models understand and reason across six disciplines: Art & Design, Business, Health & Medicine, Science, Humanities & Social Science, and Technology & Engineering. Its questions span 30 subjects and 183 subfields and involve a variety of image formats, such as diagrams, tables, charts, chemical structures, photographs, paintings, geometric shapes, and musical scores.