HellaSwag

Discover how the HellaSwag benchmark assesses a model’s common sense reasoning ability by presenting scenarios that test implicit knowledge about the world.

Read More