Glossary

DPO

Posted by Fede Nolasco | Jan 21, 2024

Direct Preference Optimization (DPO) is a simplified and efficient approach to fine-tuning large language models (LLMs).

Arc-c

Posted by Fede Nolasco | Jan 14, 2024

ARC-c is a challenging variation of the ARC Benchmark, designed to assess the reasoning and commonsense understanding of large language models. Learn more about this dataset and the difficulty it presents for models.

Arc-e

Posted by Fede Nolasco | Jan 14, 2024

ARC-e is an enhanced version of the ARC Benchmark, evaluating large language models’ reasoning abilities. With 1,169 challenging questions, no model has reached a 75% score yet.

1
…
197
198
199
…
202