Glossary

Self-Rewarding Language Models

Posted by Fede Nolasco | Jan 21, 2024

Self-Rewarding Language Models (SRLMs) utilize an LLM’s own judgment to provide feedback...

SELF-ALIGN

Posted by Fede Nolasco | Jan 21, 2024

SELF-ALIGN, a novel approach that combines principle-driven reasoning with the generative power of LLMs, aligns AI agents with minimal human supervision.

RLAIF, a reinforcement learning technique

Posted by Fede Nolasco | Jan 21, 2024

RLAIF, a reinforcement learning technique that uses AI feedback to train large language models (LLMs), offers a scalable and cost-effective alternative to RLHF.