Direct Preference Optimization

Phi-4 AI model for STEM reasoning

Phi-4 AI model for STEM reasoning is a 14-billion parameter language model by Microsoft, designed to excel in reasoning and problem-solving with advanced synthetic data training and post-training optimizations.

Read More