Moondream Vision Model: 1.6B Parameters, SigLIP, Phi-1.5, LLaVA Dataset
Explore a Vision model with 1.6B parameters, trained on the LLaVA dataset using SigLIP and Phi-1.5, available for trial on Hugging Face Spaces with CC-BY-SA licensed weights.
Read More