by Fede Nolasco | Mar 19, 2024
The web page introduces a Vision model with 1.6B parameters built using SigLIP, Phi-1.5 and the LLaVA training dataset. The model weights are licensed under CC-BY-SA due to using the LLaVA dataset. The model can be tried out on Hugging Face Spaces.