LLaMA Model Inference in C/C++
Discover efficient LLaMA model inference in pure C/C++ for cutting-edge local and cloud-based performance on a wide range of hardware platforms.
Use the Python bindings for llama.cpp (llama-cpp-python) to add efficient, hardware-accelerated LLaMA inference to your projects, backed by detailed documentation and an OpenAI-compatible API.
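As a minimal sketch of the OpenAI compatibility mentioned above: llama-cpp-python ships a local server that exposes an OpenAI-style `/v1/chat/completions` endpoint, so a request can be built with nothing but the standard library. The model path and server address below are assumptions for illustration; the server would be started separately (e.g. `python -m llama_cpp.server --model model.gguf`).

```python
import json
import urllib.request

def build_chat_request(prompt: str, base_url: str = "http://localhost:8000"):
    """Build an OpenAI-style chat-completion request for a local llama.cpp server.

    The base_url is a hypothetical default; adjust it to wherever the
    llama-cpp-python server is actually listening.
    """
    payload = {
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": 64,
        "temperature": 0.7,
    }
    # Attaching a data body makes urllib issue a POST, matching the
    # OpenAI chat-completions API shape.
    return urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )

if __name__ == "__main__":
    req = build_chat_request("Hello, llama!")
    print(req.full_url)
```

Because the payload follows the OpenAI schema, existing OpenAI client libraries can usually be pointed at the local server unchanged by overriding their base URL.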
Explore the Zephyr model through the Hugging Face API: set up your environment, run the model effectively, and build a user-friendly front end with Chainlit in this comprehensive guide.