llama.cpp Alternatives

llama.cpp describes itself as follows: "The main goal of llama.cpp is to enable LLM inference with minimal setup and state-of-the-art performance on a wide range of hardware - locally and in the cloud." It is a large language model (LLM) tool in the AI Tools & Services category. There are more than 25 alternatives to llama.cpp across a variety of platforms, including Windows, Linux, Mac, Android, and iPhone. The best llama.cpp alternative is Ollama, which is both free and open source. Other great apps like llama.cpp are GPT4All, Jan.ai, AnythingLLM, and LM Studio.
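llama.cpp itself can run as a local server: its bundled `llama-server` binary exposes an OpenAI-compatible HTTP API, which is also the interface several of the alternatives below imitate. Here is a minimal sketch of calling such a local endpoint from Python; the host, port, and model name are illustrative assumptions, not verified defaults for your setup:

```python
import json
import urllib.request

# Assumed local endpoint; check the port your llama-server instance listens on.
url = "http://localhost:8080/v1/chat/completions"

payload = {
    "model": "local-model",  # hypothetical name; many local servers ignore this field
    "messages": [{"role": "user", "content": "Say hello in one sentence."}],
    "max_tokens": 64,
}

# Build the POST request without sending it yet.
req = urllib.request.Request(
    url,
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
    method="POST",
)

# Sending the request requires a running server:
# with urllib.request.urlopen(req) as resp:
#     print(json.load(resp)["choices"][0]["message"]["content"])
```

Because the request shape is the standard OpenAI one, the same snippet can be retargeted at any of the OpenAI-compatible alternatives below by changing only the URL.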


Alternatives list

  1. RWKV Chat
     2 likes

    Experience the power of RWKV models directly on your device. Completely offline, privacy-first, and efficient. No internet required.

    Cost / License

    Platforms

    • Mac
    • Windows
    • Linux
    • Android
    • iPhone
    • iPad
    • Android Tablet
     
  2. AI Playground
     2 likes

    This application provides a full suite of generative AI features for chat, code assistance, document search, image analysis, image and video generation. All features run offline and are powered by your PC’s Intel® Core™ Ultra with built-in Intel Arc GPU or Intel Arc™ dGPU...

    Cost / License

    • Free
    • Open Source (MIT)

    Platforms

    • Windows
     
  3. AI Dev Gallery

     Learn how to add AI with local models and APIs to Windows apps. Discover AI scenarios and models such as Phi, Mistral, Stable Diffusion, Whisper, and many more to delight your users. The AI Dev Gallery is an open-source app designed to help Windows developers integrate AI...

    Cost / License

    • Free
    • Open Source (MIT)

    Platforms

    • Windows
     
  4. AI00 RWKV Server

     AI00 RWKV Server is an inference API server for the RWKV language model, built on the web-rwkv inference engine.

    Cost / License

    • Free
    • Open Source (MIT)

    Platforms

    • Windows
    • Mac
    • Linux
    • Rust
     
  5. Lemonade

     Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs.

    Cost / License

    Platforms

    • Windows
    • Linux
    • Docker
    • Snapcraft
    • iPhone
    • iPad
    • Self-Hosted
    • Python
     
  6. RWKV Runner
     2 likes

    This project aims to eliminate the barriers of using large language models by automating everything for you. All you need is a lightweight executable program of just a few megabytes. Additionally, this project provides an interface compatible with the OpenAI API, which means...

    Cost / License

    • Free
    • Open Source (MIT)

    Platforms

    • Mac
    • Windows
    • Linux
    • Self-Hosted
    • Python
     
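Because RWKV Runner's interface is OpenAI-compatible, existing client code can be pointed at it unchanged, and responses follow the standard chat-completion shape. A sketch of parsing that shape (the JSON below is an illustrative mock, not real server output):

```python
import json

# Illustrative mock of an OpenAI-style chat-completion response, the shape
# an OpenAI-compatible endpoint such as RWKV Runner's is expected to return.
raw = json.dumps({
    "choices": [{"index": 0, "message": {"role": "assistant", "content": "Hi there!"}}],
    "usage": {"prompt_tokens": 5, "completion_tokens": 3, "total_tokens": 8},
})

resp = json.loads(raw)
reply = resp["choices"][0]["message"]["content"]
tokens_used = resp["usage"]["total_tokens"]
print(reply)        # → Hi there!
print(tokens_used)  # → 8
```

This drop-in compatibility is the practical payoff the entry describes: tooling written against the OpenAI API needs only a different base URL to work with the local server.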
  7. A modern web interface for managing and interacting with vLLM servers (www.github.com/vllm-project/vllm). Supports both GPU and CPU modes, with special optimizations for macOS Apple Silicon and enterprise deployment on OpenShift/Kubernetes.

    Cost / License

    Platforms

    • Python
    • Mac
    • Linux
    • Self-Hosted
    • Kubernetes
    • OpenShift
     
  8. Operit AI
     1 like

    📱 The first fully functional, standalone AI assistant for mobile devices with powerful tool-calling capabilities 📱

    Cost / License

    • Free
    • Open Source

    Platforms

    • Android
     
  9. FastFlowLM
     1 like

    Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama, but purpose-built and deeply optimized for AMD NPUs.

    Cost / License

    • Free Personal
    • Open Source

    Platforms

    • Windows
    • Online
    • Self-Hosted
     
You are at page 2 of llama.cpp alternatives