Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs.
Cost / License
- Free
- Open Source (Apache-2.0)
Platforms
- Windows
- Linux
- Docker
- Snapcraft
- iPhone
- iPad
- Self-Hosted
- Python




MNN Chat is described as 'Multimodal Offline LLM Chat App with MNN' and is a large language model (llm) tool in the system & hardware category. There are more than 25 alternatives to MNN Chat for a variety of platforms, including Windows, Android, Linux, Mac and Self-Hosted apps. The best MNN Chat alternative is Ollama, which is both free and Open Source. Other great apps like MNN Chat are AnythingLLM, LM Studio, PocketPal and Maid (AI).
Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs.




AI00 RWKV Server is an inference API server for the RWKV language model based upon the web-rwkv inference engine.




This project aims to eliminate the barriers of using large language models by automating everything for you. All you need is a lightweight executable program of just a few megabytes. Additionally, this project provides an interface compatible with the OpenAI API, which means...




This application provides a full suite of generative AI features for chat, code assistance, document search, image analysis, image and video generation. All features run offline and are powered by your PC’s Intel® Core™ Ultra with built-in Intel Arc GPU or Intel Arc™ dGPU...


Learn how to add AI with local models and APIs to Windows apps. Discover AI scenarios and models such as Phi, Mistral, Stable Diffusion, Whisper, and many more to delight your users. The AI Dev Gallery is an open-source app designed to help Windows developers integrate AI...




A modern web interface for managing and interacting with vLLM servers (www.github.com/vllm-project/vllm). Supports both GPU and CPU modes, with special optimizations for macOS Apple Silicon and enterprise deployment on OpenShift/Kubernetes.




Run LLMs on device or connect to various commercial or open source APIs. ChatterUI aims to provide a mobile-friendly interface with fine-grained control over chat structuring.




LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar.


SmolChat allows you to download and run popular LLMs on your Android device, locally, without needing an internet connection. Customize the model used for each chat, tune settings like temperature and min-p, and pin your favourite chats on the home-screen with shortcuts.




📱 The first fully functional, standalone AI assistant for mobile devices with powerful tool-calling capabilities 📱




LLM Hub is an open-source Android app for on-device LLM chat and image generation. It's optimized for mobile usage (CPU/GPU/NPU acceleration) and supports multiple model formats so you can run powerful models locally and privately.



