LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar.


Google AI Edge Gallery is described as 'A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally' and is a AI Chatbot in the ai tools & services category. There are more than 50 alternatives to Google AI Edge Gallery for a variety of platforms, including Mac, Windows, Linux, Android and iPhone apps. The best Google AI Edge Gallery alternative is Ollama, which is both free and Open Source. Other great apps like Google AI Edge Gallery are Perplexity, GPT4ALL, Jan.ai and Microsoft Copilot.
LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar.






The simplest way to use local and online AI models. Interact with any AI model with just a click of a button.




SwitchAI lets you seamlessly choose your preferred AI assistant. With a single tap, access your selected AI or set a default assistant for your device’s digital assistant feature. Boost productivity by effortlessly switching between AI assistants for different tasks.

Connect your local or hosted instance using your URL and API key. Once connected, all available models are loaded automatically. You can switch between them at any time, start new chats, or continue existing ones in a clean and focused interface.




LLMChat offers a versatile platform to engage with various AI models, enhance your experience with plugins, and create custom assistants.




QVAC converts local AI into a high-quality experience that sits in your pocket. All the value you’re used to seeing in other AI assistants, minus the lack of privacy.




Run LLMs on device or connect to various commercial or open source APIs. ChatterUI aims to provide a mobile-friendly interface with fine-grained control over chat structuring.




Experience private offline AI chat with no internet connection required. Secret AI is a secure local LLM chatbot and local AI assistant that runs entirely on your device - your conversations stay 100% private with zero data collection, no servers, no account required, no...




Typing Mind is a commercial alternative front end for various LLM engines, using various APIs it offers a front end interface for managing chats, uploading documents, and it’s own plugins. It can use OpenAI API, Anthropic, and OpenRouter API out of the box, and you can configure...
Paperclip by FireCube (formerly known as Clippy by FireCube) brings back the infamous Clippit companion into your desktop powered by a Large Language Model AI to chat with. The app is completely free to use and open source.

A local device focused AI assistant built in Rust — persistent memory, autonomous tasks, ~27MB binary. Inspired by and compatible with OpenClaw.