Facilitates local deployment of Llama 3, Code Llama, and other language models, enabling customization and offline AI development. Perfect for creating personalized AI chatbots and writing tools.


Facilitates local deployment of Llama 3, Code Llama, and other language models, enabling customization and offline AI development. Perfect for creating personalized AI chatbots and writing tools.


Duck.ai is a free feature that allows you to have private conversations with 3rd-party AI chat models, anonymized by us. It currently supports Anthropic’s Claude 3 Haiku, Meta’s Llama 4, Mistral AI’s Mistral Small 3, and OpenAI’s GPT-4o mini.



Fully private offline chatbot runs locally in browser, with no server or installation needed. Supports open source models like Llama 3 and Mistral for secure use.

Privacy-focused open-source chatbot enables unlimited document uploads, multi-user support, vector database integration, and intelligent chat from existing files.










Together Chat is a next-generation consumer app designed to let you interact seamlessly with today's most popular open-source models, including free access to DeepSeek R1, securely hosted in the North America.




The simplest way to use private LLMs: Works fully offline and private when you don’t have internet. Runs models on-device optimized for Apple silicon.




Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.




An open source wireframe to app tool powered by Llama 3.2 vision. Upload a screenshot of a simple site/design & get code.


LlamaGPT is a chatbot that provides a ChatGPT-like experience, with no data leaving your device.



A multi-platform, AI-augmented coding companion ensuring secure development with unit test creation.




NanoGPT is revolutionizing AI access with a core mission: democratizing state-of-the-art models like ChatGPT, Claude, and more for everyone, globally. We believe cutting-edge AI should be accessible, not expensive or complex.




Drop-In OpenAI replacement, On-device, local-first, Generate text/image/speech/music/etc... Backend Agnostic: (llama.cpp, diffusers, bark.cpp, etc...), Optional Distributed Inference(P2P/Federated).




Experience hands-free text creation by converting voice dictation into formatted documents with offline capabilities and AI-generated formatting.




Improve your writing in any macOS application with AI assistance. Quickly correct grammar mistakes, change writing styles, or translate text.



Use your locally running AI models to assist you in your web browsing.




Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs.




Chat with generative language models locally on your computer with zero setup. LocalChat is a simple, easy to set up local AI chat built on top of llama.cpp. It requires no technical knowledge and enables users to experience ChatGPT-like behavior on their own machines — fully...


llamafile lets you distribute and run LLMs with a single file, providing an OpenAI-compatible API as well as a KoboldAI API.

Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama - but purpose-built and deeply optimized for the AMD NPUs.
LLM Hub is an open-source Android app for on-device LLM chat and image generation. It's optimized for mobile usage (CPU/GPU/NPU acceleration) and supports multiple model formats so you can run powerful models locally and privately.



