Facilitates local deployment of Llama 3, Code Llama, and other language models, enabling customization and offline AI development. Perfect for creating personalized AI chatbots and writing tools.


Open-source offline AI chat software supporting local LLMs, cloud model connections, custom assistant creation, OpenAI-compatible API, and broad hardware support.




Privacy-focused open-source chatbot running locally on consumer CPUs without GPU or internet, supporting multiple customizable large language models and licensing.







Private, offline, on-device LLM app by Ente with open-source code, no accounts or tracking, and cross-platform support on mobile and desktop devices.




Privacy-focused open-source chatbot enables unlimited document uploads, multi-user support, vector database integration, and intelligent chat from existing files.







Open-source cross-platform AI client offering model selection, on-prem deployment, enterprise features, security auditing, and control of data.

As part of Meta’s commitment to open science, today we are publicly releasing Llama (Large Language Model Meta AI), a state-of-the-art foundational large language model designed to help researchers advance their work in this subfield of AI.




Cherry Studio is a desktop client that supports multiple LLM providers, available on Windows, Mac, and Linux.




A family of lightweight, state-of-the-art open models built from the same research and technology that we used to create the Gemini models.



Open-source platform supporting local and cloud AI chat, semantic search, document retrieval, and image generation across multiple devices and data sources.




Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.




Ingest documents and ask questions about them without an internet connection, using the power of LLMs. 100% private: no data leaves your execution environment at any point.

MiroThinker is an open-source search agent model, built for tool-augmented reasoning and real-world information seeking, aiming to match the deep research experience of OpenAI Deep Research and Gemini Deep Research.


Minimal, clean full-stack LLM chatbot, running tokenization, pretraining, finetuning, evaluation, inference, and web UI on a single 8xH100 node.

Visual programming environment for building, debugging, and deploying LLM agent workflows with real-time collaboration, YAML-based version control, and TypeScript integration.

The self-improving AI agent built by Nous Research. It's the only agent with a built-in learning loop — it creates skills from experience, improves them during use, nudges itself to persist knowledge, searches its own past conversations, and builds a deepening model of who...

SillyTavern is a user interface you can install on your computer (and Android phones) that allows you to interact with text generation AIs and chat/roleplay with characters you or the community create.




Qwen Code is an AI-powered command-line workflow tool designed for developers, adapted from Gemini CLI and optimized for Qwen3-Coder models.



Terminal-native open-source AI coding assistant supporting Mac, Windows, Linux, BSD, with cross-session memory, Git integration, and major LLM API compatibility.




A Gradio web UI for Large Language Models. Supports transformers, GPTQ, llama.cpp (GGUF), Llama models.

The main goal of llama.cpp is to enable LLM inference with minimal setup and state-of-the-art performance on a wide range of hardware - locally and in the cloud.




NodeTool is a playground for AI that uses a visual canvas to connect different AI tools - like GPT, image creators, and video generators - into one seamless workflow. Instead of jumping between five different apps to write a script, generate an image, and turn it into a video...

