Minimal, clean full-stack LLM chatbot, running tokenization, pretraining, finetuning, evaluation, inference, and web UI on a single 8xH100 node.

Minimal, clean full-stack LLM chatbot, running tokenization, pretraining, finetuning, evaluation, inference, and web UI on a single 8xH100 node.

Build and deploy generative AI on the fastest and most efficient inference engine, fine-tuning and switching between models without extra costs.




AI00 RWKV Server is an inference API server for the RWKV language model based upon the web-rwkv inference engine.




Learn how to add AI with local models and APIs to Windows apps. Discover AI scenarios and models such as Phi, Mistral, Stable Diffusion, Whisper, and many more to delight your users. The AI Dev Gallery is an open-source app designed to help Windows developers integrate AI...








Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs.




Experience the power of RWKV models directly on your device. Completely offline, privacy-first, and efficient. No internet required.




This application provides a full suite of generative AI features for chat, code assistance, document search, image analysis, image and video generation. All features run offline and are powered by your PC’s Intel® Core™ Ultra with built-in Intel Arc GPU or Intel Arc™ dGPU...


Inferencer lets you run, host and deeply control the latest SOTA AI models (OSS, DeepSeek, Qwen, Kimi, GLM and more) from your own computer.




Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama - but purpose-built and deeply optimized for the AMD NPUs.
Run Llama, Gemma, Qwen, DeepSeek, and more locally on your iPhone, iPad, and Mac. Offline. Private. No login. Optimized for Apple Silicon.




LLM Hub is an open-source Android app for on-device LLM chat and image generation. It's optimized for mobile usage (CPU/GPU/NPU acceleration) and supports multiple model formats so you can run powerful models locally and privately.




An AI-powered chat interface designed for exploring advanced topics like quantum computing, programming, and emerging AI trends through interactive conversation.




OpenRouter Runner is a monolith inference engine, built with Modal. It serves as a robust solution for the deployment of tons of open source models that are hosted in a fallback capacity on openrouter.ai.
Corpus2GPT: A project enabling users to train their own GPT models on diverse datasets, including local languages and various corpus types, using Keras and compatible with TensorFlow, PyTorch, or JAX backends for subsequent storage or sharing.

HugstonOne Enterprise Edition — Awesome AI App with Code editor and Live Preview Create games, dashboards, maps, tables, charts, webpages, data analysis, converters etc in seconds.








The proxy server for AI-native apps. Arch handles the pesky low-level work in building agents like clariyfing vague user input, routing prompts to the right agents and unifying access to any LLM - all without locking you into a framework.

The DeepSeek API provides developers with direct access to DeepSeek’s advanced AI models, enabling them to run text, code, and multimodal tasks through simple endpoints for seamless integration into applications.




GPUniq is a stable GPU compute platform built for AI teams and developers who need reliable performance without paying traditional cloud prices. It provides on-demand access to powerful GPUs for LLM training, inference, computer vision, generative workloads, and 3D rendering —...



MiniMax Platform is a versatile AI ecosystem offering advanced models for text, speech, video, and music generation, optimized for coding, creative expression, and immersive interaction.




You’re about to supercharge your AI models with lightning-fast inference. Inference Engine makes it easy to scale, and optimize your models for real-time performance. Let’s get started—your AI is ready to shine.




Lyrio.ai is your ultimate tool for structured communication. From multithreading to pinned messages, folders, and multi-LLMs, Lyrio.ai helps you manage conversations with ease. Whether you’re handling research, professional projects, or daily tasks, this AI-powered assistant is...


