Inferencer lets you run, host and deeply control the latest SOTA AI models (OSS, DeepSeek, Qwen, Kimi, GLM, MiniMax and more) from your own computer.
Cost / License
- Freemium
- Proprietary
Application types
Platforms
- Mac
- iPhone
- iPad




LM Studio is described as 'Discover, download, and run local LLMs' and is a large language model (llm) tool in the ai tools & services category. There are more than 50 alternatives to LM Studio for a variety of platforms, including Mac, Windows, Linux, Android and Self-Hosted apps. The best LM Studio alternative is Ollama, which is both free and Open Source. Other great apps like LM Studio are Jan.ai, GPT4ALL, Open WebUI and AnythingLLM.
Inferencer lets you run, host and deeply control the latest SOTA AI models (OSS, DeepSeek, Qwen, Kimi, GLM, MiniMax and more) from your own computer.




Warden is a minimalist, simple and beautiful macOS AI chat app, that supports most AI providers: ChatGPT, Anthropic (Claude), xAI (Grok), Google Gemini, Perplexity, Groq, Local LLMs through Ollama, OpenRouter, and almost any OpenAI-compatible APIs.




KoboldCpp is an easy-to-use AI text-generation software for GGML models. It's a single self contained distributable from Concedo, that builds off llama.cpp, and adds a versatile Kobold API endpoint, additional format support, backward compatibility, as well as a fancy UI...




Experience AI chat on macOS with a SwiftUI-designed client utilizing Swift, CoreML, and BERT for native performance. Enjoy privacy-focused, intuitive chats with intelligent AI responses, profile customization, and full control via editable chat history and message rewind.

Typing Mind is a commercial alternative front end for various LLM engines, using various APIs it offers a front end interface for managing chats, uploading documents, and it’s own plugins. It can use OpenAI API, Anthropic, and OpenRouter API out of the box, and you can configure...
The simplest way to use local and online AI models. Interact with any AI model with just a click of a button.




Cloudflare Workers AI provides a serverless platform to execute AI models utilizing GPUs in its network, eliminating infrastructure needs. Access over 50 open-source models, use AI Gateway for app control, and deploy globally with tools like Vectorize, R2, and D1.


ArgalAI is an offline AI chat app that allows you to chat with the latest AI models to get things done or have a bit of fun. Best of all, there are no monthly fees, and none of your chats ever leave your device.




Cortex is the open-source brain for robots: vision, speech, language, tabular, and action -- the cloud is optional.


The main goal of llama.cpp is to enable LLM inference with minimal setup and state-of-the-art performance on a wide range of hardware - locally and in the cloud.




The Swiss Army Knife of offline AI. Chat, speak, and generate images. Privacy first, zero internet. Download an LLM and use it on your mobile device. No data ever leaves your phone.


HugstonOne Enterprise Edition — Awesome AI App with Code editor and Live Preview Create games, dashboards, maps, tables, charts, webpages, data analysis, converters etc in seconds.



