Apps tagged with 'llm-inference'

All apps in the 'llm-inference' category. Use the filters below to narrow down your search.
  1. Fireworks AI
     2 likes

    Build and deploy generative AI on the fastest and most efficient inference engine, fine-tuning and switching between models without extra costs.

    Cost / License

    • Paid
    • Proprietary

    Platforms

    • Online
    23 alternatives
  2. AI Dev Gallery

     Learn how to add AI with local models and APIs to Windows apps. Discover AI scenarios and models such as Phi, Mistral, Stable Diffusion, Whisper, and many more to delight your users. The AI Dev Gallery is an open-source app designed to help Windows developers integrate AI...

    Cost / License

    • Free
    • Open Source (MIT)

    Platforms

    • Windows
    24 alternatives
  3. Lemonade Server

     Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs.

    Cost / License

    Platforms

    • Windows
    • Linux
    • Docker
    • Snapcraft
    • iPhone
    • iPad
    • Self-Hosted
    • Python
    16 alternatives
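Local LLM servers like the one above are typically consumed through an OpenAI-compatible HTTP API. The sketch below is a minimal illustration of that pattern, assuming a chat-completions route on localhost; the port, path, and model name are placeholders to check against the server's own documentation, not Lemonade Server's confirmed defaults.

```python
import json
import urllib.request

def local_chat_request(prompt: str,
                       base: str = "http://localhost:8000/api/v1",
                       model: str = "local-model") -> urllib.request.Request:
    """Build a POST request for a locally served OpenAI-compatible API.

    `base` and `model` are placeholder values for illustration.
    """
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        f"{base}/chat/completions",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = local_chat_request("Hello from my own GPU")
# urllib.request.urlopen(req) would send it once the local server is running.
```

Because the wire format is OpenAI-compatible, the same request shape works against any such server by changing only `base` and `model`.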
  4. AI Playground
     2 likes

    This application provides a full suite of generative AI features for chat, code assistance, document search, image analysis, image and video generation. All features run offline and are powered by your PC’s Intel® Core™ Ultra with built-in Intel Arc GPU or Intel Arc™ dGPU...

    Cost / License

    • Free
    • Open Source (MIT)

    Platforms

    • Windows
    23 alternatives
  5. RamaLama
     1 like

    RamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all through the familiar language of containers.

    Cost / License

    • Free
    • Open Source (MIT)

    Platforms

    • Linux
    • Mac
    • Python
    54 alternatives
  6. OpenRouter Runner

     OpenRouter Runner is a monolithic inference engine built with Modal. It provides a robust way to deploy many open-source models, which are hosted in a fallback capacity on openrouter.ai.

    Cost / License

    • Freemium
    • Open Source (MIT)

    Platforms

    • Self-Hosted
  7. Corpus2GPT
     1 like

    Corpus2GPT is a project that lets users train their own GPT models on diverse datasets, including local languages and various corpus types. It is built on Keras, runs on TensorFlow, PyTorch, or JAX backends, and supports storing or sharing the resulting models.

    Cost / License

    Platforms

    • Self-Hosted
    5 alternatives
  8. Arch

    The proxy server for AI-native apps. Arch handles the pesky low-level work of building agents, like clarifying vague user input, routing prompts to the right agents, and unifying access to any LLM, all without locking you into a framework.

    Cost / License

    Platforms

    • Self-Hosted
    • Docker
  9. DeepSeek Platform

     The DeepSeek API provides developers with direct access to DeepSeek’s advanced AI models, enabling them to run text, code, and multimodal tasks through simple endpoints for seamless integration into applications.

    Cost / License

    • Paid
    • Proprietary

    Platforms

    • Online
    12 alternatives
  10. GPUniq

    GPUniq is a stable GPU compute platform built for AI teams and developers who need reliable performance without paying traditional cloud prices. It provides on-demand access to powerful GPUs for LLM training, inference, computer vision, generative workloads, and 3D rendering —...

    Cost / License

    • Paid
    • Proprietary

    Platforms

    • Online
    Screenshots: search page, open-source hosted LLMs page, GPU Burst feature
  11. MiniMax Platform

      MiniMax Platform is a versatile AI ecosystem offering advanced models for text, speech, video, and music generation, optimized for coding, creative expression, and immersive interaction.

    Cost / License

    • Freemium
    • Proprietary

    Platforms

    • Mac
    • Windows
    • Online
    12 alternatives
  12. GMI Cloud

    GMI Cloud's Inference Engine supercharges your AI models with lightning-fast inference, making it easy to scale and optimize them for real-time performance.

    Cost / License

    • Paid
    • Proprietary

    Platforms

    • Online
    23 alternatives
  13. Lyrio.ai

    Lyrio.ai is your ultimate tool for structured communication. From multithreading to pinned messages, folders, and multi-LLMs, Lyrio.ai helps you manage conversations with ease. Whether you’re handling research, professional projects, or daily tasks, this AI-powered assistant is...

    Cost / License

    • Freemium
    • Proprietary

    Platforms

    • Online
    Screenshots: Your Personal Organized AI Assistant, Threads, Multi LLMs
    8 alternatives