

FastFlowLM
Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama, but purpose-built and deeply optimized for AMD NPUs.
Cost / License
- Free for personal use
- Open Source
Platforms
- Windows
- Online
- Self-Hosted
Properties
- Lightweight
- Privacy focused
Features
- Command line interface
- Works Offline
- AI Chatbot
- Agentic AI
- AI-Powered
- AMD
- Offline
FastFlowLM information
What is FastFlowLM?
FastFlowLM (FLM) — Unlock Ryzen™ AI NPUs
Run large language models — now with Vision, Audio, Embedding and MoE support — on AMD Ryzen™ AI NPUs in minutes. No GPU required. Faster and over 10× more power-efficient. Supports context lengths up to 256k tokens. Ultra-Lightweight (16 MB). Installs within 20 seconds.
📦 The only out-of-the-box, NPU-first runtime built exclusively for Ryzen™ AI. 🤝 Think Ollama — but deeply optimized for NPUs. From Idle Silicon to Instant Power — FastFlowLM Makes Ryzen™ AI Shine.
FastFlowLM (FLM) supports all Ryzen™ AI Series chips with XDNA2 NPUs (Strix, Strix Halo, and Krackan Point).
🧠 Local AI on NPU
FLM makes it easy to run cutting-edge LLMs (and now VLMs) locally with:
- Fast and low power
- 🧰 Simple CLI and API (REST and OpenAI-compatible API; see the sketch after this list)
- 🔐 Fully private and offline
No model rewrites, no tuning — it just works.
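
Because the API is OpenAI-compatible, a stock OpenAI client can talk to the local server. Below is a minimal sketch, assuming the server is already running, that it listens on localhost:11434 (an Ollama-style default, not a confirmed FLM detail), and that a model tag such as llama3.2:1b has been pulled; all three specifics are assumptions for illustration.

```python
from openai import OpenAI

# Endpoint and model tag are assumptions for illustration; FLM advertises an
# OpenAI-compatible API, but check your local server's actual address and the
# model tags you have available.
client = OpenAI(base_url="http://localhost:11434/v1", api_key="not-needed-locally")

resp = client.chat.completions.create(
    model="llama3.2:1b",  # hypothetical model tag
    messages=[{"role": "user", "content": "In one sentence, what is an NPU?"}],
)
print(resp.choices[0].message.content)
```

Since only the base URL changes, the same call should work from any OpenAI SDK or tool that lets you point at a custom endpoint.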
Highlights
- Runs fully on AMD Ryzen™ AI NPU — no GPU or CPU load
- Lightweight runtime (16 MB) — installs within 20 seconds, easy to integrate
- Developer-first flow — like Ollama, but optimized for NPU
- Support for long context windows — up to 256k tokens (e.g., Qwen3-4B-Thinking-2507; a REST sketch follows this list)
- No low-level tuning required — you focus on your app, we handle the rest
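
For the long-context bullet above, here is a hedged sketch of a raw REST call, assuming the OpenAI-compatible /v1/chat/completions route. The host, port, file name, and model tag (loosely based on the Qwen3-4B-Thinking-2507 example) are placeholders, not confirmed FLM specifics.

```python
import requests

# Hypothetical local endpoint; FLM advertises an OpenAI-compatible REST API,
# but the host and port here are assumptions -- check your server's settings.
URL = "http://localhost:11434/v1/chat/completions"

# Load a large document to exercise the long context window
# (up to 256k tokens on supported models, per the project's claims).
with open("big_report.txt", encoding="utf-8") as f:
    document = f.read()

payload = {
    "model": "qwen3-4b-thinking-2507",  # model tag assumed; use one you have pulled
    "messages": [
        {"role": "user", "content": f"Summarize the key findings:\n\n{document}"},
    ],
}

resp = requests.post(URL, json=payload, timeout=600)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```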
