

OpenReader
Like
An open-source read-along document reader server with high-quality TTS options, synchronized highlighting, and audiobook export for EPUB, PDF, DOCX, TXT, and MD.
Cost / License
- Free
- Open Source (MIT)
Platforms
- Self-Hosted
- Docker
Features
Properties
- Privacy focused
Features
- No Tracking
- Text to Speech
- Ad-free
- AI-Powered
OpenReader News & Activities
Highlights All activities
Recent activities
- POX updated OpenReader
- phisch added OpenReader
phisch added OpenReader as alternative to ElevenReader, Read Aloud Extension, NaturalReader and Kokoro
OpenReader information
No comments or reviews, maybe you want to be first?
What is OpenReader?
OpenReader is an open source text-to-speech document reader built with Next.js. It provides a read-along experience with narration for EPUB, PDF, TXT, MD, and DOCX documents.
It supports multiple TTS providers including OpenAI, Replicate, DeepInfra, and custom OpenAI-compatible endpoints such as Kokoro-FastAPI, KittenTTS-FastAPI, and Orpheus-FastAPI.
Highlights:
- Layout-aware PDF Parsing
- PP-DocLayoutV3 (ONNX) detects structured blocks with cross-page stitching and geometry-based highlighting for precise read-along sync and clean TTS segmentation
- Word-by-word Highlighting via ONNX Whisper alignment
- Powered by the external compute worker control plane (NATS JetStream-backed)
- Segment-based TTS Playback
- Sentence-aware generation with cached audio segments, background preloading, and resumable playback across EPUB, PDF, TXT, MD, and DOCX
- Multi-Provider TTS Support
- Self-hosted: Kokoro-FastAPI (multi-voice combinations), KittenTTS-FastAPI, Orpheus-FastAPI, or any custom OpenAI-compatible endpoint
- Cloud: OpenAI (tts-1, tts-1-hd, gpt-4o-mini-tts), Replicate (built-in catalog + any model ID), DeepInfra (Kokoro-82M and others)
- Audiobook Export in m4b/mp3 with resumable chapter generation
- Flexible Backend — embedded SeaweedFS or S3-compatible storage, SQLite or Postgres, server library import, and device sync
- Auth and User Isolation — auth is required in v4+, with optional anonymous auth sessions for guest flows
- Customizable — 13 built-in themes (light and dark palettes), per-user TTS settings, and document handling controls


