OpenReader icon
OpenReader icon

OpenReader

An open-source read-along document reader server with high-quality TTS options, synchronized highlighting, and audiobook export for EPUB, PDF, DOCX, TXT, and MD.

Cost / License

  • Free
  • Open Source (MIT)

Platforms

  • Self-Hosted
  • Docker
0likes
0comments
0articles

Features

OpenReader News & Activities

Highlights All activities

Recent activities

OpenReader information

  • Developed by

    US flagRichard Roberson
  • Licensing

    Open Source (MIT) and Free product.
  • Written in

  • Alternatives

    19 alternatives listed
  • Supported Languages

    • English

AlternativeTo Categories

News & BooksOffice & Productivity

GitHub repository

  •  362 Stars
  •  58 Forks
  •  2 Open Issues
  •   Updated  
View on GitHub
OpenReader was added to AlternativeTo by phisch on and this page was last updated .
No comments or reviews, maybe you want to be first?

What is OpenReader?

OpenReader is an open source text-to-speech document reader built with Next.js. It provides a read-along experience with narration for EPUB, PDF, TXT, MD, and DOCX documents.

It supports multiple TTS providers including OpenAI, Replicate, DeepInfra, and custom OpenAI-compatible endpoints such as Kokoro-FastAPI, KittenTTS-FastAPI, and Orpheus-FastAPI.

Highlights:

  • Layout-aware PDF Parsing
  • PP-DocLayoutV3 (ONNX) detects structured blocks with cross-page stitching and geometry-based highlighting for precise read-along sync and clean TTS segmentation
  • Word-by-word Highlighting via ONNX Whisper alignment
  • Powered by the external compute worker control plane (NATS JetStream-backed)
  • Segment-based TTS Playback
  • Sentence-aware generation with cached audio segments, background preloading, and resumable playback across EPUB, PDF, TXT, MD, and DOCX
  • Multi-Provider TTS Support
  • Self-hosted: Kokoro-FastAPI (multi-voice combinations), KittenTTS-FastAPI, Orpheus-FastAPI, or any custom OpenAI-compatible endpoint
  • Cloud: OpenAI (tts-1, tts-1-hd, gpt-4o-mini-tts), Replicate (built-in catalog + any model ID), DeepInfra (Kokoro-82M and others)
  • Audiobook Export in m4b/mp3 with resumable chapter generation
  • Flexible Backend — embedded SeaweedFS or S3-compatible storage, SQLite or Postgres, server library import, and device sync
  • Auth and User Isolation — auth is required in v4+, with optional anonymous auth sessions for guest flows
  • Customizable — 13 built-in themes (light and dark palettes), per-user TTS settings, and document handling controls

Official Links