The OSS Document Scanner is a free and Open Source app to transform your mobile device into a powerful document scanner.




The best open source alternative to Tesseract is OSS Document Scanner. If that doesn't suit you, our users have ranked more than 25 alternatives to Tesseract, 12 of which are open source, so hopefully you can find a suitable replacement. Other interesting open source alternatives to Tesseract are Open Scanner, gscan2pdf, EasyOCR and OCRopus.




Open Scanner is a fast and simple-to-use paper-scanning app with AI features to speed up your scanning workflow.




gscan2pdf can scan, clean up the scan, and run OCR on scanned or imported images (including existing PDFs, DjVu files, or other file types), and produce PDF and DjVu files with embedded OCR text. It works together with tesseract, ocropus, cuneiform an...

Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

OCRopus(tm) is a state-of-the-art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multilingual capabilities.

Chandra is a highly accurate OCR model that converts images and PDFs into structured HTML/Markdown/JSON while preserving layout information.

CuneiForm (OpenOCR) is text recognition software for printed documents. It cannot recognize handwritten manuscripts or PDF files, but it does recognize table structures. The language model covers 20 languages, and the results can be saved as HTML, RTF or ASCII text, or...


Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.



OCRMyScreen is a small, focused utility that lets you capture text from anywhere on your screen, especially in places where normal copy-paste simply does not work. Instead of retyping error messages, one-time codes, or text inside images and locked UIs, you press a button, drag...


GOCR is an OCR (Optical Character Recognition) program, developed under the GNU General Public License. It converts scanned images of text back to text files. Joerg Schulenburg started the program, and now leads a team of developers.


A free, all-in-one document parsing tool that offers accurate parsing and efficient extraction.



