The best open source alternative to Tesseract is OSS Document Scanner. If that doesn't suit you, our users have ranked more than 25 alternatives to Tesseract, and ten of them are open source, so hopefully you can find a suitable replacement. Other interesting open source alternatives to Tesseract are Open Scanner, gscan2pdf, EasyOCR and OCRopus.
OSS Document Scanner is an open source app for scanning all your documents. You can scan either with your camera or by importing an image. The app automatically detects your document within the photo and crops the image.




Open Scanner is a fast and simple-to-use paper-scanning app with AI features to speed up your scanning workflow.




gscan2pdf can scan, clean up the scan, and run OCR on the scan or on imported images (incl. existing PDFs, DjVus or other file types), and produce PDF and DjVu files with embedded OCR text. It works together with tesseract, ocropus, cuneiform an...

EasyOCR offers ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari and Cyrillic.

OCRopus(tm) is a state-of-the-art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multilingual capabilities.

CuneiForm (OpenOCR) is text recognition software for printed documents. The program cannot recognize handwritten manuscripts or PDF files, but it can recognize table structures. The language model covers 20 languages, and the results can be saved as HTML, RTF or ASCII text, or...


OCRMyScreen is a small, focused utility that lets you capture text from anywhere on your screen, especially in places where normal copy-paste simply does not work. Instead of retyping error messages, one-time codes, or text inside images and locked UIs, you press a button, drag...


Chandra is a highly accurate OCR model that converts images and PDFs into structured HTML/Markdown/JSON while preserving layout information.

GOCR is an OCR (Optical Character Recognition) program developed under the GNU General Public License. It converts scanned images of text back into text files. Joerg Schulenburg started the program and now leads a team of developers.

