Tesseract is described as 'Tesseract.js is a JavaScript library that gets words in almost any language out of images' and is a very popular OCR engine in the office & productivity category. There are more than 25 alternatives to Tesseract for a variety of platforms, including Windows, Linux, web-based, Mac, and iPhone apps. The best Tesseract alternative is OSS Document Scanner, which is both free and Open Source. Other great apps like Tesseract are ABBYY FineReader PDF, Open Scanner, ClarifyDocuments, and Scan Thing: Scan Anything.
The OSS Document Scanner is a free and Open Source app to transform your mobile device into a powerful document scanner.




Optical recognition software boasting unmatched text accuracy converts scanned documents to editable Word, Excel, or searchable PDFs while preserving layout. Supports 190 languages, simplifies processes with automated tasks, and includes PDF editing tools.




Open Scanner is a fast and simple-to-use paper-scanning app with AI features to speed up your scanning workflow.




ClarifyDocuments is a free AI tool that converts PDFs, images, and slides into clean, editable text. It automatically cleans, organizes, and processes content in any language, helping students and educators focus on learning.


Scan Thing is the quickest way to capture and save anything around you. It is available on iOS, iPadOS, and macOS.




gscan2pdf can scan, clean the scan and do OCR on the scan or imported images (incl. existing PDFs, DjVus or other file types), and make PDF and DjVu-files with embedded OCR-text. It works together with tesseract, ocropus, cuneiform an...

Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.

OwlOCR offers simple optical character recognition of text in PDF files, images or on-screen and converts that to plain text.




OCRopus™ is a state-of-the-art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multilingual capabilities.

Chandra is a highly accurate OCR model that converts images and PDFs into structured HTML/Markdown/JSON while preserving layout information.





An intelligent data extraction service that automates information processing. It uses advanced AI to accurately parse unstructured documents and converts them into clean, structured JSON data.


