CamScanner is not available for Linux, but there are plenty of alternatives that run on Linux with similar functionality. The best Linux alternative is Tesseract, which is both free and open source. If that doesn't suit you, our users have ranked more than 100 alternatives to CamScanner, and 11 of them are available for Linux, so hopefully you can find a suitable replacement. Other interesting Linux alternatives to CamScanner are GImageReader, NormCap, Crow Translate and dpScreenOCR.
Tesseract.js is a JavaScript library that extracts words in almost any language from images.




Crow Translate is a simple and lightweight translator written in C++ / Qt that lets you translate and speak text using the Google, Yandex and Bing translation APIs.





OCRFeeder is a document layout analysis and optical character recognition system.




Quickly extract text from almost any source: YouTube, screencasts, PDFs, webpages, photos, etc. Grab the image and get the text.



A Java/.NET GUI frontend for the Tesseract OCR engine. Provides optical character recognition for Vietnamese and other languages supported by Tesseract.


SikuliX automates anything you see on the screen of your desktop computer running Windows, Mac or some Linux/Unix systems. It uses image recognition.

CuneiForm (OpenOCR) is text recognition software for printed documents. It cannot recognize handwritten manuscripts or PDF files, but it can recognize table structures. The language model covers 20 languages, and the results can be saved as HTML, RTF or ASCII text, or...


WatchOCR is an open source OCR server that creates searchable PDFs from images in a watched folder.
