Tesseract icon
Tesseract icon

Tesseract

 101 likes

Tesseract.js is a javascript library that gets words in almost any language out of images.

Tesseract screenshot 1

License model

  • FreeOpen Source

Application type

Platforms

  • Mac
  • Windows
  • Linux
4 / 5 Avg rating (2)
101 likes
2comments
0 news articles

Features

Suggest and vote on features

Tesseract News & Activities

Highlights All activities

Recent News

No news, maybe you know any news worth sharing?
Share a News Tip

Recent activities

Show all activities

Tesseract information

  • Licensing

    Open Source and Free product.
  • Rating

    Average rating of 4
  • Alternatives

    15 alternatives listed
  • Supported Languages

    • English

Our users have written 2 comments and reviews about Tesseract, and it has gotten 101 likes

Tesseract was added to AlternativeTo by Akasam on May 11, 2009 and this page was last updated Jul 19, 2024.

Comments and Reviews

   
 Post comment/review
tylerszabo
  
Top positive commentMar 11, 2019

In terms of OCR this tesseract is fantastic. I compared it to ABBYY 14 and tesseract had fewer errors on dictionary words. While it doesn't offer layout preservation with the OCR (i.e. converting into an editable document that should print similarly) you'll likely make up for that in the reduced time needed to fix OCR errors.

For handling PDFs you'll need to convert them to an image file, first - pdftopng (an Open Source tool that can be found in the Xpdf project)

4
TBayAreaPat
  
Review
Pending approval • Edited Sep 19, 2024

Requres that Java be installed

-3

What is Tesseract?

Tesseract.js is a javascript library that gets words in almost any language out of images.

The Tesseract OCR engine was one of the top 3 engines in the 1995 UNLV Accuracy test. Between 1995 and 2006 it had little work done on it, but it is probably one of the most accurate open source OCR engines available. The source code will read a binary, grey or color image and output text. A tiff reader is built in that will read uncompressed TIFF images, or libtiff can be added to read compressed images. There are language files for many languages, even for text set in Fraktur and blackletter typefaces.