

OCRFeeder
OCRFeeder is a document layout analysis and optical character recognition system.
Features
- OCR
- Built-in viewer
- Graphical User Interface
- Scan documents
Tags
- gnu
- tesseract
- GNOME
OCRFeeder News & Activities
Recent News
Recent activities
- ClarifyDocuments added OCRFeeder as alternative to ClarifyDocuments
- evanforcucci reviewed OCRFeeder
Takes an image or PDF (or other such thing) and gives me the text visible in it. I like to use it for when I have an assignment to read a pdf of a selection of a book, but it's just a bunch of images with no embedded text. With OCRFeeder giving me the text, I can just stick it in a text to speech program and listen to it rather than reading the whole thing.
- VPupkin reviewed OCRFeeder
Window size is too big and can't be adjusted. In an attempt to recognize hangs. Belongs in the trash.
- gramblo353 reviewed OCRFeeder
A little finicky but does it's job otherwise quite well.
- gramblo353 added Built-in viewer as a feature to OCRFeeder
- gramblo353 liked OCRFeeder
frogue added OCRFeeder as alternative to Klippa DocHorizon
What is OCRFeeder?
OCRFeeder is a document layout analysis and optical character recognition system.
Given the images it will automatically outline its contents, distinguish between what's graphics and text and perform OCR over the latter. It generates multiple formats being its main one ODT.
It features a complete GTK graphical user interface that allows the users to correct any unrecognized characters, defined or correct bounding boxes, set paragraph styles, clean the input images, import PDFs, save and load the project, export everything to multiple formats, etc. OCRFeeder was developed as the project of the Master's Thesis in Computer Science of Joaquim Rocha.









Comments and Reviews
A little finicky but does it's job otherwise quite well.
Window size is too big and can't be adjusted. In an attempt to recognize hangs. Belongs in the trash.
Takes an image or PDF (or other such thing) and gives me the text visible in it. I like to use it for when I have an assignment to read a pdf of a selection of a book, but it's just a bunch of images with no embedded text. With OCRFeeder giving me the text, I can just stick it in a text to speech program and listen to it rather than reading the whole thing.