

Ambar
Ambar is a smart documents archive with automated crawling, OCR, deduplication and ultra-fast full-text search. Imagine having billion of files in different formats like xls, doc, txt, pdf, ppt, etc..., in any encoding.
Cost / License
- Freemium (Pay once)
- Open Source
Alerts
- Discontinued
Platforms
- Online
- VirtualBox
- Self-Hosted
- Docker
The last release was in 2018, and the GitHub repository of the project is archived, so no further development will be done.
Features
- OCR
- Full-Text Search
- Scheduled Data Crawling
- REST API
- Sync with Google Drive
- Screenshot OCR
Sync with Dropbox
- PDF OCR
Tags
- smb
- content-search-engine
- Search Engine
- text-search
- crawling
- archives
- word-document-processing
- document-archiving
- content-search
- search-content
- ftp-ocr
Ambar News & Activities
Recent activities
POX added Ambar as alternative to ArchiveKeep
What is Ambar?
Ambar is a smart documents archive with automated crawling, OCR, deduplication and ultra-fast full-text search. Imagine having billion of files in different formats like xls, doc, txt, pdf, ppt, etc..., in any encoding. Ambar securely stores them and gives you an ability to search through their content and metadata in milliseconds. It is very lightweight, simple and intuitive, but yet very fast and powerful in terms of data amount and scaling. All the rocket-science is hidden behind the simple UI.









Comments and Reviews
Hi, Great service, it enables me to search quickly through all my text notes on dropbox. A possible enhancement would be if I could forward mail to ambar to save and index it.
Regards and keep up the good work,
Mike.
Originally submitted via email
[Edited by rd17ambar, April 03]
Ambar scans folders for documents, indexes them, runs OCR across, and lets you find documents via keyword search and gives you the direct links and displays some content even directly as a preview in the search results.
While some OCR results are not exactly pleasing to the eye, especially formatted data such as tables, it really does a good job at finding everything quickly. What I am missing is some control over resource use of the provided VM, as well as some more customization of the interface. Though I assume that that is currently reserved for Enterprise customers or in the making.
Great job, best OCR search package I've seen!
The best I've found, and still being developed meaning that the opportunities for improvement still left might just be fulfilled!