
Recoll
This package is a personal full text search package which is based on a very strong backend (Xapian), for which it provides an easy to use and feature-rich interface.
What is Recoll?
This package is a personal full text search package which is based on a very strong backend (Xapian), for which it provides an easy to use and feature-rich interface.
A small fee is required for downloading and using the Windows binary version.
Recoll finds keywords inside documents as well as file names.
It can search most document formats. It can reach any storage place: files, archive members, email attachments, transparently handling decompression. One click will open the document inside a native editor or display an even quicker text preview.
Avaliable through repositories
Features:
- Qt-based GUI
- Will run on most Unix-based systems
- Powerful query facilities, with boolean searches, phrases, proximity, wildcards, filter on file types and directory tree.
- Supports the following document types (and their compressed versions)
- Natively: text, html, OpenOffice files, maildir and mailbox (Mozilla and IceDove mail) with attachments, pidgin log files
- With external helpers: pdf (pdftotext), postscript (ghostscript), msword (antiword), excel, ppt (catdoc), rtf (unrtf)
- Powerful query facilities, with boolean searches, phrases, filter on file types and directory tree
- Support for multiple charsets, Internal processing and storage uses Unicode UTF-8. Multi-language and multi-character set with Unicode based internals.
- Stemming performed at query time (can switch stemming language after indexing)
- Easy installation. No database daemon, web server or exotic language necessary
- An indexer which runs either as a thread inside the GUI or as an external, cron'able program
Recoll Screenshots









Recoll Features
Recoll information
Supported Languages
- English
Comments and Reviews
Tags
- desktop-search
- unix
- boolean-function
- text-search
- find-in-files
Excellent.
Works really well (fast) including with large sets of files (much better than docfetcher). A lot of useful options (to select what to index per folder, to ignore numbers or not, format output, filter them, etc) but the configuration remains straightforward. OCR is possible for pdf (I am using it with tesseract, a bit slow but works).
A small donation is needd for the windows executable but it works so much better than the windows search when you have a lot of data that it is worth it in my opinion.
I use Recoll since the version based upon the version 3 of Qt toolkit. Recoll has a lightweight GUI and uses a very powerful engine (that may be memory hungry, but just when updating the database), being able to index a lot of formats. The database can grow up and reach some gigabytes in size, but the queries are fast.
Pros:
Macrometrópole dir:/mnt/arquivos/caio/Cidades/Brasil/SP/RMSP/Emplasa/biblioteca
andMacrometrópole dir:Emplasa/biblioteca
are the same thing)mime:application/pdf
)Cons:
[Edited by caiocco, April 30]
[Edited by caiocco, April 30]
Not free on window. Force donate to access download.
The executable is not free. The source is available under an open source license. Someone else could compile it I guess and distribute it.
Reply written ago
Recoll is and was my first go to app for indexing pdf files on Ubuntu. It works so well; especially the preview and the ease of adding directories or maps. I also bought the Windows version. After so many uses and it usefulness; no problem. The database is best place in a directory that accommodates it growth when needed. It can also index media, usb and other disks.
Just purrfect for Linux users!
A bit of power usage on initial index, but that's okay.
The best thing since Google Desktop Search. It kicks DocFetcher's butt 10 times over. Well worth the few $/Euros to buy the installer.
It is fantastic. Lightweight considering what it does. I have hundreds of thousand files indexed. Finds fast (where windows search indexing is hell, docfetcher too slow).