Recoll is a personal full text search tool for Unix/Linux.
It is based on the very strong Xapian backend.
It provides an easy-to-use, feature-rich interface with a Qt GUI.
Most common document types are supported, along with their compressed versions (text, HTML, PDF, DVI, PostScript, OpenOffice, LyX, Scribus, Word/Excel/PPT, AbiWord, KWord, WordPerfect, RTF, DjVu, gaim logs, maildir and mailbox mail folders including attachments, misc media files).
Powerful query facilities are provided, from simple keyword entry to assisted boolean query building with proximity clauses and filtering on file types or location. A Xesam-compatible query language also supports field searches and date filtering.
Multiple character sets are supported. Internal processing and storage use Unicode UTF-8.
Recoll has few dependencies. No database daemon, Web server, or exotic language/framework is necessary. In the default setup, it only runs on your system when you need it. Indexing can be performed in batch mode or in real time.
Thanks to Xapian, indexing does not tax system resources excessively and searching is very fast.
The latest 1.19 release is 1.19.11. If you are running an early 1.19 release, you should upgrade because of a nasty bug in the date selection code (resulting in missed hits). This release also brings a new PPT filter, Python 3 compatibility, and, for Ubuntu users, a Scope for the Saucy Dash.
Release 1.19 brings faster indexing for multiprocessors, new results management features (multiple attachment saves, duplicates listing), advanced search history storage, and other performance and usability enhancements.
Release 1.18.1 brings optional case- and diacritics-sensitive searches, complex search history, and direct access to hit pages for PDF documents.
Release 1.17.3 brings a number of usability improvements: management of indexing operations from the GUI, filtering on file size, extended directory filtering, Ubuntu Unity Lens, thumbnails in result lists, Okular notes and Gnumeric filters, etc.
Release 1.16.2 brings a long list of small improvements and bug fixes: image previews, negative directory filtering, anchored searches, more popup menu entries, etc. Please check the release notes for details (http://www.recoll.org/release-1.16.html).
Release 1.15 (.9): Enhanced native Qt 4 user interface (no more Qt 3 compatibility). Switchable table-like display for the results. Direct access to sort functions. Negative directory filtering. Web archive formats.
Release 1.14 (.3): Modification date searches and filtering, arbitrary email header indexing, new GNU info filter, improved Thunderbird mail indexing, new audio tag extractor based on the Mutagen Python library, and miscellaneous other improvements and small bug fixes.
Release 1.13 (.04): New class of persistent filters and indexed file types: zip, chm, ics. Improved big text files handling, Firefox visited pages indexing. Quite a few other performance and usability improvements.
Release 1.12: new KDE KIO slave module, collapsing of identical results, context-sensitive F1 help, saving email attachments and other embedded documents to files, and other small improvements and bug fixes.
Release 1.11: easy filtering of results by document type, nicer previews which use HTML when possible, Python programming interface for indexing and searching, better support for the Xesam user query language, new filter framework, better support for arbitrary field indexing and searching.
- Created a mailing list to improve support. Check the home page.
- Fixed openSUSE 11 compile issues.
- Fixed bug in interpreting email MIME structure, which resulted in base64 decoding errors.
- Fixed "Prev" button in preview window. It would actually go forward when walking the search terms.
- Allow setting the highlight color for search terms in result list and preview.
- Added SVG filter.
- Ensure that in case the data of a file can't be indexed because of some error, at least the file name is indexed.
- Improve query language to support OR queries of terms with field specifications (e.g. title:someterm OR author:someauthor).
- Fix filename search to split patterns on white space, so that a "*.jpg *.jpeg" search does what's expected. This means you now need to use double quotes if there is actual embedded white space.
- Jump directly to the external editor choice dialog instead of opening preferences when an external viewer is not found.
- Allow stopping indexing through menu action (only works with Qt 4 for now).
- Create an "indexedmimetypes" configuration variable to allow explicitly restricting the file types which do get indexed.
- Added support for CJK text, and a GUI configuration tool for the main configuration file.
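
The filename search change in the list above (patterns split on white space, double quotes protecting embedded white space) can be illustrated with a small Python sketch. This is not Recoll's actual implementation: the function names split_patterns and filename_matches are hypothetical, and shell-like quoting via the standard shlex module plus case-sensitive fnmatch wildcards are assumptions chosen only to show the idea.

```python
import fnmatch
import shlex


def split_patterns(search):
    """Split a filename search string into patterns on white space.

    Double quotes protect embedded white space. Assumption: shell-like
    quoting rules (shlex), which may differ from Recoll's own parser.
    """
    return shlex.split(search)


def filename_matches(name, search):
    """Return True if the file name matches any pattern in the search."""
    return any(fnmatch.fnmatchcase(name, pat) for pat in split_patterns(search))


# Two separate patterns: either extension matches.
print(split_patterns('*.jpg *.jpeg'))
print(filename_matches("cat.jpeg", "*.jpg *.jpeg"))

# One quoted pattern with an embedded space.
print(split_patterns('"holiday photo*.jpg"'))
print(filename_matches("holiday photo 1.jpg", '"holiday photo*.jpg"'))
```

Without the quotes, "holiday photo*.jpg" would be treated as two patterns, "holiday" and "photo*.jpg", which is why the changelog entry notes that double quotes are now required for names containing white space.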
Release 1.9: This release brings a number of small practical improvements: new filters: Wordperfect, Abiword, Kword, jpeg, flac, ogg; better control of disk and memory usage during indexing; improved abstract generation; arbitrary field support; improved qt4 support; and miscellaneous user interface improvements and bug fixes, described in more detail in the Changes file.