Lemur Project

The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software. The project is best known for its Indri search engine.

Cost / License

  • Free
  • Open Source

Platforms

  • Windows
  • Linux
  • Online
  • Self-Hosted

Features

  1.  File Search

Lemur Project information

  • Developed by

    Unknown
  • Licensing

    Open Source and Free product.
  • Alternatives

    9 alternatives listed
  • Supported Languages

    • English
Lemur Project was added to AlternativeTo by dimlakgorkehgz.


What is Lemur Project?

The Lemur Project develops search engines, browser toolbars, text analysis tools, and data resources that support research and development of information retrieval and text mining software. The project is best known for its Indri search engine, Lemur Toolbar, and ClueWeb09 dataset. Our software and datasets are used widely in scientific and research applications, as well as in some commercial applications.

Indri is a search engine that provides state-of-the-art text search and a rich structured query language for text collections of up to 50 million documents (single machine) or 500 million documents (distributed search). It is available for Linux, Solaris, Windows, and Mac OS X.

Features

Powerful Query Interface

  • Supports popular structured query operators from INQUERY
  • Suffix-based wildcard term matching
  • Field retrieval
  • Passage retrieval

Flexible Indexing and Document Support

  • Supports UTF-8 encoded text
  • Language-independent tokenization of UTF-8 encoded documents
  • Parses PDF, HTML, XML, and TREC documents
  • Word and PowerPoint parsing (Windows only)
  • Text annotations
  • Document metadata

Package Versatility

  • Open source, with a flexible BSD-inspired license
  • Includes both command line tools and a Java user interface
  • API can be used from Java, PHP, or C++
  • Works on Windows, Linux, Solaris, and Mac OS X

Scalability and Efficiency

  • Best-in-class ad hoc retrieval performance
  • Can be used on a cluster of machines for faster indexing and retrieval
  • Scales to terabyte-sized collections

Download

Indri can be obtained from the SourceForge Lemur Project Page.

Release History

The first version (1.0) of Indri was released in January 2002. Subsequent releases have been made 2-3 times each year since then. Release notes for the current release can be found on SourceForge.
