Open Source Heritrix Alternatives

The best open source alternative to Heritrix is Manticore search. If that doesn't suit you, our users have ranked more than 10 alternatives to Heritrix and eight of them is open source so hopefully you can find a suitable replacement. Other interesting open source alternatives to Heritrix are StormCrawler, Apache Nutch, Apisearch and ACHE Crawler.

Copy a direct link to this comment to your clipboard
Heritrix alternatives page was last updated

Alternatives list

  1. Open source search server designed to be fast, scalable and with powerful and accurate full-text search capabilities derived from Sphinx search project.

    Cost / License

    Platforms

    • Mac
    • Windows
    • Linux
     
  2. StormCrawler icon
     2 likes

    StormCrawler is an open source SDK for building distributed web crawlers with Apache Storm. The project is under Apache license v2 and consists of a collection of reusable resources and components, written mostly in Java.

    Cost / License

    Platforms

    • Mac
    • Windows
    • Linux
     
  3. Apache Nutch icon
     2 likes

    Apache Nutch is a highly extensible and scalable open source web crawler software project.

    Cost / License

    • Free
    • Open Source

    Application type

    Platforms

    • Mac
    • Windows
    • Linux
     
  4. Apisearch icon
     2 likes

    Search over millions of documents, and give to your users unique, amazing and unforgettable experiences.

    Cost / License

    • Freemium
    • Open Source

    Platforms

    • Self-Hosted
    • Instagram
    • Twitter
    • GitHub Pages
     
  5. ACHE Crawler icon
     2 likes

    ACHE is a web crawler for domain-specific search.

    Cost / License

    • Free
    • Open Source

    Application type

    Platforms

    • Mac
    • Windows
    • Linux
     
  6. TinySearch is a lightweight, fast, full-text search engine. It is designed for static websites.

    Cost / License

    Platforms

    • Self-Hosted
     
  7. Appbase.io provides a supercharged Elasticsearch experience with a #nocode relevance control plane (or JS UI components or a declarative REST API) and out-of-the-box search/click analytics and insights.

    Cost / License

    • Paid
    • Open Source

    Platforms

    • Self-Hosted
    • elasticsearch
    • Software as a Service (SaaS)
     
8 of 8 Heritrix alternatives