AlternativeTo Logo

Heritrix Alternatives

Heritrix is described as 'is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project' and is an app. There are more than 10 alternatives to Heritrix for a variety of platforms, including Online / Web-based, Mac, Windows, Linux and Self-Hosted solutions. The best alternative is Algolia. It's not free, so if you're looking for a free alternative, you could try Apisearch or Apache Nutch. Other great apps like Heritrix are Mixnode, wordpress i-search pro, Expertrec Search Engine and StormCrawler.

This page was last updated Jun 23, 2022
  • FreeOpen Source
  • Mac
  • Windows
  • Linux

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Learn more about Heritrix

  1. Algolia

    • Free PersonalProprietary
    • Online
    • Android SDK
    • Ruby
    • Python
    • JavaScript
    • AngularJS
    • cURL
    • Ruby on Rails
    • Node.JS
    • Objective-C

    Algolia helps product teams connect their users with information by providing the building blocks they need to create fast, relevant, personalized search.

  2. Mixnode is a fast, flexible, massively scalable platform to extract and analyze data from the web.

    Almost everyone thinks Mixnode is a great alternative to Heritrix.


  3. wordpress i-search pro

    • Mac
    • Windows
    • Linux
    • Online
    • Android
    • iPhone
    • Self-Hosted
    • Wordpress

    i-Search Pro changes the way of WordPress Search. It’s full WooCommerce compatible. Provide Live search results in milliseconds. Include almost everything in your search results.

  4. Expertrec custom search started as a replacement for google site search. It adds super-fast search autocomplete, spell correct, search listing pages to your website.

  5. Apisearch

    • FreemiumOpen Source
    • Self-Hosted
    • Instagram
    • Twitter
    • GitHub Pages

    Search over millions of documents, and give to your users unique, amazing and unforgettable experiences.



  6. Apache Nutch

    • FreeOpen Source
    • Mac
    • Windows
    • Linux

    Apache Nutch is a highly extensible and scalable open source web crawler software project.

    No screenshots yet
  7. StormCrawler

    • FreeOpen Source
    • Mac
    • Windows
    • Linux

    StormCrawler is an open source SDK for building distributed web crawlers with Apache Storm. The project is under Apache license v2 and consists of a collection of reusable resources and components, written mostly in Java.

    No screenshots yet
  8. ACHE Crawler

    • FreeOpen Source
    • Mac
    • Windows
    • Linux

    ACHE is a web crawler for domain-specific search.

    No screenshots yet
  9. TinySearch

    • FreeOpen Source
    • Self-Hosted

    TinySearch is a lightweight, fast, full-text search engine. It is designed for static websites.

    No screenshots yet
  10. Appbase.io

    • Self-Hosted
    • elasticsearch
    • Software as a Service (SaaS)

    Appbase.io provides a supercharged Elasticsearch experience with a #nocode relevance control plane (or JS UI components or a declarative REST API) and out-of-the-box search/click analytics and insights.

Showing 10 of 12 alternatives