Alternatives to Apache Nutch for all platforms with any license

  • Scrapy

    Scrapy is an open source and collaborative framework for extracting the data you need from websites. In a fast, simple, yet extensible way.

    Free Open Source Mac Windows Linux BSD

    No features added Add a feature

    Scrapy icon
  • StormCrawler

    StormCrawler is an open source SDK for building distributed web crawlers with Apache Storm.

    Free Open Source Mac Windows Linux

    No features added Add a feature

    StormCrawler icon
  • Heritrix

    Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project. Heritrix (sometimes spelled heretrix, or misspelled...

    Free Mac Windows Linux

    No features added Add a feature

    Heritrix icon

Platforms

Show 5 less popular platforms

Apache Nutch Comments

Echo echo ... Feels empty in here

Maybe you want to be the first to submit a comment about Apache Nutch? Just click the button up to your right!