AlternativeTo Logo

StormCrawler Alternatives

StormCrawler is described as 'open source SDK for building distributed web crawlers with Apache Storm. The project is under Apache license v2 and consists of a collection of reusable resources and components, written mostly in Java' and is an app. There are seven alternatives to StormCrawler for a variety of platforms, including Mac, Windows, Linux, Online / Web-based and BSD. The best alternative is Scrapy, which is both free and Open Source. Other great apps like StormCrawler are Mixnode, Heritrix, Apache Nutch and ProxyCrawl.

This page was last updated
  • FreeOpen Source
  • Mac
  • Windows
  • Linux

StormCrawler is an open source SDK for building distributed web crawlers with Apache Storm. The project is under Apache...

Learn more about StormCrawler

  1. Scrapy icon

    Scrapy

    • FreeOpen Source
    • Mac
    • Windows
    • Linux
    • BSD

    Scrapy is a free and open-source web-crawling framework written in Python. Originally designed for web scraping, it can also be used to extract data using APIs or as a general-purpose web crawler.

    Screenshot
  2. Mixnode is a fast, flexible, massively scalable platform to extract and analyze data from the web.

    Main Page
    Almost everyone thinks Mixnode is a great alternative to StormCrawler.


  3. Heritrix icon

    Heritrix

    • FreeOpen Source
    • Mac
    • Windows
    • Linux

    Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

    Screenshot
  4. Apache Nutch icon

    Apache Nutch

    • FreeOpen Source
    • Mac
    • Windows
    • Linux

    Apache Nutch is a highly extensible and scalable open source web crawler software project.

    No screenshots yet
  5. ProxyCrawl icon

    ProxyCrawl

    • FreemiumProprietary
    • Online

    ProxyCrawl helps you stay anonymous while crawling the web, web crawling protection the way it should be.

    Screenshot


  6. ACHE Crawler icon

    ACHE Crawler

    • FreeOpen Source
    • Mac
    • Windows
    • Linux

    ACHE is a web crawler for domain-specific search.

    No screenshots yet
  7. Kaddara icon

    Kaddara

    • Software as a Service (SaaS)

    Kaddara is a platform designed for professionals who need fresh leads to run their business and whose business is affected by how competitors operate.

    Screenshot
Showing 7 of 7 alternatives