StormCrawler Alternatives

StormCrawler is described as 'Open source SDK for building distributed web crawlers with Apache Storm. The project is under Apache license v2 and consists of a collection of reusable resources and components, written mostly in Java' and is an app. There are more than 10 alternatives to StormCrawler for a variety of platforms, including Windows, Linux, Mac, Web-based and BSD apps. The best StormCrawler alternative is Scrapy, which is both free and Open Source. Other great apps like StormCrawler are Lookyloo, Flyscrape, Crawlbase and Scraperr.

Copy a direct link to this comment to your clipboard
StormCrawler alternatives page was last updated

Alternatives list

  1. Scrapy icon
     104 likes
    Copy a direct link to this comment to your clipboard

    Scrapy is a free and open-source web-crawling framework written in Python. Originally designed for web scraping, it can also be used to extract data using APIs or as a general-purpose web crawler. It was developed and is maintained by Zyte formerly Scrapinghub, a web-scraping...

    104 Scrapy alternatives

    Cost / License

    • Free
    • Open Source

    Application type

    Platforms

    • Mac
    • Windows
    • Linux
    • BSD
     
    • Scrapy is the most popular Windows, Mac & Linux alternative to StormCrawler.

    • Scrapy is the most popular Open Source & free alternative to StormCrawler.

    • Scrapy is Free and Open SourceStormCrawler is also Free and Open Source
  2. Lookyloo icon
     4 likes
    Copy a direct link to this comment to your clipboard

    Lookyloo is a web interface that allows users to capture a website page and then display a tree of domains that call each other.

    60 Lookyloo alternatives

    Cost / License

    • Free
    • Open Source

    Application type

    Platforms

    • Windows
    • Linux
    • Online
     
    • Lookyloo is the most popular Web-based alternative to StormCrawler.

    • Lookyloo is Free and Open SourceStormCrawler is also Free and Open Source
  3. Flyscrape icon
     6 likes
    Copy a direct link to this comment to your clipboard

    Flyscrape is a standalone and scriptable web scraper, combining the speed of Go with the flexibility of JavaScript. — Focus on data extraction rather than request juggling.

    Cost / License

    • Free
    • Open Source

    Application type

    Platforms

    • Mac
    • Windows
    • Linux
     
  4. Crawlbase icon
     3 likes
    Copy a direct link to this comment to your clipboard

    Crawlbase, formerly ProxyCrawl, helps you stay anonymous while crawling the web, web crawling protection the way it should be.

    Cost / License

    • Freemium (Subscription)
    • Proprietary

    Application type

    Platforms

    • Online
     
  5. Scraperr icon
     1 like
    Copy a direct link to this comment to your clipboard

    Scraperr is a self-hosted web application that allows users to scrape data from web pages by specifying elements via XPath. Users can submit URLs and the corresponding elements to be scraped, and the results will be displayed in a table.

    Cost / License

    • Free
    • Open Source

    Application type

    Platforms

    • Self-Hosted
     
    • Scraperr is the most popular Self-Hosted alternative to StormCrawler.

    • Scraperr is Free and Open SourceStormCrawler is also Free and Open Source
  6. Heritrix icon
     5 likes
    Copy a direct link to this comment to your clipboard

    Open-source, extensible web crawler designed for large-scale, archival-quality web archiving, preserves digital artifacts, supports modular plugins, distributed crawling, detailed monitoring, scheduling, and exports data in standardized formats for preservation.

    Cost / License

    • Free
    • Open Source

    Platforms

    • Mac
    • Windows
    • Linux
     
  7. Mixnode icon
     38 likes
    Copy a direct link to this comment to your clipboard

    Mixnode is a fast, flexible, massively scalable platform to extract and analyze data from the web.

    Cost / License

    • Subscription
    • Proprietary

    Application type

    Platforms

    • Online
     
    |
    9
    • Almost everyone thinks Mixnode is a great StormCrawler alternative.

    • Mixnode is the most popular commercial alternative to StormCrawler.

    • Mixnode is Paid and ProprietaryStormCrawler is Free and Open Source
  8. Apache Nutch icon
     2 likes
    Copy a direct link to this comment to your clipboard

    Apache Nutch is a highly extensible and scalable open source web crawler software project.

    Cost / License

    • Free
    • Open Source

    Application type

    Platforms

    • Mac
    • Windows
    • Linux
     
  9. ACHE Crawler icon
     2 likes
    Copy a direct link to this comment to your clipboard

    ACHE is a web crawler for domain-specific search.

    Cost / License

    • Free
    • Open Source

    Application type

    Platforms

    • Mac
    • Windows
    • Linux
     
  10. Kaddara icon
     Like
    Copy a direct link to this comment to your clipboard

    Kaddara is a platform designed for professionals who need fresh leads to run their business and whose business is affected by how competitors operate.

    Cost / License

    • Subscription
    • Proprietary

    Application type

    Platforms

    • Software as a Service (SaaS)
     
    • Kaddara is the most popular SaaS alternative to StormCrawler.

    • Kaddara is Paid and ProprietaryStormCrawler is Free and Open Source
10 of 10 StormCrawler alternatives