Apache Nutch Alternatives
Apache Nutch is described as a 'highly extensible and scalable open source web crawler software project'. There are seven alternatives to Apache Nutch for a variety of platforms, including Mac, Windows, Linux, Online / Web-based and BSD. The top-listed alternative is Scrapy, which is both free and open source. Other apps like Apache Nutch are Mixnode, StormCrawler, Heritrix and Crawlbase.
Scrapy
- Free • Open Source
- Web Scraping Tool
- Mac
- Windows
- Linux
- BSD
83 alternatives to Scrapy
Scrapy is a free and open-source web-crawling framework written in Python. Originally designed for web scraping, it can also be used to extract data through APIs or as a general-purpose web crawler. It was developed and is maintained by Zyte (formerly Scrapinghub), a web-scraping...
Mixnode
- Paid • Proprietary
- Online
22 alternatives to Mixnode
Mixnode is a fast, flexible, massively scalable platform to extract and analyze data from the web.
StormCrawler
- Free • Open Source
- Mac
- Windows
- Linux
7 alternatives to StormCrawler
StormCrawler is an open-source SDK for building distributed web crawlers with Apache Storm. The project is licensed under the Apache License v2 and consists of a collection of reusable resources and components, written mostly in Java.
Heritrix
- Free • Open Source
- Mac
- Windows
- Linux
12 alternatives to Heritrix
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
Crawlbase
- Freemium • Proprietary
- Web Scraping Tool
- Online
61 alternatives to Crawlbase
Crawlbase (formerly ProxyCrawl) helps you stay anonymous while crawling the web.
ACHE Crawler
- Free • Open Source
- Mac
- Windows
- Linux
7 alternatives to ACHE Crawler
ACHE is a web crawler for domain-specific search.
Kaddara
- Paid • Proprietary
- Web Scraping Tool
- Software as a Service (SaaS)
78 alternatives to Kaddara
Kaddara is a platform designed for professionals who need fresh leads to run their business and whose business is affected by how competitors operate.