Free software that downloads entire sites to local storage for offline use, maintaining the original link structure for seamless browsing.
- Website Downloader
- Free • Open Source
- Mac
- Windows
- Linux
- Android
This is an independent DIY search engine that focuses on non-commercial content, and attempts to show you sites you perhaps weren't aware of in favor of the sort of sites you probably already knew existed.
- Web Search Engine
- Free • Open Source
- Online
Portia is an open source visual scraping tool that lets you scrape websites without any programming knowledge. Simply annotate the pages you're interested in, and Portia will create a spider to extract data from similar pages.
- Web Scraping Tool
- Free • Open Source
- Linux
- Vagrant
- Docker
Apify is a web scraping and automation platform that extracts data from websites, crawls lists of URLs, and automates workflows on the web. Turn any website into an API!
- Web Scraping Tool
- Freemium • Open Source
- Online
Minexa.ai is a next-generation tool that makes web scraping faster and more affordable with an AI-powered solution no other alternative has. Unlike others that require constant tweaking, struggle under heavy loads, or charge extra for natural language processing, Minexa adapts...
- Web Scraping Tool
- Paid • Proprietary
- Online
Netpeak Spider is an SEO crawler for day-to-day SEO audits, fast issue checks, comprehensive analysis, and website scraping.
- SEO Tool
- Paid • Proprietary
- Windows
grab-site is a crawler for archiving websites to WARC files. It includes a dashboard for monitoring multiple crawls, and supports changing URL ignore patterns during the crawl.
- Website Downloader
- Free • Open Source
- Mac
- Linux
Oxylabs is the leading global provider of premium proxies and data scraping solutions for large-scale web intelligence collection.
- Web Scraping Tool
- Paid • Proprietary
- Online
- Android
- Google Chrome
- Software as a Service (SaaS)
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
- Free • Open Source
- Mac
- Windows
- Linux
A browser extension that uses AI to detect listing-type data, which can then be scraped into a CSV or Excel file with no coding required. It can automatically click the next button to continue to the next page. The extension runs entirely in the user’s browser.
- Web Scraping Tool
- Free Personal • Proprietary
- Online
- Microsoft Edge
- Google Chrome
- Chromium
This project is a Java web spider (web crawler) that can download files and resume interrupted downloads. It is also highly customizable with regular expressions and download templates.
- Website Downloader
- Free • Open Source
- Mac
- Windows
- Linux
- BSD
Extract information from websites with a visual point-and-click toolkit. Turn websites into useful data. Automate data workflows on the web, and process and transform data at any scale.
- Web Scraping Tool
- Freemium • Open Source
- Software as a Service (SaaS)
Listly, a web extension, simplifies web scraping without coding. It helps you collect and export large volumes of data into either Excel or Google Sheets.
- Web Scraping Tool
- Freemium • Proprietary
- Online
- Google Chrome