Free software that downloads entire sites to local storage for offline use, maintaining the original link structure for seamless browsing.



Free software that downloads entire sites to local storage for offline use, maintaining the original link structure for seamless browsing.



Portia is an open source visual scraping tool, allows you to scrape websites without any programming knowledge required! Simply annotate pages you're interested in, and Portia will create a spider to extract data from similar pages.




Cloud-based platform for extracting data and automating website workflows, featuring headless browser support, advanced web crawling, reusable code acts and scalable storage.



Boost your site's SEO with SEO Tracer! Crawl fast, find broken links, analyze meta tags, and optimize. Free, private and secure.




ParseHub is a web scraping tool built to handle the modern web.
You can extract data from anywhere. ParseHub works with single-page apps, multi-page apps and just about any other modern web technology.
ParseHub can handle Javascript, AJAX, cookies, sessions and redirects. You c.

Minexa.ai is a next-generation tool that makes web scraping faster and more affordable with an AI-powered solution no other alternative has. Unlike others that require constant tweaking, struggle under heavy loads, or charge extra for natural language processing, Minexa adapts...




Free desktop SEO crawler - open source alternative to Screaming Frog and similar tools. Crawl websites, analyze links, extract SEO data, and export results without subscription fees.

A flexible web crawler and scraping tool using Playwright, supporting both BFS and DFS crawling strategies with screenshot capture and structured output. Installable via npm and usable both as a CLI and programmatically.



This project is a java web spider (web crawler) with the ability to download (and resume) files. It is also highly customizable with regular expressions and download templates.




Infatica boasts a global portfolio of residential IPs - over 2,500,000 residential socks5 proxies sourced from real consumers across dozens of countries. Support via tickets, live chat, and phone, with 24-7 response for urgent technical issues.



Open-source, extensible crawler for large-scale web archiving, preserves digital artifacts, offers plugin support, distributed crawling, and standardized export formats.

Extract information from web sites with a visual point-and-click toolkit. Turn websites into useful data. Automate data workflows on the web, process, and transform data at any scale.




Listly, a web extension, simplifies web scraping without coding. This helps you collect and export enormous volumes of data into either Excel or Google Sheets.




Using WebHarvy you can easily scrape Text, HTML, Images, URLs & Emails from any website, and save the scraped data in various formats.




Advanced web scraping API offers real-time data from sites, eCommerce, and more. Features live stats, easy onboarding, and AI data parsing.



Common Crawl builds and maintains an open repository of web crawl data that can be accessed and analyzed by anyone

ZennoPoster is a versatile automation tool for web scraping, data extraction, and task automation, perfect for digital marketers, developers, and business owners.




Mixnode is a fast, flexible, massively scalable platform to extract and analyze data from the web.

Apache Nutch is a highly extensible and scalable open source web crawler software project.
PromptCloud is a leading web scraping solution providing clean data, quality service, managed infrastructure support, and unrivaled domain expertise.



A web scraping service to collect data from websites, without any programming or DIY tools.
Magifind uses advanced AI and natural language processing to truly understand the intent behind a customer's search, not just match keywords. This semantic search approach allows Magifind to return highly relevant results, guiding customers to the exact products they are...

Oxydata is a tech company that provides advanced web scraping and data extraction services, including powerful data collection APIs. With Oxydata, you can turn web content into actionable data.