A web scraper that's fast, intuitive and 100% free to use in the browser. Download data from websites and tables in seconds.
Cost / License
- Freemium
- Proprietary
Application type
Platforms
- Online
- Google Chrome
- Software as a Service (SaaS)




Free proxy service and web scraping API that allows you to scrape and parse any webpage's HTML with Cheerio to turn it into a personalized item dataset.





Textricator is a tool for extracting text from computer-generated PDFs and generating structured data. It is meant for cases where you have a bunch of PDFs with the same format (or one big, consistently formatted PDF) and want to extract the data to CSV or JSON.




Open-source web media server for browsing and streaming any video file format supported by ffmpeg, with an easy web interface for playback on any platform with an HTML5 browser, DLNA, or a Kodi plugin.

















Orange is an open-source, cross-platform data mining and machine learning suite. It features visual programming as an intuitive means of combining data analysis and interactive visualization methods into powerful workflows.




Knime is an open-source, cross-platform Java application whose name stands for "Konstanz Information Miner". It is used extensively for data mining, data analysis and optimization. It can be downloaded as the core application itself (Knime Desktop), or the whole SDK...




The Beaker Notebook is a new open source tool for research and data science. Its advanced UI allows you to focus on your data and your science, instead of getting frustrated by your tool. We designed it to be polyglot from the ground up.




SpagoBI is the only entirely Open Source Business Intelligence suite. It covers all the analytical areas of Business Intelligence projects, with innovative themes and engines. SpagoBI offers a wide range of analytical tools: Reporting, Multidimensional Analysis (OLAP), Charts...






Stan is a probabilistic programming language for data analysis, enabling automatic inference for a large class of statistical models.

Dradis is an open source framework to enable effective information sharing, especially during security assessments.


sn0int is a semi-automatic OSINT framework and package manager. It was built for IT security professionals and bug hunters to gather intelligence about a given target or about yourself. sn0int enumerates attack surface by semi-automatically processing public information and...

Pyspread is a non-traditional spreadsheet application that is based on and written in the programming language Python.

DataMelt (or DMelt) is a program for numeric computation, statistics, data analysis and data visualization.



ELKI: "Environment for Developing KDD-Applications Supported by Index-Structures" is a development framework for data mining algorithms written in Java. It includes a large variety of popular data mining algorithms, distance functions and index structures.



Lightweight open source reporting tool that lets users create their own HTML reports and dashboards just by dragging and dropping elements, powered by a semantic layer.



InfraNodus can visualize any research notes, ideas, texts, or even Google search results on a certain topic as a text network. The words are the nodes and their co-occurrences are the connections between them.



Apache Mahout is an Apache project to produce free implementations of distributed or otherwise scalable machine learning algorithms on the Hadoop platform. Mahout is a work in progress; the number of implemented algorithms has grown quickly, but there are still various...

ggraptR is an open source R package providing a GUI for visualization. It is based on principles of visualization analysis by Tamara Munzner, and also acts as a wrapper for functionality implemented in the grammar of graphics for R, ggplot2.


Facilitates web scraping and lead generation using no-code bots across LinkedIn, Twitter, Facebook, and more. Cloud-based APIs enhance productivity.




REPODS is an online data warehouse service for managing & analyzing data histories in data pods. Data can be imported via various interfaces. IoT devices can also stream data directly to a data pod for cross-analysis with other data warehouse data.