xenmaster's data-science tools

Data Science is more about learning concepts rather than software. These concepts include statistics, linear algebra, and ab-testing. But the following tools are the most commonly used in this practice.

Alex Ruiz
Alex RuizList by Alex Ruiz, last updated 2019-04-01
  1. Python icon

    Python is an interpreted, interactive, object-oriented, extensible programming language. It provides an extraordinary combination of clarity and versatility, and is free and comprehensively ported.

    • Programming Language
    • FreeOpen Source
    • Mac
    • Windows
    • Linux
    • Symbian S60
    • BSD
    • AROS
    • Haiku
    • AmigaOS
    • OpenSolaris
    • MorphOS
    • ...
    Python screenshot 1
  2. Basic Computer Skills

    Learning to work in the terminal is a good skill to have for anyone working in the computer science field. And no programmer's experience is complete without learning the version control power of Git!

  3. PowerShell icon

    PowerShell (including Windows PowerShell and PowerShell Core) is a task automation and configuration management framework from Microsoft, consisting of a command-line shell and associated scripting language built on the .NET Framework.

    • ...
    PowerShell screenshot 1
  4. Terminal icon

    Terminal (also referred to as Terminal.app) is a terminal emulator included in Apple's Mac OS X operating system. It originated in Mac OS X's predecessors, NEXTSTEP and OPENSTEP, and allows the user to interact with the computer through a command line interface.

    • ...
    Terminal screenshot 1
  5. GNOME Terminal icon

    GNOME Terminal is a terminal emulator for the GNOME desktop environment written by Havoc Pennington and others. Terminal emulators allow users to execute commands using a real UNIX shell while remaining on their graphical desktop.[

    • ...
    GNOME Terminal screenshot 1
  6. Git icon

    Git is a free & open source, distributed version control system designed to handle everything from small to very large projects with speed and efficiency.

    • Version Control System
    • FreeOpen Source
    • Mac
    • Windows
    • Linux
    • Android
    • iPhone
    • Chrome OS
    • Android Tablet
    • BSD
    • Linux Mobile
    • Haiku
    • ...
  7. Programming

    Python and R are the most commonly used programming languages. I've included additional IDEs (Integrated Development Environments) as well, two for Python (one with a desktop GUI, the other for the terminal) and one for R.

  8. Jupyter icon

    The Jupyter Notebook is a web application that allows you to create and share documents that contain live code, equations, visualizations and explanatory text. Uses include: data cleaning and transformation, numerical simulation, statistical modeling, machine learning and much...

    • FreeOpen Source
    • Mac
    • Windows
    • Linux
    • Online
    • Cloudron
    • ...
    The Jupyter Notebook is an open-source web application that allows you to create and share documents that contain live code, equations, visualizations and narrative text. Uses include: data cleaning and transformation, numerical simulation, statistical modeling, data visualization, machine learning, and much more.
  9. IPython icon

    IPython is an interactive shell for the Python programming language that offers enhanced introspection, additional shell syntax, syntax highlighting, tab completion and rich history. It is a component of the SciPy package.

    • FreeOpen Source
    • Mac
    • Windows
    • Linux
    • ...
    Normal startup
  10. R (programming language) icon

    R is a free software environment for statistical computing and graphics. It is a GNU project which is similar to the S language and environment which was developed at Bell Laboratories (formerly AT&T, now Lucent Technologies) by John Chambers and colleagues. R can be consider.

    • ...
    R (programming language) screenshot 1
  11. RStudio icon

    RStudio™ is an integrated development environment (IDE) for R. RStudio combines an intuitive user interface with powerful coding tools to help you get the most out of R.

    • IDE
    • FreeOpen Source
    • Mac
    • Windows
    • Linux
    • Xfce
    • ...
    RStudio on Windows
  12. Data Visualization and Manipulation

    Matplotlib is a basic data visualization tool. SciPy is a great choice for manipulating data and TensorFlow is a fantastic platform if you are interested in machine learning (especially running with Keras.io and scikit-learn).

  13. Matplotlib icon

    Matplotlib is a comprehensive library for creating static, animated, and interactive visualizations in Python. Matplotlib makes easy things easy and hard things possible.

    • FreeOpen Source
    • Mac
    • Windows
    • Linux
    • Online
    • Python
    • ...
    Matplotlib screenshot 1
  14. SciPy icon

    SciPy is a collection of mathematical algorithms and convenience functions built on NumPy. It adds significant power to Python by providing the user with high-level commands and classes for manipulating and visualizing data.

    • FreeOpen Source
    • Mac
    • Windows
    • Linux
    • BSD
    • Python
    • ...
    SciPy screenshot 1
  15. TensorFlow icon

    TensorFlow is an open source software library for machine learning in various kinds of perceptual and language understanding tasks. It was originally developed by the Google and later released under the Apache 2.0 open source license on Nov 9, 2015.

    • FreeOpen Source
    • Mac
    • Linux
    • ...
  16. Databases

    Below are the most commonly used databases for raw data processing power. I've included a relational database and a noSQL database for handing document driven data, particularly useful in the big-data space!

  17. PostgreSQL icon

    PostgreSQL is a powerful, open source object-relational database system with over 35 years of active development that has earned it a strong reputation for reliability, feature robustness, and performance.

    • ...
    PostgreSQL screenshot 1
  18. pgAdmin icon

    pgAdmin is the most popular and feature rich open source administration and development platform for PostgreSQL, the most advanced open source database in the world.

    • ...
    Welcome screen, standard theme
  19. MongoDB icon

    MongoDB is a document database with the scalability and flexibility that you want with the querying and indexing that you need

    • ...
  20. MongoDB Compass icon

    The GUI for MongoDB. Visually explore your data. Run ad hoc queries in seconds. Interact with your data with full CRUD. View and optimize your query performance. Compass empowers you to make smarter decisions about indexing, document validation, etc.

    • FreeProprietary
    • Mac
    • Windows
    • Linux
    • ...
    MongoDB Compass screenshot 1
  21. Business Data

    I've seen the following used frequently for data visualization on the business side. Pick one or more!

  22. D3.js icon

    D3 allows you to bind arbitrary data to a Document Object Model (DOM), and then apply data-driven transformations to the document. For example, you can use D3 to generate an HTML table from an array of numbers.

    • FreeOpen Source
    • Online
    • Self-Hosted
    • ...
    D3.js screenshot 1
  23. Microsoft Power BI icon

    Uncover powerful insights and turn them into impact,

    Connect to and visualize any data, and seamlessly infuse visuals into the apps you use every day.

    • Business Intelligence Tool
    • FreemiumProprietary
    • Windows
    • Online
    • Android
    • iPhone
    • Android Tablet
    • iPad
    • Microsoft Office 365
    • Microsoft Office Excel
    • Power BI for Office 365
    • ...
    Microsoft Power BI screenshot 1
  24. Tableau icon

    Tableau can help anyone see and understand their data. Connect to almost any database, drag and drop to create visualizations, and share with a click.

    • ...
    Tableau screenshot 1
  25. Microsoft Excel icon

    Microsoft Excel, part of the Microsoft 365 (Formerly Office) , is Microsoft's spreadsheet application. With the Microsoft Office Fluent user interface, rich data visualization, pivot table views, and professional-looking charts are easier to create...

    • Spreadsheet App
    • Mac
    • Windows
    • Android
    • iPhone
    • Android Tablet
    • Windows Phone
    • iPad
    • ...
    excel 365
  26. Distributed Cloud Computing

    Some people prefer using the cloud to do their dirty data-processing work. Pick one and go for it!

  27. Apache Hadoop icon

    Apache Hadoop is a open source software framework that supports data-intensive distributed applications licensed under the Apache v2 license. It enables applications to work with thousands of computational independent computers and petabytes of data.

    • FreeOpen Source
    • Mac
    • Windows
    • Linux
    • ...
  28. Microsoft Azure icon

    The Azure cloud platform is more than 200 products and cloud services designed to help you bring new solutions to life—to solve today’s challenges and create the future. Build, run, and manage applications across multiple clouds, on-premises, and at the edge, with the tools and...

    • ...
    Microsoft Azure screenshot 1
    Microsoft Azure screenshot 2
    Microsoft Azure screenshot 3
  29. Amazon Machine Learning allows developers to use machine learning. It provides visualization tools and wizards that guide you in the process of creating machine learning (ML) models. It makes it easy to obtain predictions using simple APIs.

    • Online
    • Amazon Web Services
    • ...

This list was created by Alex Ruiz Mar 6, 2019 and was last updated Apr 1, 2019. The list has 23 apps.

You can create your own lists from our large database of apps either for your own sake or if you want to share some great apps with other people.


No comments so far, maybe you want to be first?
Gu