Apache Spark

Apache Spark is a multi-language engine for executing data engineering, data science, and machine learning on single-node machines or clusters.

License model

  • Free, Open Source

Country of Origin

  • United States

Platforms

  • Self-Hosted
  • Docker
  • Python

Features

  1.  Parallel Computing
  2.  Data analytics

Apache Spark information

AlternativeTo Categories

Network & Admin, Business & Commerce

GitHub repository

  • 40,951 Stars
  • 28,490 Forks
  • 199 Open Issues
  • Updated Apr 16, 2025

Our users have written 0 comments and reviews about Apache Spark, and it has received 11 likes.

Apache Spark was added to AlternativeTo by hal9000ht on Jun 23, 2014 and this page was last updated Oct 15, 2024.

What is Apache Spark?

Apache Spark is a multi-language engine for executing data engineering, data science, and machine learning on single-node machines or clusters.

Key features

  • Batch/streaming data: Unify the processing of your data in batches and real-time streaming, using your preferred language: Python, SQL, Scala, Java or R.
  • SQL analytics: Execute fast, distributed ANSI SQL queries for dashboarding and ad-hoc reporting. Runs faster than most data warehouses.
  • Data science at scale: Perform Exploratory Data Analysis (EDA) on petabyte-scale data without having to resort to downsampling.
  • Machine learning: Train machine learning algorithms on a laptop and use the same code to scale to fault-tolerant clusters of thousands of machines.

Official Links