Apache Spark icon
Apache Spark icon

Apache Spark

Apache Spark is a multi-language engine for executing data engineering, data science, and machine learning on single-node machines or clusters.

Apache Spark screenshot 1

Cost / License

  • Free
  • Open Source

Application type

Platforms

  • Self-Hosted
  • Docker
  • Python
-
No reviews
11likes
0comments
0news articles

Features

Suggest and vote on features
  1.  Data analytics
  2.  Parallel Computing

 Tags

Apache Spark News & Activities

Highlights All activities

Recent News

No news, maybe you know any news worth sharing?
Share a News Tip

Recent activities

Show all activities

Apache Spark information

AlternativeTo Categories

Network & AdminBusiness & Commerce

GitHub repository

  •  42,470 Stars
  •  28,970 Forks
  •  212 Open Issues
  •   Updated  
View on GitHub
Apache Spark was added to AlternativeTo by hal9000ht on and this page was last updated .
No comments or reviews, maybe you want to be first?
Post comment/review

What is Apache Spark?

Apache Spark is a multi-language engine for executing data engineering, data science, and machine learning on single-node machines or clusters.

Key features

  • Batch/streaming data: Unify the processing of your data in batches and real-time streaming, using your preferred language: Python, SQL, Scala, Java or R.
  • SQL analytics: Execute fast, distributed ANSI SQL queries for dashboarding and ad-hoc reporting. Runs faster than most data warehouses.
  • Data science at scale: Perform Exploratory Data Analysis (EDA) on petabyte-scale data without having to resort to downsampling.
  • Machine learning: Train machine learning algorithms on a laptop and use the same code to scale to fault-tolerant clusters of thousands of machines.

Official Links