Apache Spark Alternatives
Apache Spark is described as *'fast and general engine for large-scale data processing. Speed: Run programs up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk.
Spark has an advanced DAG execution engine that supports cyclic data flow and in-memory computing'* and is a Cloud Computing Service in the business & commerce category. There are eight alternatives to Apache Spark for a variety of platforms, including Linux, Mac, Windows, Online / Web-based and BSD. The best alternative is Apache Flink, which is both free and open source. Other great apps like Apache Spark are Apache Hadoop, Amazon Kinesis, Disco MapReduce and Heron.
Apache Spark alternatives are mainly Cloud Computing Services but may also be Web Analytics Services. Filter by these if you want a narrower list of alternatives or are looking for a specific functionality of Apache Spark.
Apache Flink
- Free • Open Source
- Cloud Computing Service
- Mac
- Windows
- Linux
- BSD
Flink’s core is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams.
Apache Hadoop
- Free • Open Source
- Mac
- Windows
- Linux
Apache Hadoop is an open source software framework, licensed under the Apache v2 license, that supports data-intensive distributed applications. It enables applications to work with thousands of computationally independent computers and petabytes of data.
Amazon Kinesis
- Paid • Proprietary
- Web Analytics Service
- Online
Amazon Kinesis services make it easy to work with real-time streaming data in the AWS cloud.
Disco MapReduce
- Free • Open Source
- Mac
- Windows
- Linux
Disco is an implementation of MapReduce for distributed computing. Disco supports parallel computations over large data sets stored on an unreliable cluster of computers, as in the original framework created by Google.
Heron
- Free • Open Source
- Linux
- Self-Hosted
Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter.
Apache Storm
- Free • Open Source
- Mac
- Windows
- Linux
- BSD
Apache Storm is a free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime processing what Hadoop did for batch processing.
Gearpump
- Free • Open Source
- Linux
Apache Gearpump is a real-time big data streaming engine. The name Gearpump is a reference to the engineering term “gear pump” which is a super simple pump that consists of only two gears, but is very powerful at streaming water.
Discontinued: The Gearpump podling retired on 2018-09-19.
Upsolver
- Paid • Proprietary
- Cloud Computing Service
- Online
Upsolver is an in-memory data preparation platform. It removes the complexity from Big Data and real-time projects and shortens their implementation time from weeks or months to several hours.