AlternativeTo Logo

    Apache Spark Alternatives

    Apache Spark is described as 'fast and general engine for large-scale data processing' and is a Cloud Computing service in the business & commerce category. There are eight alternatives to Apache Spark for a variety of platforms, including Linux, Mac, Windows, Online / Web-based and BSD. The best alternative is Apache Flink, which is both free and Open Source. Other great apps like Apache Spark are Apache Hadoop, Amazon Kinesis, Disco MapReduce and Heron.

    Apache Spark is mainly a Cloud Computing Service but alternatives to it may also be Web Analytics Services. Filter by these if you want a narrower list of alternatives or looking for a specific functionality of Apache Spark.
    This page was last updated Jun 28, 2019
    • FreeOpen Source
    • Mac
    • Windows
    • Linux
    More
    Apache Spark™ is a fast and general engine for large-scale data processing.
    Learn more about Apache Spark

    1. Flink’s core is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams.
      No screenshots yet
      • FreeOpen Source
      • Mac
      • Windows
      • Linux
      More
      Apache Hadoop is a open source software framework that supports data-intensive distributed applications licensed under the Apache v2 license. It enables applications to work with thousands of computational independent computers and petabytes of data.
      No screenshots yet


    2. Amazon Kinesis services make it easy to work with real-time streaming data in the AWS cloud.
      No screenshots yet
      • FreeOpen Source
      • Mac
      • Windows
      • Linux
      More
      Disco is an implementation of mapreduce for distributed computing. Disco supports parallel computations over large data sets, stored on an unreliable cluster of computers, as in the original framework created by Google.
      No screenshots yet
    3. Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter.
      No screenshots yet


      • FreeOpen Source
      • Mac
      • Windows
      • Linux
      • BSD
      More
      Apache Storm is a free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime processing what Hadoop did for batch processing.
      No screenshots yet
      • FreeOpen Source
      • Linux
      More
      More
      Apache Gearpump is a real-time big data streaming engine. The name Gearpump is a reference to the engineering term “gear pump” which is a super simple pump that consists of only two gears, but is very powerful at streaming water.

      Discontinued

      The Gearpump podling retired on 2018-09-19

      No screenshots yet
    4. Upsolver is an In-Memory Data Preparation Platform. It removes the complexity from Big Data and Real-Time projects and shortens their implementation time from weeks/months to several hours, literally.
    Showing 8 of 8 alternatives