Google Cloud Dataproc Alternatives

Google Cloud Dataproc is described as 'Dataproc is a fully managed and highly scalable service for running Apache Hadoop, Apache Spark, Apache Flink, Presto, and 30+ open source tools and frameworks. Use Dataproc for data lake modernization, ETL, and secure data science, at scale, integrated with' and is an website in the development category. There are more than 10 alternatives to Google Cloud Dataproc, not only websites but also apps for a variety of platforms, including Linux, SaaS, Kubernetes and Self-Hosted apps. The best Google Cloud Dataproc alternative is Cloudera CDH. It's not free, so if you're looking for a free alternative, you could try Amazon EMR or ILUM. Other great sites and apps similar to Google Cloud Dataproc are Gigasheet, IBM InfoSphere BigInsights, Sybase IQ and Platfora.

Copy a direct link to this comment to your clipboard
Google Cloud Dataproc alternatives page was last updated

Alternatives list

  1. Cloudera CDH icon
     2 likes
    Copy a direct link to this comment to your clipboard

    Cloudera's open-source Apache Hadoop distribution, CDH (Cloudera Distribution Including Apache Hadoop), targets enterprise-class deployments of that technology. Cloudera says that more than 50% of its engineering output is donated upstream to the various Apache-licensed open...

    Cost / License

    • Paid
    • Proprietary

    Platforms

    • Linux
    • Online
     
    • Cloudera CDH is the most popular Web-based & Linux alternative to Google Cloud Dataproc.

    • Cloudera CDH is the most popular commercial alternative to Google Cloud Dataproc.

    • Cloudera CDH is Paid and ProprietaryGoogle Cloud Dataproc is also Paid and Proprietary
  2. Copy a direct link to this comment to your clipboard

    Amazon EMR is the industry-leading cloud big data solution for petabyte-scale data processing, interactive analytics, and machine learning using open-source frameworks such as Apache Spark, Apache Hive, and Presto.

    Cost / License

    • Freemium (Subscription)
    • Proprietary

    Platforms

    • Software as a Service (SaaS)
    • Amazon Web Services
     
    • Amazon EMR is the most popular SaaS alternative to Google Cloud Dataproc.

    • Amazon EMR is the most popular free alternative to Google Cloud Dataproc.

    • Amazon EMR is Freemium and ProprietaryGoogle Cloud Dataproc is Paid and Proprietary
  3. ILUM icon
     5 likes
    Copy a direct link to this comment to your clipboard

    Ilum is a free data lakehouse platform designed for scalability, flexibility, and simplicity.

    Cost / License

    • Freemium
    • Proprietary

    Platforms

    • Self-Hosted
    • Software as a Service (SaaS)
    • Kubernetes
     
    • ILUM is the most popular Self-Hosted alternative to Google Cloud Dataproc.

    • ILUM is Freemium and ProprietaryGoogle Cloud Dataproc is Paid and Proprietary
    • ILUM is Lightweight and Privacy focusedGoogle Cloud Dataproc is not according to our users
  4. Gigasheet icon
     1 like
    Copy a direct link to this comment to your clipboard

    The big data spreadsheet that requires no coding skills.

    Cost / License

    • Freemium (Subscription)
    • Proprietary

    Application type

    Platforms

    • Online
    • Software as a Service (SaaS)
     
  5. Copy a direct link to this comment to your clipboard

    IBM InfoSphere BigInsights brings the power of Hadoop to the enterprise. Apache Hadoop is the open source software framework, used to reliably managing large volumes of structured and unstructured data.

    Cost / License

    • Freemium
    • Proprietary

    Platforms

    • Linux
    • Online
     
  6. Sybase IQ icon
     1 like
    Copy a direct link to this comment to your clipboard

    SAP Sybase IQ, a highly optimized analytics server software which provides business intelligence through column-based oriented architecture tool for data warehousing and mining. Our Sybase analytic database management software server product provides faster results for...

    Cost / License

    • Paid
    • Proprietary

    Platforms

    • Windows
    • Linux
     
    • Sybase IQ is the most popular Windows alternative to Google Cloud Dataproc.

    • Sybase IQ is Paid and ProprietaryGoogle Cloud Dataproc is also Paid and Proprietary
  7. Platfora icon
     2 likes
    Copy a direct link to this comment to your clipboard

    Platfora puts the power of Big Data Analytics into the hands of business users, providing self-service analytics capability across all of your customer interaction, machine and transactional data sets. With Platfora, you can visualize insights and make decisions that were never...

    Cost / License

    • Paid
    • Proprietary

    Platforms

    • Online
     
  8. Stackable icon
     2 likes
    Copy a direct link to this comment to your clipboard

    The Stackable Data Platform was designed with openness and flexibility in mind. It provides you with a curated selection of the best open source data apps like Apache Kafka®, Apache Druid, Trino and Apache Spark™.

    Cost / License

    • Freemium (Subscription)
    • Open Source

    Platforms

    • Kubernetes
    • Self-Hosted
    • Software as a Service (SaaS)
     
    • Stackable is the most popular Open Source alternative to Google Cloud Dataproc.

    • Stackable is Freemium and Open SourceGoogle Cloud Dataproc is Paid and Proprietary
  9. Datameer icon
     1 like
    Copy a direct link to this comment to your clipboard

    Datameer is a business-user-focused business intelligence (BI) platform for Hadoop. But Datameer doesn't treat Hadoop as an island of information; it can connect to any data source through JDBC, Hive, HTTP, or other standards.

    Cost / License

    • Paid
    • Proprietary

    Platforms

    • Mac
    • Windows
    • Linux
    • Online
     
    • Datameer is the most popular Mac alternative to Google Cloud Dataproc.

    • Datameer is Paid and ProprietaryGoogle Cloud Dataproc is also Paid and Proprietary
  10. MapR icon
     Like
    Copy a direct link to this comment to your clipboard

    MapR makes Apache Hadoop more affordable and easier to use for big data analytics, business intelligence, distributed computing, machine learning, distributed file systems and map reduce grid computing.

    Cost / License

    • Freemium
    • Proprietary

    Platforms

    • Linux
    • Online
     
  11. Copy a direct link to this comment to your clipboard

    Greenplum HD is an open-source certified and supported version of the Apache Hadoop stack. It includes Hadoop Distributed File System (HDFS), MapReduce, Hive, Pig, HBase, and ZooKeeper. Greenplum HD’s packaged Hadoop distribution removes the need in building out a Hadoop cluster...

    Cost / License

    • Free
    • Open Source

    Platforms

    • Linux
    • Online
     
  12. Copy a direct link to this comment to your clipboard

    Run your code faster, without the infrastructure hassle

    Domino makes it easy to run your Python, R, MATLAB, and Julia code on more powerful hardware with one command, so you can get your results faster. Customers tell us these features can reduce set-up and configuration times b.

    Cost / License

    • Paid
    • Proprietary

    Platforms

    • Online
     
12 of 15 Google Cloud Dataproc alternatives