Kestra is an infinitely scalable orchestration and scheduling platform, creating, running, scheduling, and monitoring millions of complex pipelines.
Cost / License
- Free
- Open Source (Apache-2.0)
Platforms
- Mac
- Windows
- Linux
- Online
- Self-Hosted




Kestra is an infinitely scalable orchestration and scheduling platform, creating, running, scheduling, and monitoring millions of complex pipelines.




Orbital automates integration between data sources (APIs, Databases, Queues and Functions). BFF's, API Composition and ETL pipelines that adapt as your specs change.


Cloud-based platform for managing complex data integrations using templates and pre-configured functions, enhancing diverse data projects.

CocoIndex is Python-native data transformation for any engineer, designed for AI workloads, with a smart incremental engine for always-fresh, explainable data.

A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagement. A friendly Learn-to-Rank engine.

Preparing for a data engineer interview and are overwhelmed by all the tools and concepts? Enroll now for free and learn how to ace the data engineering interview.
Quadratic is a Web-based spreadsheet application with Python, SQL, and Formulas that runs in the browser and as a native app (via Electron).

Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.




TABLUM.IO is a data management tool that specializes in data staging and preparation, specifically for raw and unstructured data from files, feeds, and API responses.




The concept behind Dataplane is to make it quicker and easier to construct a data mesh with robust data pipelines and automated workflows for businesses and teams of all sizes. In addition to being more user friendly, there has been an emphasis on scaling, resilience...




Shipyard’s your go-to cloud-based DataOps platform for secure data extraction, transformation, reverse ETL, workflow orchestration, monitoring, alerting, and more.

Open metadata and governance for enterprises - automatically capturing, managing and exchanging metadata between tools and platforms, no matter the vendor.



Mage.ai is an open-source data pipeline tool designed to simplify the process of building, running, and maintaining machine learning and data workflows. With an intuitive interface, it allows users to create powerful pipelines for ETL (Extract, Transform, Load) processes, data...



Dagster is a cloud-native data pipeline orchestrator for the whole development lifecycle, with integrated lineage and observability, a declarative programming model, and best-in-class testability.



High-performance synthetic data generator written in Rust. Produces GDPR/HIPAA-compliant test data for PostgreSQL & MySQL.

Context Data is an enterprise data infrastructure built to accelerate the development of data pipelines for Generative AI applications. The platform automates the process of setting up internal data processing and transformation flows using an easy-to-use connectivity framework...

Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

A next-generation data discovery and observability tool for startups and enterprises that helps to efficiently democratize data, powers collaboration of data science and data engineering teams, significantly reduces time to data discovery, cuts on data downtime and offers a...




Prefect is an orchestrator for data-intensive workflows. It's the simplest way to transform any Python function into a unit of work that can be observed and orchestrated. With Prefect, you can build resilient, dynamic workflows that react to the world around them and recover...




Coco Alemana allows data scientists, analysts and engineers to interact with their datasets in an interactive, visual way. Extend your actions with full SQL support. Spend less time writing code and more time on your analysis.
