Workflow Engine for Kubernetes
-
Updated
May 4, 2026 - Go
Workflow Engine for Kubernetes
Fancy stream processing made operationally mundane
Data pipelines for cloud config and security data. Build cloud asset inventory, CSPM, FinOps, and vulnerability management solutions. Extract from AWS, Azure, GCP, and 70+ cloud and SaaS sources.
lakeFS - Data version control for your data lake | Git for data
Privacy and Security focused Segment-alternative, in Golang and React
Memphis.dev is a highly scalable and effortless data streaming platform
Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.
Community-driven, simple, yet powerful framework for fast, cost-effective distributed Compute over Data.
Conduit streams data between data stores. Kafka Connect replacement. No JVM required.
Unified MySQL, Postgres & FlightSQL Server, Powered by DuckDB.
A collection of online resources to help you on your Tech journey.
Service for bulk-loading data to databases with automatic schema management (Redshift, Snowflake, BigQuery, ClickHouse, Postgres, MySQL)
A lightweight CLI tool for versioning data alongside source code and building data pipelines.
A hybrid CLI/TUI tool written in Go for viewing Pokémon data from the terminal! Also doubles as a Data Engineering project.
Transform your pythonic research to an artifact that engineers can deploy easily.
Beneath is a serverless real-time data platform ⚡️
rtdl makes it easy to build and maintain a real-time data lake
A Kubernetes Operator to orchestrate Benthos pipelines
ETL / ELT / Reverse ETL Framework powered by DuckDB, designed to seamlessly integrate and process data from diverse sources. It leverages Markdown as a configuration medium, where YAML blocks define metadata for each data source, and embedded SQL blocks specify the extraction, transformation, and loading logic.
Add a description, image, and links to the data-engineering topic page so that developers can more easily learn about it.
To associate your repository with the data-engineering topic, visit your repo's landing page and select "manage topics."