← 返回首页
dataops · GitHub Topics · GitHub
Skip to content

Navigation Menu

Toggle navigation
Sign in
Appearance settings
Search or jump to...

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Appearance settings
Resetting focus
#

DataOps

DataOps is an automated, process-oriented methodology, used by analytic and data teams, to improve the quality and reduce the cycle time of data analytics. While DataOps began as a set of best practices, it has now matured to become a new and independent approach to data analytics. DataOps applies to the entire data lifecycle from data preparation to reporting, and recognizes the interconnected nature of the data analytics team and information technology operations.

Here are 245 public repositories matching this topic...

Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..

  • Updated May 24, 2026
  • Rust

Redpanda Console is a developer-friendly UI for managing your Kafka/Redpanda workloads. Console gives you a simple, interactive approach for gaining visibility into your topics, masking data, managing consumer groups, and exploring real-time data with time-travel debugging.

  • Updated May 24, 2026
  • TypeScript

Scalable and efficient data transformation framework - backwards compatible with dbt.

  • Updated May 25, 2026
  • Python

An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈

  • Updated Jan 10, 2025
  • Jupyter Notebook

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

  • Updated May 22, 2026
  • Python

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

  • Updated May 24, 2026
  • HTML

Kafka Docker for development. Kafka, Zookeeper, Schema Registry, Kafka-Connect, , 20+ connectors

  • Updated Nov 20, 2025
  • Shell

Cloud Native DataOps & AIOps Platform | 云原生数智运维平台

  • Updated Dec 13, 2025
  • Java

Support agile DataOps Based on Flink, DataX and Flink-CDC, Chunjun with Web-UI

  • Updated May 24, 2026
  • Java

Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

  • Updated Jun 8, 2024
  • Go

Tenzir is the data pipeline engine for security teams.

  • Updated May 24, 2026
  • C++

DataOps for Microsoft Data Platform technologies. https://aka.ms/dataops-repo

  • Updated Feb 25, 2026
  • Shell

A list of tools for annotating data, managing annotations, etc.

  • Updated Aug 1, 2024

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

  • Updated Apr 26, 2026
  • Python

One framework to develop, deploy and operate data workflows with Python and SQL.

  • Updated May 18, 2026
  • Python

Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for the Snowflake data warehouse.

  • Updated Mar 13, 2025
  • Python

The data-validation toolkit for enhanced dbt (data build tool) PR review

  • Updated May 25, 2026
  • TypeScript

Power BI DevOps & Source Control Tool

  • Updated Jan 30, 2026
  • C#

Open data platform based on Kubernetes. Scaleph supports SeaTunnel、Flink and Doris backended by SeaTunnel on Flink engine、Flink Kubernetes Operator and Doris operator.

  • Updated Dec 17, 2025
  • Java
Load more…
Followers 51 followers Website github.com/topics/dataops Wikipedia Wikipedia

Related topics

open-data

Footer

© 2026 GitHub, Inc.