You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A production-ready, config-driven Databricks Data Quality framework built on
databricks-labs-dqx. It generates DQ rules
from plain-language business descriptions, runs them across one or many tables, persists
metric results to Delta audit tables, detects drift, enforces SLAs, routes bad rows to
quarantine, and deploys end-to-end via Databricks Asset Bundles (DAB).
Features
Area
Capability
Rule execution
AI-assisted rule generation from business rules; single & multi-table parallel runs
Audit trail
Partitioned Delta audit table with DQ scores per run
Drift alerting
Metric-quality, row-count, and schema drift detection
SLA enforcement
Per-table quality thresholds; breach audit table
Quarantine
Error rows routed to <table>_quarantine Delta tables
Notifications
Slack, Teams webhook, and SMTP email
Rule registry
Versioned rule history with rollback
CLI
dq-framework command for setup, validate, run, score, schedule
DAB deployment
Full Databricks Asset Bundle with CI/CD via Azure DevOps
# Install CLI extra
pip install "dq_framework[cli]"# Bootstrap tables and views
dq-framework setup --catalog my_catalog --schema dq
# Validate config for a table
dq-framework validate my_catalog.bronze.orders --config-table my_catalog.dq.dq_config
# Run DQ for all tables
dq-framework run --config-table my_catalog.dq.dq_config --catalog-schema my_catalog.dq
Public API
Core
Class / Function
Description
DQConfig
Reads and updates the DQ config Delta table
DQRunner
Runs DQ rules — single or multi-table
DQBatchResult
Per-table result dataclass (success, duration, error)
The package targets the Databricks runtime (Unity Catalog, Delta, WorkspaceClient).
The test suite mocks Spark/Delta and runs anywhere. Integration tests run on Databricks.
A production-ready, config-driven Data Quality framework powered by databricks-labs-dqx. It converts plain-language business rules into scalable checks featuring Delta auditing, drift detection, SLA enforcement, bad-row quarantine, and end-to-end DAB deployment.