Skip to content
@databrickslabs

Databricks Labs

Labs projects to accelerate use cases on the Databricks Unified Analytics Platform

Pinned Loading

  1. ucx ucx Public

    Automated migrations to Unity Catalog

    Python 296 100

  2. dolly dolly Public

    Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

    Python 10.8k 1.1k

  3. mosaic mosaic Public

    An extension to the Apache Spark framework that allows easy and fast processing of very large geospatial datasets.

    Jupyter Notebook 318 85

  4. blueprint blueprint Public

    Baseline for Databricks Labs projects written in Python

    Python 58 14

Repositories

Showing 10 of 41 repositories
  • lakebridge Public

    Accelerates migrations to Databricks by automating key migration activities

    databrickslabs/lakebridge’s past year of commit activity
    Python 124 87 288 (4 issues need help) 23 Updated Feb 20, 2026
  • dqx Public

    Databricks framework to validate Data Quality of pySpark DataFrames and Tables

    databrickslabs/dqx’s past year of commit activity
    Python 377 88 72 (1 issue needs help) 8 Updated Feb 20, 2026
  • databrickslabs/lakeflow-community-connectors’s past year of commit activity
    Python 16 56 0 16 Updated Feb 20, 2026
  • pytester Public

    Python Testing for Databricks

    databrickslabs/pytester’s past year of commit activity
    Python 112 14 41 2 Updated Feb 20, 2026
  • sandbox Public

    Experimental labs projects

    databrickslabs/sandbox’s past year of commit activity
    Python 61 52 27 20 Updated Feb 20, 2026
  • ucx Public

    Automated migrations to Unity Catalog

    databrickslabs/ucx’s past year of commit activity
    Python 296 100 298 (4 issues need help) 27 Updated Feb 19, 2026
  • ontos Public

    Business Semantics for Unity Catalog

    databrickslabs/ontos’s past year of commit activity
    Python 119 30 45 4 Updated Feb 19, 2026
  • kasal Public
    databrickslabs/kasal’s past year of commit activity
    Python 66 25 27 7 Updated Feb 18, 2026
  • brickster Public

    R Toolkit for Databricks

    databrickslabs/brickster’s past year of commit activity
    R 77 Apache-2.0 15 6 1 Updated Feb 16, 2026
  • dlt-meta Public

    Metadata driven Spark Declarative Pipelines framework for bronze/silver pipelines

    databrickslabs/dlt-meta’s past year of commit activity
    Python 242 106 31 (3 issues need help) 3 Updated Feb 11, 2026