Skip to content
Change the repository type filter

All

    Repositories list

    • dbignite

      Public
      Low friction integration for performing analytics on FHIR bundles by extracting resources and flattening
      Python
      Other
      12000Updated Mar 3, 2025Mar 3, 2025
    • remorph

      Public
      Cross-compiler and Data Reconciler into Databricks Lakehouse
      Scala
      Other
      35000Updated Feb 21, 2025Feb 21, 2025
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      Apache License 2.0
      28k000Updated Feb 21, 2025Feb 21, 2025
    • composer

      Public
      Supercharge Your Model Training
      Python
      Apache License 2.0
      435000Updated Feb 21, 2025Feb 21, 2025
    • Use personalized images to enhance the output of an image generating model
      Python
      Other
      4000Updated Feb 21, 2025Feb 21, 2025
    • sfdc-byom

      Public
      Modelling Databricks and Salesforce data to help your customers and improve your business outcomes
      Python
      Other
      1000Updated Feb 21, 2025Feb 21, 2025
    • State of the Art Natural Language Processing with John Snow Labs
      Scala
      Apache License 2.0
      722000Updated Feb 21, 2025Feb 21, 2025
    • dbdemos

      Public
      Demos to implement your Databricks Lakehouse
      HTML
      Other
      111000Updated Feb 21, 2025Feb 21, 2025
    • The Security Reference Architecture (SRA) implements typical security features as Terraform Templates that are deployed by most high-security organizations, and enforces controls for the largest risks that customers ask about most often.
      HCL
      Other
      52000Updated Feb 21, 2025Feb 21, 2025
    • pixels

      Public
      Facilitates simple large scale processing of HLS Medical images, documents, zip files. Previously at https://github.com/dmoore247/pixels
      JavaScript
      Other
      22000Updated Feb 21, 2025Feb 21, 2025
    • anomalib

      Public
      An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.
      Python
      Apache License 2.0
      715000Updated Feb 21, 2025Feb 21, 2025
    • LLM training code for MosaicML foundation models
      Python
      Apache License 2.0
      548000Updated Feb 21, 2025Feb 21, 2025
    • DataOps for the Modern Data Warehouse on Microsoft Azure. https://aka.ms/mdw-dataops.
      Shell
      MIT License
      487000Updated Feb 21, 2025Feb 21, 2025
    • Public runnable examples of using John Snow Labs' NLP for Apache Spark.
      Jupyter Notebook
      Apache License 2.0
      610000Updated Feb 21, 2025Feb 21, 2025
    • This repository contains code example used and shared through Databricks Blog posts
      Python
      Other
      9000Updated Feb 12, 2025Feb 12, 2025
    • Bootstrap your large scale forecasting solution on Databricks with Many Models Forecasting (MMF)
      Python
      Other
      25000Updated Feb 12, 2025Feb 12, 2025
    • Help augment diagnostic workflows with this Databricks Solution Accelerator for pathology image analysis. Now you can rapidly process thousands of whole slide images in minutes and use machine learning to automate the detection of metastasis.
      Python
      Other
      11000Updated Feb 3, 2025Feb 3, 2025
    • This repo provides a customizable stack for starting new ML projects on Databricks that follow production best-practices out of the box.
      Python
      Apache License 2.0
      181000Updated Feb 3, 2025Feb 3, 2025
    • Security Analysis Tool (SAT) analyzes customer's Databricks account and workspace security configurations and provides recommendations that help them follow Databrick's security best practices. When a customer runs SAT, it will compare their workspace configurations against a set of security best practices and delivers a report.
      Python
      Other
      48000Updated Feb 3, 2025Feb 3, 2025
    • This repo provides learning materials and production-ready code to build a high-quality RAG application using Databricks.
      Python
      Other
      99100Updated Feb 3, 2025Feb 3, 2025
    • LLM Bootcamp Series
      Python
      57000Updated Feb 3, 2025Feb 3, 2025
    • hls-tcga

      Public
      Load RNA expression profiles from TCGA and associated clinical data into the Databricks lakehouse platform, and subsequently perform diverse analyses on the dataset
      Python
      Other
      3000Updated Jan 21, 2025Jan 21, 2025
    • Notebooks for the Natural Language Processing with Transformers
      Jupyter Notebook
      Apache License 2.0
      1.3k000Updated Jan 21, 2025Jan 21, 2025
    • Burning Through Electronic Health Records in Real Time With Smolder
      Scala
      Other
      3000Updated Jan 21, 2025Jan 21, 2025
    • Media Mix Modeling Accelerator
      Python
      Other
      3000Updated Jan 21, 2025Jan 21, 2025
    • Low effort linking and easy de-duplication. Databricks ARC provides a simple, automated, lakehouse integrated entity resolution solution for intra and inter data linking.
      Python
      Other
      22000Updated Jan 21, 2025Jan 21, 2025
    • Gen AI application to estimate of the cost of payer treatment, service or procedure
      Python
      Other
      2000Updated Jan 21, 2025Jan 21, 2025
    • Examples of Databricks Asset Bundles
      Python
      Other
      44000Updated Jan 21, 2025Jan 21, 2025
    • Demonstrates how to use various generative AI forecasting models from within Databricks.
      Python
      Other
      8000Updated Jan 21, 2025Jan 21, 2025
    • hub

      Public
      A library for transfer learning by reusing parts of TensorFlow models.
      Python
      Apache License 2.0
      1.7k000Updated Jan 21, 2025Jan 21, 2025