The Wayback Machine - https://web.archive.org/web/20221113045745/https://github.com/topics/databricks
Here are 410 public repositories matching this topic...
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Updated Nov 11, 2022
Python
Simple and Distributed Machine Learning
Updated Nov 12, 2022
Scala
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
A native Rust library for Delta Lake, with bindings into Python and Ruby.
Updated Nov 13, 2022
Rust
Updated Nov 11, 2022
Shell
Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow...
Databricks Terraform Provider
🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.
Updated Nov 10, 2022
Python
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Updated Oct 29, 2022
Scala
Apache Spark Connector for Azure Cosmos DB
Updated Oct 18, 2022
Scala
Manage your Databricks deployments and CI with code.
Updated Jan 4, 2022
Python
Scalable Data Science, course sets in big data using Apache Spark over Databricks and their mathematical, statistical and computational foundations using SageMath.
Updated Oct 25, 2022
HTML
Testing framework for Databricks notebooks
Updated Apr 29, 2022
Python
A set of UDFs and Procedures to extend BigQuery, Snowflake, Redshift, Postgres and Databricks with Spatial Analytics capabilities
Updated Nov 8, 2022
JavaScript
machine learning for genomic variants
Updated Nov 11, 2022
JavaScript
Capture deep metrics on one or all assets within a Databricks workspace
Updated Nov 11, 2022
Scala
A dbt adapter for Databricks.
Updated Nov 9, 2022
Python
Bloat-free, no BS cloud storage SDK.
Tools for Deploying Databricks Solutions in Azure
Updated Oct 20, 2022
PowerShell
Generate relevant synthetic data quickly for your projects. The Databricks data generator can be used to generate large simulated / synthetic data sets for testing, POCs, and other uses in Databricks environments, including in Delta Live Tables pipelines.
Updated Nov 11, 2022
Python