Databricks Data Science & Engineering is the classic Databricks environment for collaboration among data scientists, data engineers, and data analysts. It also forms the backbone of the Databricks Machine Learning environment.
If you are a data analyst who works primarily with SQL queries and BI tools, you may prefer the Databricks SQL persona-based environment.
The Databricks Data Science & Engineering guide provides how-to guidance to help you get the most out of the Databricks collaborative analytics platform. For getting-started tutorials and introductory information, see Get started: Free trial & setup and What is Databricks?
- Structured Streaming
Learn how to use Apache Spark Structured Streaming to express computation on streaming data in Databricks.
- Delta Live Tables
Learn how to build data processing pipelines with Databricks Delta Live Tables.
Learn about the types of Databricks runtimes and runtime contents.
Learn about Databricks clusters and how to create and manage them.
Learn what a Databricks notebook is, and how to use and manage notebooks to process, analyze, and visualize your data.
Learn how to work with data processing tools and frameworks in Databricks.
Learn how to use and manage libraries in Databricks.
Learn how to use Git to version control your notebooks and other files for development in Databricks.
Learn about Databricks File System (DBFS), a distributed file system mounted into a Databricks workspace and available on Databricks clusters.
Learn about options for working with files on Databricks.
Learn how to migrate workloads to Databricks.