Apache Spark API reference

Databricks is built on top of Apache Spark, a unified analytics engine for big data and machine learning. For more information, see Apache Spark - What is Spark on the Databricks website.

Apache Spark has easy-to-use APIs for operating on large datasets. This includes a collection of over 100 operators for transforming data and familiar data frame APIs for manipulating semi-structured data. These APIs include:

To learn how to use the Apache Spark APIs on Databricks, see: