In this post, we share our Databricks Learning Notebook, which covers syntax and best practices for the PySpark DataFrame API and Spark SQL.
The notebook teaches the basics and shows how to handle common data analyst tasks such as missing and duplicate data and column transformations.
Although this notebook was created for data analyst use cases, anyone who performs data transformations on Databricks will benefit from it.
You can download the notebook via the link below. The zipped file contains the same notebook in DBC, HTML, and Python file formats. Import the DBC file into your workspace to view the notebook.
Click here to Download the Notebook
We divided the Databricks Learning Notebook into 3 main categories:
- BASIC DEFINITIONS & LINKS
- DATAFRAMES
- SPARK SQL