About this course
Learn how to use Apache Spark and clusters running on the Azure Databricks platform to run data analytics workloads in a data lakehouse.
Course Outline
Module 1: Explore Azure Databricks.
• Get started with Azure Databricks.
• Identify Azure Databricks workloads.
• Understand key concepts.
• Exercise – Explore Azure Databricks.
Module 2: Use Apache Spark in Azure Databricks.
• Get to know Spark.
• Create a Spark cluster.
• Use Spark in notebooks.
• Use Spark to work with data files.
• Visualize data.
• Exercise – Use Spark in Azure Databricks.
Module 3: Use Delta Lake in Azure Databricks.
• Get started with Delta Lake.
• Create Delta Lake tables.
• Create and query catalog tables.
• Use Delta Lake for streaming data.
• Exercise – Use Delta Lake in Azure Databricks.
Module 4: Use SQL Warehouses in Azure Databricks.
• Get started with SQL Warehouses.
• Create databases and tables.
• Create queries and dashboards.
• Exercise – Use a SQL Warehouse in Azure Databricks.
Module 5: Run Azure Databricks Notebooks with Azure Data Factory.
• Understand Azure Databricks notebooks and pipelines.
• Create a linked service for Azure Databricks.
• Use a Notebook activity in a pipeline.
• Use parameters in a notebook.
• Exercise – Run an Azure Databricks Notebook with Azure Data Factory.