© Copyright VLR Training | 2020
Azure Data Factory is a cloud-based data integration service that allows you to create data-driven workflows in the cloud for orchestrating and automating data movement and data transformation.
Apache Spark is an open-source unified analytics engine for large-scale data processing. Spark provides an interface for programming clusters with implicit data parallelism and fault tolerance
45 Days
8 am to 9 am (IST)
Online
Basics
Introduction to Data Engineering
Python,SQL and Azure Portal Access
Introduction to ADF V2
Practice basics of ADF
Activities Sessions
scenario based pipeline building
Debugging
Triggers in Pipeline
Real time project showcasing
Basics of Big Data
Spark Basics
Spark Structured API
Spark For ML
Spark Structured API
Spark SQL
Databricks
Optimisation techniques
Improving performance of Spark Jobs
ADF – Databricks Linkage
End-to-End project
A quick review
© Copyright VLR Training | 2020