Azure Databricks – Build data engineering and AI/ML pipeline
Azure Databricks – Build data engineering and AI/ML pipeline, available at $44.99, has an average rating of 2.8, with 49 lectures, based on 25 reviews, and has 152 subscribers.
You will learn about What is Anomaly detection? How to apply unsupervised learning algorithms Isolation Forest, KNN and Clustering based Approach to detect anomalies? Step by Step guide to perform ETL operations using Azure Databricks Understand DataLakeHouse Architecture Build Data Pipeline using Azure Tech stack machine learning model interpretable shapley values Spark structured streaming with Kafka Spark Structured streaming with Azure Event Hub Use MLFlow for managing the end-to-end machine learning lifecycle Anomaly detection on Time series data Building CI/CD Pipeline using Azure Devops Building Data Pipeline using Azure Data Factory Productionizing model using Azure Function and Docker This course is ideal for individuals who are Data Engineers, Data Architect, ETL developer, Data Scientist, Big Data Developer It is particularly useful for Data Engineers, Data Architect, ETL developer, Data Scientist, Big Data Developer.
Enroll now: Azure Databricks – Build data engineering and AI/ML pipeline
Summary
Title: Azure Databricks – Build data engineering and AI/ML pipeline
Price: $44.99
Average Rating: 2.8
Number of Lectures: 49
Number of Published Lectures: 49
Number of Curriculum Items: 49
Number of Published Curriculum Objects: 49
Original Price: ₹1,199
Quality Status: approved
Status: Live
What You Will Learn
- What is Anomaly detection?
- How to apply unsupervised learning algorithms Isolation Forest, KNN and Clustering based Approach to detect anomalies?
- Step by Step guide to perform ETL operations using Azure Databricks
- Understand DataLakeHouse Architecture
- Build Data Pipeline using Azure Tech stack
- machine learning model interpretable shapley values
- Spark structured streaming with Kafka
- Spark Structured streaming with Azure Event Hub
- Use MLFlow for managing the end-to-end machine learning lifecycle
- Anomaly detection on Time series data
- Building CI/CD Pipeline using Azure Devops
- Building Data Pipeline using Azure Data Factory
- Productionizing model using Azure Function and Docker
Who Should Attend
- Data Engineers, Data Architect, ETL developer, Data Scientist, Big Data Developer
Target Audiences
- Data Engineers, Data Architect, ETL developer, Data Scientist, Big Data Developer
This course is designed to help you develop the skill necessary to perform ETL operations in Databricks, build unsupervised anomaly detection models, learn MLOPS, perform CI/CD operations in databricks and Deploy machine learning models into production.
Big Data engineering:
Big data engineers interact with massive data processing systems and databases in large-scale computing environments. Big data engineers provide organizations with analyses that help them assess their performance, identify market demographics, and predict upcoming changes and market trends.
Azure Databricks:
Azure Databricks is a data analytics platform optimized for the Microsoft Azure cloud services platform. Azure Databricks offers three environments for developing data intensive applications: Databricks SQL, Databricks Data Science & Engineering, and Databricks Machine Learning.
Anomlay detection:
Anomaly detection (aka outlier analysis) is a step in data mining that identifies data points, events, and/or observations that deviate from a dataset’s normal behavior. Anomalous data can indicate critical incidents, such as a technical glitch, or potential opportunities, for instance a change in consumer behavior. Machine learning is progressively being used to automate anomaly detection.
Data Lake House:
A data lakehouse is a data solution concept that combines elements of the data warehouse with those of the data lake. Data lakehouses implement data warehouses’ data structures and management features for data lakes, which are typically more cost-effective for data storage .
Explainable AI:
Explainable AI is artificial intelligence in which the results of the solution can be understood by humans. It contrasts with the concept of the “black box” in machine learning where even its designers cannot explain why an AI arrived at a specific decision.
Spark structured streaming:
Structured Streaming is a scalable and fault-tolerant stream processing engine built on the Spark SQL engine. .In short, Structured Streaming provides fast, scalable, fault-tolerant, end-to-end exactly-once stream processing without the user having to reason about streaming.
CI/CD Operation:
CI and CD stand for continuous integration and continuous delivery/continuous deployment. In very simple terms, CI is a modern software development practice in which incremental code changes are made frequently and reliably.
Course Curriculum
Chapter 1: Introduction
Lecture 1: Introduction
Lecture 2: Prerequisite
Lecture 3: Importing Notebooks into Databricks workspace
Chapter 2: Introduction to anomaly detection
Lecture 1: Introduction
Lecture 2: Density based outlier detection
Lecture 3: Support vector machine
Lecture 4: Isolation Forest
Chapter 3: Anomaly detection-LAB
Lecture 1: Clustering based approach
Lecture 2: Angle based approach
Lecture 3: Feature bagging
Lecture 4: KNN
Lecture 5: Local outlier factor
Lecture 6: PCA based approach
Chapter 4: Data Lake house architecture
Lecture 1: Data lake
Lecture 2: Delta Lake
Lecture 3: Elements of delta lake
Lecture 4: Delta lake – LAB
Chapter 5: Build Data pipeline using Azure tech stack
Lecture 1: Architecture design overview
Lecture 2: Upload files to DBFS
Lecture 3: Streaming concepts
Lecture 4: Autoloader concepts
Lecture 5: Mounting DBFS File location
Lecture 6: Data walkthrough
Lecture 7: Autoloader LAB
Chapter 6: Explainable AI
Lecture 1: Anomaly detection using Isolation forest
Lecture 2: Model interpretation introduction
Lecture 3: Model interpretation using Shapley value
Lecture 4: Model interpretation using Shapley values-2
Chapter 7: Spark Structured streaming
Lecture 1: Structured streaming with Kafka- Theory
Lecture 2: Demo – Anonymous Wikipedia edits
Lecture 3: Demo – Log analysis using Kafka
Lecture 4: Demo – Twitter analysis using Kafka
Lecture 5: Spark structured streaming using Azure EventHub – TBA
Chapter 8: MLOPS using MLFlow in DataBricks
Lecture 1: Introduction
Lecture 2: MlFlow tracking demo
Lecture 3: Model registry
Lecture 4: Model registry demo
Lecture 5: Inference using MLFLOW
Lecture 6: Parallelly Training ML Models with pandas UDF
Lecture 7: How ApplyInPandas() function works
Lecture 8: Productionizing the ML model parallelly
Chapter 9: Anomaly detection on Time Series Data -TBA
Lecture 1: TBA
Chapter 10: Building CI_CD Pipeline using Azure Devops – TBA
Lecture 1: TBA
Chapter 11: Building Data Pipeline using Azure Data Factory
Lecture 1: Introduction
Lecture 2: Azure DataFactory Part1
Lecture 3: Azure DataFactory Part2
Lecture 4: Azure DataFactory Part3
Lecture 5: Azure DataFactory Part4
Chapter 12: Productionizing model using Azure Web App
Lecture 1: Deploy Flask application using Azure Webapp
Instructors
-
Data chef
Lead Data Scientist
Rating Distribution
- 1 stars: 8 votes
- 2 stars: 0 votes
- 3 stars: 5 votes
- 4 stars: 4 votes
- 5 stars: 8 votes
Frequently Asked Questions
How long do I have access to the course materials?
You can view and review the lecture materials indefinitely, like an on-demand channel.
Can I take my courses with me wherever I go?
Definitely! If you have an internet connection, courses on Udemy are available on any device at any time. If you don’t have an internet connection, some instructors also let their students download course lectures. That’s up to the instructor though, so make sure you get on their good side!
You may also like
- Digital Marketing Foundation Course
- Google Shopping Ads Digital Marketing Course
- Multi Cloud Infrastructure for beginners
- Master Lead Generation: Grow Subscribers & Sales with Popups
- Complete Copywriting System : write to sell with ease
- Product Positioning Masterclass: Unlock Market Traction
- How to Promote Your Webinar and Get More Attendees?
- Digital Marketing Courses
- Create music with Artificial Intelligence in this new market
- Create CONVERTING UGC Content So Brands Will Pay You More
- Podcast: The top 8 ways to monetize by Podcasting
- TikTok Marketing Mastery: Learn to Grow & Go Viral
- Free Digital Marketing Basics Course in Hindi
- MailChimp Free Mailing Lists: MailChimp Email Marketing
- Automate Digital Marketing & Social Media with Generative AI
- Google Ads MasterClass – All Advanced Features
- Online Course Creator: Create & Sell Online Courses Today!
- Introduction to SEO – Basic Principles of SEO
- Affiliate Marketing For Beginners: Go From Novice To Pro
- Effective Website Planning Made Simple