Spark Scala coding framework, testing, Structured streaming
Spark Scala coding framework, testing, Structured streaming, available at $79.99, has an average rating of 4.1, with 57 lectures, based on 262 reviews, and has 3901 subscribers.
You will learn about Spark Scala industry standard coding practices – Logging, Exception Handling, Reading from Configuration File Unit Testing Spark Scala using JUnit , ScalaTest, FlatSpec & Assertion Building a data pipeline using Hive, Spark and PostgreSQL Spark Scala development with Intellij, Maven Cloudera QuickStart VM setup on GCP This course is ideal for individuals who are Students looking at moving from Big Data Spark academic background to a real world developer role It is particularly useful for Students looking at moving from Big Data Spark academic background to a real world developer role.
Enroll now: Spark Scala coding framework, testing, Structured streaming
Summary
Title: Spark Scala coding framework, testing, Structured streaming
Price: $79.99
Average Rating: 4.1
Number of Lectures: 57
Number of Published Lectures: 57
Number of Curriculum Items: 57
Number of Published Curriculum Objects: 57
Original Price: $19.99
Quality Status: approved
Status: Live
What You Will Learn
- Spark Scala industry standard coding practices – Logging, Exception Handling, Reading from Configuration File
- Unit Testing Spark Scala using JUnit , ScalaTest, FlatSpec & Assertion
- Building a data pipeline using Hive, Spark and PostgreSQL
- Spark Scala development with Intellij, Maven
- Cloudera QuickStart VM setup on GCP
Who Should Attend
- Students looking at moving from Big Data Spark academic background to a real world developer role
Target Audiences
- Students looking at moving from Big Data Spark academic background to a real world developer role
This course will bridge the gap between your academic and real world knowledge and prepare you for an entry level Big Data Spark Scala developer role. You will learn the following
-
Spark Scala coding best practices
-
Logging – log4j, slf4
-
Exception Handling
-
Configuration using Typesafe config
-
Doing development work using IntelliJ, Maven
-
Using your local environment as a Hadoop Hive environment
-
Reading and writing to a Postgres database using Spark
-
Unit Testing Spark Scala using JUnit , ScalaTest, FlatSpec & Assertion
-
Building a data pipeline using Hadoop , Spark and Postgres
-
Bonus – Setting up Cloudera QuickStart VM on Google Cloud Platform (GCP)
-
Structured Streaming
Prerequisites :
-
Basic programming skills
-
Basic database knowledge
-
Big Data and Spark entry level knowledge
Course Curriculum
Chapter 1: Introduction
Lecture 1: Introduction
Lecture 2: What is Big Data Spark?
Lecture 3: Big Data Hadoop concepts and hands-on labs for beginners
Chapter 2: Environment Setup & Spark Scala basics
Lecture 1: Installing JDK 11 on a Windows Machine
Lecture 2: Installing IntelliJ and Winutils for Spark Scala Hive programming on Windows
Lecture 3: Scala Basics
Lecture 4: For Mac users – Installing JDK and IntelliJ and Spark Scala Hive Hello World
Lecture 5: Installing PostgreSQL
Lecture 6: psql command line interface for PostgreSQL
Lecture 7: Fetching PostgresSQL data to a Spark DataFrame
Lecture 8: Importing a project into IntelliJ
Chapter 3: Coding Best Practices
Lecture 1: Organizing code with Objects and Methods
Lecture 2: Implementing Log4j SLf4j Logging
Lecture 3: Exception Handling with try, catch, Option, Some and None
Chapter 4: A Data Pipeline with Hive, Spark and Postgres
Lecture 1: Reading from Hive and Writing to Postgres
Lecture 2: Reading Configuration from JSON using Typesafe
Lecture 3: Reading command-line arguments and debugging in InjtelliJ
Lecture 4: Writing data to a Hive Table
Lecture 5: Managing input parameters using a Scala Case Class
Lecture 6: Intellij Maven troubleshooting tips
Chapter 5: Spark Scala Unit Testing using ScalaTest
Lecture 1: Scala Unit Testing using JUnit & ScalaTest
Lecture 2: Spark Transformation unit testing using ScalaTest
Lecture 3: Unit testing to catch an Exception
Lecture 4: Catching Exception using assertThrows
Lecture 5: Throwing Custom Error and Intercepting Error Message
Lecture 6: Testing with assertResult
Lecture 7: Testing with Matchers
Lecture 8: Failing tests intentionally
Lecture 9: Sharing fixtures
Chapter 6: Running the application on Cloudera QuickStart VM on GCP
Lecture 1: Exporting the project to an uber jar
Lecture 2: Signing up for GCP free trial
Lecture 3: Cloudera QuickStart VM Installation on GCP
Lecture 4: Running Spark 2 with Hive on Cloudera QuickStart VM
Lecture 5: Uber Jar spark-submit on Cloudera QuickStart VM
Lecture 6: Doing spark submit locally
Chapter 7: Spark Scala – Structured Streaming
Lecture 1: Structured Streaming concepts
Lecture 2: Streaming data from files
Lecture 3: Batch Vs Streaming code
Lecture 4: Writing streaming data to a Hive table
Lecture 5: Streaming Aggregation
Lecture 6: Filtering Stream
Lecture 7: Adding timestamp to streaming data
Lecture 8: Aggregation in a time window
Lecture 9: Tumbling window and Sliding window
Lecture 10: PySpark coding framework course preview
Lecture 11: Congratulations & Thank You
Chapter 8: Appendix – Big Data Hadoop Hive for beginners
Lecture 1: Big Data concepts
Lecture 2: Hadoop concepts
Lecture 3: Hadoop Distributed File System (HDFS)
Lecture 4: Understanding Google Cloud (GCP) Dataproc
Lecture 5: Signing up for a Google Cloud free trial
Lecture 6: Storing a file in HDFS
Lecture 7: MapReduce and YARN
Lecture 8: Hive
Lecture 9: Querying HDFS data using Hive
Lecture 10: Deleting the Cluster
Lecture 11: Analyzing a billion records with Hive
Instructors
-
FutureX Skills
Empowering Data Engineers and Data Scientists
Rating Distribution
- 1 stars: 6 votes
- 2 stars: 7 votes
- 3 stars: 34 votes
- 4 stars: 92 votes
- 5 stars: 123 votes
Frequently Asked Questions
How long do I have access to the course materials?
You can view and review the lecture materials indefinitely, like an on-demand channel.
Can I take my courses with me wherever I go?
Definitely! If you have an internet connection, courses on Udemy are available on any device at any time. If you don’t have an internet connection, some instructors also let their students download course lectures. That’s up to the instructor though, so make sure you get on their good side!
You may also like
- Top 10 Video Editing Courses to Learn in November 2024
- Top 10 Music Production Courses to Learn in November 2024
- Top 10 Animation Courses to Learn in November 2024
- Top 10 Digital Illustration Courses to Learn in November 2024
- Top 10 Renewable Energy Courses to Learn in November 2024
- Top 10 Sustainable Living Courses to Learn in November 2024
- Top 10 Ethical AI Courses to Learn in November 2024
- Top 10 Cybersecurity Fundamentals Courses to Learn in November 2024
- Top 10 Smart Home Technology Courses to Learn in November 2024
- Top 10 Holistic Health Courses to Learn in November 2024
- Top 10 Nutrition And Diet Planning Courses to Learn in November 2024
- Top 10 Yoga Instruction Courses to Learn in November 2024
- Top 10 Stress Management Courses to Learn in November 2024
- Top 10 Mindfulness Meditation Courses to Learn in November 2024
- Top 10 Life Coaching Courses to Learn in November 2024
- Top 10 Career Development Courses to Learn in November 2024
- Top 10 Relationship Building Courses to Learn in November 2024
- Top 10 Parenting Skills Courses to Learn in November 2024
- Top 10 Home Improvement Courses to Learn in November 2024
- Top 10 Gardening Courses to Learn in November 2024