Learn Big Data Analysis with PySpark
Learn Big Data Analysis with PySpark, available at $54.99, has an average rating of 4.4, with 37 lectures, based on 16 reviews, and has 51 subscribers.
You will learn about Learn Most Important PySpark Features Understand Resilient Distributed Dataset Learn Most Important Python Commands and Libraries used for Data Analysis Import Big Data Files in PySpark Work Environment and Clean them Perform Data Analysis in PySpark using SQL Queries This course is ideal for individuals who are Those who have need to learn data analysis in PySpark or Those who need to use SQL on Big Data It is particularly useful for Those who have need to learn data analysis in PySpark or Those who need to use SQL on Big Data.
Enroll now: Learn Big Data Analysis with PySpark
Summary
Title: Learn Big Data Analysis with PySpark
Price: $54.99
Average Rating: 4.4
Number of Lectures: 37
Number of Published Lectures: 37
Number of Curriculum Items: 37
Number of Published Curriculum Objects: 37
Original Price: $19.99
Quality Status: approved
Status: Live
What You Will Learn
- Learn Most Important PySpark Features
- Understand Resilient Distributed Dataset
- Learn Most Important Python Commands and Libraries used for Data Analysis
- Import Big Data Files in PySpark Work Environment and Clean them
- Perform Data Analysis in PySpark using SQL Queries
Who Should Attend
- Those who have need to learn data analysis in PySpark
- Those who need to use SQL on Big Data
Target Audiences
- Those who have need to learn data analysis in PySpark
- Those who need to use SQL on Big Data
Apache Sparkis one of the most powerful tools used in big data analysis because:
It’sRun programs up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk.
· It can run real and semi-real time data analysis.
· It can handle large scaleof data.
· It can be run using simple code in Python programming language.
You can use the easy commands in Python and SQL languages, to run data analysis on big data that cannot or difficult to import inside relational database engines. This combination of Spark, Python and SQL create a powerful work environment to analyze big data easier and faster.
In this course, you will learn: What is Spark, how does it run, and how data are stored in Spark work environment. You will learn how to configure Python programming environment to run Spark code. Also, you will learn performing data analysis using real big data. In addition, you will learn to import big data filesinside Python. You will learn to clean and transform data for analysis purpose. You will learn conducting business analysis using several Spark functions. You will learn to create SQL queries inside PySpark to run data analysis. After that you will learn how to interpret the results from business perspective.
Course Curriculum
Chapter 1: Introduction
Lecture 1: Introduction to the Course
Lecture 2: What is PySpark
Lecture 3: Spark Features
Lecture 4: Resilient Distributed Dataset (RDD)
Chapter 2: Introduction to Python Development Environment
Lecture 1: Introduction to development platform (Colaboratory)
Lecture 2: Login to Colaboratory
Lecture 3: Create Lists in Python
Lecture 4: Create Tuples and Dictionaries in Python
Lecture 5: Create loops and functions in Python
Lecture 6: Introduction to Pandas Library in Python
Lecture 7: Import and use Pandas in Python
Lecture 8: Introduction to NumPy
Lecture 9: Import and use NumPy in Python
Lecture 10: Configure PySpark development environment to run Spark
Lecture 11: Course Rating
Chapter 3: Cleaning and Transforming Data in PySpark
Lecture 1: Create Resilient Distributed Dataset (RDD) in Spark
Lecture 2: Import data files in PySpark
Lecture 3: Show data in datasets
Lecture 4: Clean and transform data part_1 (replace nulls with values)
Lecture 5: Clean and transform data part_2 (remove null values)
Lecture 6: Clean and transform data part_3 (change data types)
Lecture 7: Clean and transform data part_4 (replace numeric nulls by the column mean)
Lecture 8: Clean and transform data part_5 (change columns names)
Lecture 9: Clean and transform data part_6 (change columns order in a dataset)
Lecture 10: Course Rating
Chapter 4: Performing Data Analysis in PySpark
Lecture 1: A reminder to upload data and run configuration code
Lecture 2: Data analysis part_1 (find sales volume per item using sum function)
Lecture 3: Data analysis part_2 (find number of orders per item using count function)
Lecture 4: Data analysis part_3 (find itemType data using aggregation functions)
Lecture 5: Data analysis part_4 (add a calculated column to datasets)
Lecture 6: Data analysis part_5 (find total profit per item_type and Sales_channel)
Lecture 7: Data analysis part_6 (find sales facts per country using conditions)
Lecture 8: Data analysis part_7 (find total profit per country)
Lecture 9: Data analysis part_8 (find countries generated total profit greater than 4 M)
Lecture 10: Run data analysis with SQL Part_1
Lecture 11: Run data analysis with SQL Part_2
Lecture 12: Course Rating
Instructors
-
Data Science Guide
Data Scientist & SQL Developer
Rating Distribution
- 1 stars: 0 votes
- 2 stars: 0 votes
- 3 stars: 3 votes
- 4 stars: 3 votes
- 5 stars: 10 votes
Frequently Asked Questions
How long do I have access to the course materials?
You can view and review the lecture materials indefinitely, like an on-demand channel.
Can I take my courses with me wherever I go?
Definitely! If you have an internet connection, courses on Udemy are available on any device at any time. If you don’t have an internet connection, some instructors also let their students download course lectures. That’s up to the instructor though, so make sure you get on their good side!
You may also like
- Top 10 Video Editing Courses to Learn in November 2024
- Top 10 Music Production Courses to Learn in November 2024
- Top 10 Animation Courses to Learn in November 2024
- Top 10 Digital Illustration Courses to Learn in November 2024
- Top 10 Renewable Energy Courses to Learn in November 2024
- Top 10 Sustainable Living Courses to Learn in November 2024
- Top 10 Ethical AI Courses to Learn in November 2024
- Top 10 Cybersecurity Fundamentals Courses to Learn in November 2024
- Top 10 Smart Home Technology Courses to Learn in November 2024
- Top 10 Holistic Health Courses to Learn in November 2024
- Top 10 Nutrition And Diet Planning Courses to Learn in November 2024
- Top 10 Yoga Instruction Courses to Learn in November 2024
- Top 10 Stress Management Courses to Learn in November 2024
- Top 10 Mindfulness Meditation Courses to Learn in November 2024
- Top 10 Life Coaching Courses to Learn in November 2024
- Top 10 Career Development Courses to Learn in November 2024
- Top 10 Relationship Building Courses to Learn in November 2024
- Top 10 Parenting Skills Courses to Learn in November 2024
- Top 10 Home Improvement Courses to Learn in November 2024
- Top 10 Gardening Courses to Learn in November 2024