From 0 to 1: Hive for Processing Big Data
From 0 to 1: Hive for Processing Big Data, available at $84.99, has an average rating of 4.53, with 87 lectures, based on 894 reviews, and has 7761 subscribers.
You will learn about Write complex analytical queries on data in Hive and uncover insights Leverage ideas of partitioning, bucketing to optimize queries in Hive Customize hive with user defined functions in Java and Python Understand what goes on under the hood of Hive with HDFS and MapReduce This course is ideal for individuals who are Yep! Analysts who want to write complex analytical queries on large scale data or Yep! Engineers who want to know more about managing Hive as their data warehousing solution It is particularly useful for Yep! Analysts who want to write complex analytical queries on large scale data or Yep! Engineers who want to know more about managing Hive as their data warehousing solution.
Enroll now: From 0 to 1: Hive for Processing Big Data
Summary
Title: From 0 to 1: Hive for Processing Big Data
Price: $84.99
Average Rating: 4.53
Number of Lectures: 87
Number of Published Lectures: 87
Number of Curriculum Items: 87
Number of Published Curriculum Objects: 87
Original Price: $89.99
Quality Status: approved
Status: Live
What You Will Learn
- Write complex analytical queries on data in Hive and uncover insights
- Leverage ideas of partitioning, bucketing to optimize queries in Hive
- Customize hive with user defined functions in Java and Python
- Understand what goes on under the hood of Hive with HDFS and MapReduce
Who Should Attend
- Yep! Analysts who want to write complex analytical queries on large scale data
- Yep! Engineers who want to know more about managing Hive as their data warehousing solution
Target Audiences
- Yep! Analysts who want to write complex analytical queries on large scale data
- Yep! Engineers who want to know more about managing Hive as their data warehousing solution
Prerequisites:Hive requires knowledge of SQL. The course includes and SQL primer at the end. Please do that first if you don’t know SQL. You’ll need to know Java if you want to follow the sections on custom functions.
Taught by a 4 person team including 2 Stanford-educated, ex-Googlers and 2 ex-Flipkart Lead Analysts. This team has decades of practical experience in working with large-scale data.
Hive is like a new friend with an old face (SQL). This course is an end-to-end, practical guide to using Hive for Big Data processing.
Let’s parse that
A new friend with an old face:Hive helps you leverage the power of Distributed computing and Hadoop for Analytical processing. It’s interface is like an old friend : the very SQL like HiveQL. This course will fill in all the gaps between SQL and what you need to use Hive.
End-to-End:The course is an end-to-end guide for using Hive: whether you are analyst who wants to process data or an Engineer who needs to build custom functionality or optimize performance – everything you’ll need is right here. New to SQL? No need to look elsewhere. The course has a primer on all the basic SQL constructs, .
Practical:Everything is taught using real-life examples, working queries and code .
What’s Covered:
Analytical Processing: Joins, Subqueries, Views, Table Generating Functions, Explode, Lateral View, Windowing and more
Tuning Hive for better functionality:Partitioning, Bucketing, Join Optimizations, Map Side Joins, Indexes, Writing custom User Defined functions in Java. UDF, UDAF, GenericUDF, GenericUDTF, Custom functions in Python, Implementation of MapReduce for Select, Group by and Join
For SQL Newbies: SQL In Great Depth
Course Curriculum
Chapter 1: You, Us & This Course
Lecture 1: You, Us & This Course
Chapter 2: Introducing Hive
Lecture 1: Hive: An Open-Source Data Warehouse
Lecture 2: Hive and Hadoop
Lecture 3: Hive vs Traditional Relational DBMS
Lecture 4: HiveQL and SQL
Chapter 3: Hadoop and Hive Install
Lecture 1: Hadoop Install Modes
Lecture 2: Hadoop Install Step 1 : Standalone Mode
Lecture 3: Hadoop Install Step 2 : Pseudo-Distributed Mode
Lecture 4: Hive install
Lecture 5: Code-Along: Getting started
Chapter 4: Hadoop and HDFS Overview
Lecture 1: What is Hadoop?
Lecture 2: HDFS or the Hadoop Distributed File System
Chapter 5: Hive Basics
Lecture 1: Primitive Datatypes
Lecture 2: Collections_Arrays_Maps
Lecture 3: Structs and Unions
Lecture 4: Create Table
Lecture 5: Insert Into Table
Lecture 6: Insert into Table 2
Lecture 7: Alter Table
Lecture 8: HDFS
Lecture 9: HDFS CLI – Interacting with HDFS
Lecture 10: Code-Along: Create Table
Lecture 11: Code-Along : Hive CLI
Chapter 6: Built-in Functions
Lecture 1: Three types of Hive functions
Lecture 2: The Case-When statement, the Size function, the Cast function
Lecture 3: The Explode function
Lecture 4: Code-Along : Hive Built – in functions
Chapter 7: Sub-Queries
Lecture 1: Quirky Sub-Queries
Lecture 2: More on subqueries: Exists and In
Lecture 3: Inserting via subqueries
Lecture 4: Code-Along : Use Subqueries to work with Collection Datatypes
Lecture 5: Views
Chapter 8: Partitioning
Lecture 1: Indices
Lecture 2: Partitioning Introduced
Lecture 3: The Rationale for Partitioning
Lecture 4: How Tables are Partitioned
Lecture 5: Using Partitioned Tables
Lecture 6: Dynamic Partitioning: Inserting data into partitioned tables
Lecture 7: Code-Along : Partitioning
Chapter 9: Bucketing
Lecture 1: Introducing Bucketing
Lecture 2: The Advantages of Bucketing
Lecture 3: How Tables are Bucketed
Lecture 4: Using Bucketed Tables
Lecture 5: Sampling
Chapter 10: Windowing
Lecture 1: Windowing Introduced
Lecture 2: Windowing – A Simple Example: Cumulative Sum
Lecture 3: Windowing – A More Involved Example: Partitioning
Lecture 4: Windowing – Special Aggregation Functions
Chapter 11: Understanding MapReduce
Lecture 1: The basic philosophy underlying MapReduce
Lecture 2: MapReduce – Visualized and Explained
Lecture 3: MapReduce – Digging a little deeper at every step
Chapter 12: MapReduce logic for queries: Behind the scenes
Lecture 1: MapReduce Overview: Basic Select-From-Where
Lecture 2: MapReduce Overview: Group-By and Having
Lecture 3: MapReduce Overview: Joins
Chapter 13: Join Optimizations in Hive
Lecture 1: Improving Join performance with tables of different sizes
Lecture 2: The Where clause in Joins
Lecture 3: The Left Semi Join
Lecture 4: Map Side Joins: The Inner Join
Lecture 5: Map Side Joins: The Left, Right and Full Outer Joins
Lecture 6: Map Side Joins: The Bucketed Map Join and the Sorted Merge Join
Chapter 14: Custom Functions in Python
Lecture 1: Custom functions in Python
Lecture 2: Code-Along : Custom Function in Python
Chapter 15: Custom functions in Java
Lecture 1: Introducing UDFs – you're not limited by what Hive offers
Lecture 2: The Simple UDF: The standard function for primitive types
Lecture 3: The Simple UDF: Java implementation for replacetext()
Lecture 4: Generic UDFs, the Object Inspector and DeferredObjects
Lecture 5: The Generic UDF: Java implementation for containsstring()
Lecture 6: The UDAF: Custom aggregate functions can get pretty complex
Lecture 7: The UDAF: Java implementation for max()
Lecture 8: The UDAF: Java implementation for Standard Deviation
Lecture 9: The Generic UDTF: Custom table generating functions
Lecture 10: The Generic UDTF: Java implementation for namesplit()
Chapter 16: SQL Primer – Select Statemets
Lecture 1: Select Statements
Lecture 2: Select Statements 2
Lecture 3: Operator Functions
Chapter 17: SQL Primer – Group By, Order By and Having
Lecture 1: Aggregation Operators Introduced
Lecture 2: The Group By Clause
Lecture 3: More Group By Examples
Lecture 4: Order By
Lecture 5: Having
Chapter 18: SQL Primer – Joins
Lecture 1: Introduction to SQL Joins
Lecture 2: Cross Joins aka Cartesian Joins
Instructors
-
Loony Corn
An ex-Google, Stanford and Flipkart team
Rating Distribution
- 1 stars: 31 votes
- 2 stars: 33 votes
- 3 stars: 113 votes
- 4 stars: 326 votes
- 5 stars: 391 votes
Frequently Asked Questions
How long do I have access to the course materials?
You can view and review the lecture materials indefinitely, like an on-demand channel.
Can I take my courses with me wherever I go?
Definitely! If you have an internet connection, courses on Udemy are available on any device at any time. If you don’t have an internet connection, some instructors also let their students download course lectures. That’s up to the instructor though, so make sure you get on their good side!
You may also like
- Top 10 Video Editing Courses to Learn in November 2024
- Top 10 Music Production Courses to Learn in November 2024
- Top 10 Animation Courses to Learn in November 2024
- Top 10 Digital Illustration Courses to Learn in November 2024
- Top 10 Renewable Energy Courses to Learn in November 2024
- Top 10 Sustainable Living Courses to Learn in November 2024
- Top 10 Ethical AI Courses to Learn in November 2024
- Top 10 Cybersecurity Fundamentals Courses to Learn in November 2024
- Top 10 Smart Home Technology Courses to Learn in November 2024
- Top 10 Holistic Health Courses to Learn in November 2024
- Top 10 Nutrition And Diet Planning Courses to Learn in November 2024
- Top 10 Yoga Instruction Courses to Learn in November 2024
- Top 10 Stress Management Courses to Learn in November 2024
- Top 10 Mindfulness Meditation Courses to Learn in November 2024
- Top 10 Life Coaching Courses to Learn in November 2024
- Top 10 Career Development Courses to Learn in November 2024
- Top 10 Relationship Building Courses to Learn in November 2024
- Top 10 Parenting Skills Courses to Learn in November 2024
- Top 10 Home Improvement Courses to Learn in November 2024
- Top 10 Gardening Courses to Learn in November 2024