Real World Hadoop – Deploying Hadoop with Cloudera Manager
Real World Hadoop – Deploying Hadoop with Cloudera Manager, available at $19.99, has an average rating of 3.95, with 24 lectures, based on 50 reviews, and has 359 subscribers.
You will learn about Able to see Cloudera Manager at work installing a distributed Hadoop cluster easily Acquire the concepts in which to split the various Hadoop services across cluster nodes. Get a picture as to how one can operate a Hadoop cluster in Production. This course is ideal for individuals who are Software engineers who want to expand their skills into the world of distributed computing or System Engineers that want to expand their skillsets beyond the single Hadoop server or Developers who want to write/test their Hadoop code against a centralized, distributed Hadoop enviroment It is particularly useful for Software engineers who want to expand their skills into the world of distributed computing or System Engineers that want to expand their skillsets beyond the single Hadoop server or Developers who want to write/test their Hadoop code against a centralized, distributed Hadoop enviroment.
Enroll now: Real World Hadoop – Deploying Hadoop with Cloudera Manager
Summary
Title: Real World Hadoop – Deploying Hadoop with Cloudera Manager
Price: $19.99
Average Rating: 3.95
Number of Lectures: 24
Number of Published Lectures: 24
Number of Curriculum Items: 24
Number of Published Curriculum Objects: 24
Original Price: $44.99
Quality Status: approved
Status: Live
What You Will Learn
- Able to see Cloudera Manager at work installing a distributed Hadoop cluster easily
- Acquire the concepts in which to split the various Hadoop services across cluster nodes.
- Get a picture as to how one can operate a Hadoop cluster in Production.
Who Should Attend
- Software engineers who want to expand their skills into the world of distributed computing
- System Engineers that want to expand their skillsets beyond the single Hadoop server
- Developers who want to write/test their Hadoop code against a centralized, distributed Hadoop enviroment
Target Audiences
- Software engineers who want to expand their skills into the world of distributed computing
- System Engineers that want to expand their skillsets beyond the single Hadoop server
- Developers who want to write/test their Hadoop code against a centralized, distributed Hadoop enviroment
If you already have a running Cloudera Manager installation this course follows on with the logic behind the placement of the Hadoop master/slave daemons across your cluster. We actually go ahead and discuss the placement and perform the installation of Hadoop.
If you do not have a Cloudera Manager installation and you want to follow along hands on, you can complete the course : “Real World Vagrant – Automate a Cloudera Manager Build – Toyin Akin” beforehand.
“Big Data” technology is a hot and highly valuable skill to have – and this
course will teach you how to quickly deploy a Hadoop Cluster using the Cloudera stack.
Cloudera allows you to download a QuickStart Virtual machine which is great for developers, but this is of no use for the Operations team to start the planning and the building out of DEV / UAT and PROD environments within their organizations. What assumptions were made when the QuickStart VM was put together?
In addition, hosting all of Cloudera’s processes as well as Hadoop’s processes on one VM is not a model that any large organization can or should follow. The Hadoop services need to be split out across multiple VMs/Servers. In fact that’s the whole point out Hadoop!
DistributedData and DistributedCompute.
After all, if you are developing against or operating a distributed
environment, it needs to be tested. Tested in terms of the forcing various failure modes within the cluster and ensuing that the cluster can still respond to user requests. Killing the QuickStart VM destroys the entire cluster!
You’ll learn the same techniques these large enterprise guys use to move to the next step in building out an enterprise grade Hadoop cluster.
If you are a developer, the operations team can build out that centralized cluster in which you are truly testing against a distributed cluster. Testing code against the Quickstart VM may work, but as any experienced distributed developer knows, verifying code against a pseudo cluster on a single machine is different than verifying against code against a truly distributed cluster.
As an example bottlenecks in Networks or CPU cycles will come to light. In addition, this will also assist in capacity planing of the UAT / PROD cluster as initial metrics can be acquired.
If you are in operationsthen this gives the operations team an environment for the team to start learning how to jointly operate the cluster. Here the team can start to understand cluster metrics, adding/removing cluster nodes, managing the various Hadoop services (Zookeeper, HDFS, YARN and Spark) and a lot more. We also look at managing Cloudera Hadoop Parcels as well as changing Hadoop versions once a cluster is deployed.
The operation team can start to develop procedures and change management documentation ready for Production operation of a Hadoop cluster.
.
Here I present a curriculum as to the current state of my Cloudera courses.
My Hadoop courses are based on Vagrant so that you can practice and
destroy your virtual environment before applying the installation onto
real servers/VMs.
.
For those with little or no knowledge of the Hadoop eco system
Udemy course : Big Data Intro for IT Administrators, Devs and Consultants
.
I would first practice with Vagrant so that you can carve out a
virtual environment on your local desktop. You don’t want to corrupt
your physical servers if you do not understand the steps or make a
mistake.
Udemy course : Real World Vagrant For Distributed Computing
.
I would then, on the virtual servers, deploy Cloudera Manager plus
agents. Agents are the guys that will sit on all the slave nodes ready
to deploy your Hadoop services
Udemy course : Real World Vagrant – Automate a Cloudera Manager Build
.
Then deploy the Hadoop services across your cluster (via the
installed Cloudera Manager in the previous step or your own Cloudera Manager installation). We look at the logic
regarding the placement of master and slave services.
Udemy course : Real World Hadoop – Deploying Hadoop with Cloudera Manager
.
If you want to play around with HDFS commands (Hands on distributed file manipulation).
Udemy course : Real World Hadoop – Hands on Enterprise Distributed Storage.
.
You can also automate the deployment of the Hadoop services via
Python (using the Cloudera Manager Python API). But this is an advanced
step and thus I would make sure that you understand how to manually
deploy the Hadoop services first.
Udemy course : Real World Hadoop – Automating Hadoop install with Python!
.
There is also the upgrade step. Once you have a running cluster, how
do you upgrade to a newer hadoop cluster (Both for Cloudera Manager and
the Hadoop Services).
Udemy course : Real World Hadoop – Upgrade Cloudera and Hadoop hands on
Course Curriculum
Chapter 1: Introduction
Lecture 1: QuickStart VM vs Development Environment
Lecture 2: Suggested course curriculum to follow …
Lecture 3: If you want to follow along hands on …
Lecture 4: Development Topology I
Lecture 5: Development Topology II
Chapter 2: Setup – Hadoop Cluster
Lecture 1: Selecting Server Nodes
Lecture 2: Setting up Parcel location
Lecture 3: Hadoop Services
Lecture 4: HDFS – process placement and overview
Lecture 5: YARN – process placement and overview
Chapter 3: Setup – Cloudera Manager Services
Lecture 1: Cloudera Manager Services Summary
Lecture 2: Database selection
Chapter 4: Hadoop Cluster Navigation
Lecture 1: Starting the Hadoop Cluster
Lecture 2: Hadoop Cluster Services Navigation
Lecture 3: Cluster Charts
Lecture 4: Distributed Logging
Chapter 5: Switching to a different version of Hadoop
Lecture 1: Parcel manipulation
Lecture 2: Restart cluster with changed Hadoop version
Lecture 3: Custom Services
Chapter 6: Cluster Operations
Lecture 1: Cluster Events
Lecture 2: People cost of Operating a Cloudera stack.
Lecture 3: Cluster Configuration Elements
Lecture 4: Adding a new Cluster Node
Chapter 7: Conclusion
Lecture 1: Summary
Instructors
-
Toyin Akin
Big Data Engineer, Capital Markets FinTech Developer
Rating Distribution
- 1 stars: 6 votes
- 2 stars: 2 votes
- 3 stars: 8 votes
- 4 stars: 21 votes
- 5 stars: 13 votes
Frequently Asked Questions
How long do I have access to the course materials?
You can view and review the lecture materials indefinitely, like an on-demand channel.
Can I take my courses with me wherever I go?
Definitely! If you have an internet connection, courses on Udemy are available on any device at any time. If you don’t have an internet connection, some instructors also let their students download course lectures. That’s up to the instructor though, so make sure you get on their good side!
You may also like
- Digital Marketing Foundation Course
- Google Shopping Ads Digital Marketing Course
- Multi Cloud Infrastructure for beginners
- Master Lead Generation: Grow Subscribers & Sales with Popups
- Complete Copywriting System : write to sell with ease
- Product Positioning Masterclass: Unlock Market Traction
- How to Promote Your Webinar and Get More Attendees?
- Digital Marketing Courses
- Create music with Artificial Intelligence in this new market
- Create CONVERTING UGC Content So Brands Will Pay You More
- Podcast: The top 8 ways to monetize by Podcasting
- TikTok Marketing Mastery: Learn to Grow & Go Viral
- Free Digital Marketing Basics Course in Hindi
- MailChimp Free Mailing Lists: MailChimp Email Marketing
- Automate Digital Marketing & Social Media with Generative AI
- Google Ads MasterClass – All Advanced Features
- Online Course Creator: Create & Sell Online Courses Today!
- Introduction to SEO – Basic Principles of SEO
- Affiliate Marketing For Beginners: Go From Novice To Pro
- Effective Website Planning Made Simple