Hadoop Starter Kit
Hadoop learning made easy and fun. Learn HDFS, MapReduce and introduction to Pig and Hive with FREE cluster access.
Created by Hadoop In Real World - Expert Big Data Consultants
Students: 167147, Price: Free
The objective of this course is to walk you through step by step of all the core components in Hadoop but more importantly make Hadoop learning experience easy and fun.
By enrolling in this course you can also get free access to our multi-node Hadoop training cluster so you can try out what you learn right away in a real multi-node distributed environment.
We are a group of Hadoop consultants who are passionate about Hadoop and Big Data technologies. 4 years ago when we were looking for Big Data consultants to work in our own projects we did not find qualified candidates because the big data industry was very new and hence we set out to train qualified candidates in Big Data ourselves giving them a deep and real world insights in to Hadoop.
WHAT YOU WILL LEARN IN THIS COURSE
In the first section you will learn about what is big data with examples. We will discuss the factors to consider when considering whether a problem is big data problem or not. We will talk about the challenges with existing technologies when it comes to big data computation. We will breakdown the Big Data problem in terms of storage and computation and understand how Hadoop approaches the problem and provide a solution to the problem.
In the HDFS, section you will learn about the need for another file system like HDFS. We will compare HDFS with traditional file systems and its benefits. We will also work with HDFS and discuss the architecture of HDFS.
In the MapReduce section you will learn about the basics of MapReduce and phases involved in MapReduce. We will go over each phase in detail and understand what happens in each phase. Then we will write a MapReduce program in Java to calculate the maximum closing price for stock symbols from a stock dataset.
In the next two sections, we will introduce you to Apache Pig & Hive. We will try to calculate the maximum closing price for stock symbols from a stock dataset using Pig and Hive.
Big Data and Hadoop Essentials
Essential Knowledge for everyone associated with Big Data & Hadoop
Created by Nitesh Jain - Hadoop and Data Analytics Instructor
Students: 158982, Price: Free
Are you interested in the world of Big data technologies, but find it a little cryptic and see the whole thing as a big puzzle.
Are you looking to understand how Big Data impact large and small business and people like you and me?
Do you feel many people talk about Big Data and Hadoop, and even do not know the basics like history of Hadoop, major players and vendors of Hadoop. Then this is the course just for you!
This course builds a essential fundamental understanding of Big Data problems and Hadoop as a solution. This course takes you through:
- Understanding of Big Data problems with easy to understand examples.
- History and advent of Hadoop right from when Hadoop wasn’t even named Hadoop.
- What is Hadoop Magic which makes it so unique and powerful.
- Understanding the difference between Data science and data engineering, which is one of the big confusions in selecting a carrier or understanding a job role.
- And most importantly, demystifying Hadoop vendors like Cloudera, MapR and Hortonworks by understanding about them.
Unlock the world of Big Data!!!
Best part is that this course is Free of cost!!! (Best things is life are free :))
Spark Starter Kit
NOT another "What is Spark?" course ! Explore Spark in depth and get a strong foundation in Spark.
Created by Hadoop In Real World - Expert Big Data Consultants
Students: 51350, Price: Free
When our students asked us to create a course on Spark, we looked at other Spark related courses in the market and also what are some of the common questions students are asking in websites like stackoverflow and other forums when they try to learn Spark and we saw a recurring theme.
Most courses and other online help including Spark's documentation is not good in helping students understand the foundational concepts. They explain what is Spark, what is RDD, what is "this" and what is "that" but students were most interested in understanding core fundamentals and more importantly answer questions like -
- Why do we need Spark when we have Hadoop ?
- What is the need for RDD ?
- How Spark is faster than Hadoop?
- How Spark achieves the speed and efficiency it claims ?
- How does memory gets managed in Spark?
- How fault tolerance work in Spark ?
and that is exactly what you will learn in this free Spark Starter Kit course. The aim of this course is to give you a strong foundation in Spark.
Introduction to Hadoop basics in 30 mins
Basic Hadoop concepts before you start learning Azure HDInsight
Created by Eshant Garg | LearnCloud.Info | 80,000+ Enrollments - Udemy Instructor | LearnCloud.Info | AWS | Azure
Students: 4175, Price: Free
Please note that this is NOT a full course but a single module of the full-length course, and intended to cover very basic fundamental concepts for absolute beginners so that they can speed up with Azure Synapse SQL Data Warehouse course.
This module is NOT GOOD for you if:
You are already experienced in this technology
You are looking for an intermediate or advance concepts
You are looking for practical examples or demo
This module is GOOD for you if:
You want to understand the basic fundamental concepts of this technology.
This is a free module to help others. If you are not in the intended audience, I request you to please feel free to unenroll.
Where I can find a full-length course?
Please look at the bonus lecture in the end.
What will students learn in this course?
Hadoop basic concepts (Crash course to speed up with Azure HDInsight)
If you are not comfortable in English, please do not take course, captions are not good enough to understand course.
Database and BI developers
Anyone who wants to start learning Big Data
Basic Database concepts
Course In Detail
Data Warehouse Crash Course
In this module, you will learn, what was shortcomings of our traditional Monolithic systems.
you will learn how Distributed system is different from Monolithic systems
you will learn about Hadoop fundamental understanding and how it is different from RDBMS
you will learn 3 main building blocks or components of Hadoop, like the HDFS or Hadoop Distributed File System, the MapReduce programming model for processing and the resource negotiator YARN for cluster management.
Microsoft SQL Server, Azure SQL Server, Azure SQL Data Warehouse, Data Factory, Data Lake, Azure Storage, Azure Synapse Analytics Service, PolyBase, Azure monitoring, Azure Security, Data Warehouse, SSIS
Cloudera Hadoop |Big Data | Authentication With Kerberos
Hadoop Administrator | Cloudera | Cloudera Hadoop Secure Cluster | Kerberos Authentication | MIT Kerberos
Created by Imran Chaush - Hadoop Administrator
Students: 2253, Price: Free
Cloudera Hadoop | Big Data | Secure Cloudera Manager With Kerberos Authentication
You will Learn in This course.
1:- Hadoop 2 Prerequisites.
2:- Cloudera Manager Deployment.
3:- Add New Node To Cloudera Cluster.
4:- Kerberos Authentication Steps.
5:- Secure Cloudera Cluster
I have demonstrated that hadoop2 pre-requisites and Cloudera manager installation after installation enabling it Kerberos authentication on Cloudera manager and check one job on the cluster and check Kerberos is working or not. also, show how to create ec2 instance then creating an image of ec2 instance, spot instance on-demand instance then if you want to secure your Hadoop environment you will learn that in this course.