Best Free Big Data Courses

Find the best online Free Big Data Courses for you. The courses are sorted based on popularity and user ratings. We do not allow paid placements in any of our rankings.

Hadoop Starter Kit

Hadoop learning made easy and fun. Learn HDFS, MapReduce and an introduction to Pig and Hive with FREE cluster access.

Created by Hadoop In Real World - Expert Big Data Consultants

Students: 167147, Price: Free

The objective of this course is to walk you step by step through all the core components of Hadoop and, more importantly, to make learning Hadoop easy and fun.

By enrolling in this course you can also get free access to our multi-node Hadoop training cluster so you can try out what you learn right away in a real multi-node distributed environment.

ABOUT INSTRUCTOR(S)

We are a group of Hadoop consultants who are passionate about Hadoop and Big Data technologies. Four years ago, when we were looking for Big Data consultants to work on our own projects, we did not find qualified candidates because the big data industry was very new. So we set out to train qualified candidates in Big Data ourselves, giving them deep, real-world insight into Hadoop.

WHAT YOU WILL LEARN IN THIS COURSE

In the first section you will learn what big data is, with examples. We will discuss the factors to weigh when deciding whether a problem is a big data problem or not. We will talk about the challenges existing technologies face when it comes to big data computation. We will break down the Big Data problem in terms of storage and computation and see how Hadoop approaches the problem and provides a solution.

In the HDFS section you will learn why another file system like HDFS is needed. We will compare HDFS with traditional file systems and discuss its benefits. We will also work with HDFS and discuss its architecture.

In the MapReduce section you will learn the basics of MapReduce and the phases involved in it. We will go over each phase in detail and understand what happens in each one. Then we will write a MapReduce program in Java to calculate the maximum closing price for stock symbols from a stock dataset.
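
As a rough illustration of the exercise described above (this is not the course's Java program), the Python sketch below simulates the map, shuffle, and reduce phases in memory for the maximum-closing-price problem. The sample records, stock symbols, and function names are hypothetical and chosen only for the example; a real Hadoop job would read the stock dataset from HDFS and run the phases across the cluster.

```python
from collections import defaultdict

# Hypothetical sample records of (symbol, closing_price); not the course dataset.
RECORDS = [
    ("AAA", 10.5),
    ("BBB", 20.0),
    ("AAA", 12.25),
    ("BBB", 18.75),
    ("CCC", 7.0),
]


def map_phase(records):
    """Map: emit one (symbol, close) key-value pair per input record."""
    for symbol, close in records:
        yield symbol, close


def shuffle_phase(pairs):
    """Shuffle: group values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Reduce: collapse each symbol's list of closing prices to its maximum."""
    return {symbol: max(closes) for symbol, closes in grouped.items()}


def max_close_by_symbol(records):
    """Chain the three phases over an in-memory list of records."""
    return reduce_phase(shuffle_phase(map_phase(records)))


result = max_close_by_symbol(RECORDS)
print(result)  # {'AAA': 12.25, 'BBB': 20.0, 'CCC': 7.0}
```

Each phase is kept as a separate function so the structure mirrors the phases the section walks through; in Hadoop the shuffle step is performed by the framework rather than written by hand.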

In the next two sections, we will introduce you to Apache Pig & Hive. We will try to calculate the maximum closing price for stock symbols from a stock dataset using Pig and Hive.

Big Data and Hadoop Essentials

Essential Knowledge for everyone associated with Big Data & Hadoop

Created by Nitesh Jain - Hadoop and Data Analytics Instructor

Students: 158982, Price: Free

Are you interested in the world of Big Data technologies, but find it a little cryptic and see the whole thing as a big puzzle?

Are you looking to understand how Big Data impacts large and small businesses and people like you and me?

Do you feel that many people talk about Big Data and Hadoop without even knowing the basics, like the history of Hadoop and its major players and vendors? Then this is the course just for you!

This course builds an essential, fundamental understanding of Big Data problems and Hadoop as a solution. This course takes you through:

  1. Understanding Big Data problems with easy-to-understand examples.
  2. The history and advent of Hadoop, right from when Hadoop wasn’t even named Hadoop.
  3. The Hadoop magic that makes it so unique and powerful.
  4. Understanding the difference between data science and data engineering, which is one of the big confusions in selecting a career or understanding a job role.
  5. And most importantly, demystifying Hadoop vendors like Cloudera, MapR and Hortonworks.

Unlock the world of Big Data!!!

The best part is that this course is free of cost!!! (The best things in life are free :))

Fundamentals Data Analysis & Decision Making Models – Theory

Master handling Big Data, Analysis and presenting interactive DashBoards. Forecasting and

Created by Manish Gupta - Hospitality Finance Expert and Business Strategist

Students: 25482, Price: Free

Do you want to understand how big data is analysed and how decisions are made based on big data?

In this course we will briefly cover the various steps involved in data analysis. The objective of this course is to make you familiar with these steps and to collect your feedback and questions.

I will then use that feedback and those questions to make the detailed course better and more relevant for you.

Introduction to Big Data – an overview of the 10 V’s

An overview of the Dimensions and Forms of Big Data.

Created by Taimur Zahid - Machine Learning Engineer

Students: 14970, Price: Free

This course is designed to be an in-depth overview of the field of Data Science. It teaches students the various characteristics of Big Data and discusses a few types of data that exist. After completing this course, you will have knowledge that you can apply later in your journey into this field when you're selecting an algorithm, a tool, or a framework, or even while making a blueprint of how to deal with the problem at hand.

Big Data in Advertising – Explained in Plain English

A 30 min overview of what kind of data advertisers use and how that data is collected on your devices.

Created by Ben Silverstein - Digital Advertising Professional & Entrepreneur in NYC

Students: 14303, Price: Free

Big Data is a popular buzzword but it is very vague and can be somewhat scary. In this course I explain what Big Data in advertising actually is, and try to remove some of the confusion around the subject. 

This course gives everyone from beginners to professionals a quick overview of what data means to digital advertising. I review the companies involved, define the types of data advertisers look for, when and why advertisers buy data, how users share it (knowingly or unknowingly), and much more.

In this 35-minute course I cover the following topics:

  1. What Personal Data (PII) is
  2. What laws exist to regulate data collection
  3. Different types of Data Categories that Advertisers look for
    1. Demographic
    2. Behavioral
    3. Contextual
    4. Retargeting
    5. Location
  4. How data is actually collected
  5. Ad-Tech companies involved in the data & ad delivery process

At the end of the course you will have a better understanding of what kind of data is being collected about you, and how advertisers use that data to target their ads. You will also have a better understanding of why you see certain ads on each of your devices. 

---------------------

Real Student Testimonials:

★★★★★ “Very clear and to the point. I like the graphics.” - David Peterson

★★★★★ “Very clear presentation. I understand very easy.” - Catalin Badea

Reviews from Other Courses

Digital Advertising & Marketing 101

★★★★★ “The real-world examples almost makes it self-explanatory. Professionally done and author speaks with authority - i.e. he knows what he's talking about and it shows.” - AJ Du Toit

★★★★★ “Thought this was an excellent introduction course. Working in the industry without a huge amount of experience in this area, it was a great way to familiarize myself with topics in ongoing conversations internally and externally. Will be taking 201 to further my understanding.” - Jocleyn Armour

★★★★★ “It is advertised as a 101 course and it did exactly that and very well, touching on the building blocks of Digital Advertising and Marketing. Good job Ben.” - Jean C

Digital Advertising & Marketing 201

★★★★★ “When combined with Ben's 101 course, the two classes make for a thorough and well-organized primer on digital media today. Perfect for marketing people and agency folks (creative, account) who are not immersed in a media agency. It will give you a foundation for how digital media is structured, a clear explanation of the jargon and acronyms you'll hear bantered about, and a better understanding of the opportunities available. The 201 course goes into important detail about some of the key changes that have taken place in digital advertising recently. Ben explains the concepts clearly and succinctly. Definitely worth the time investment.” - Shawn E Fraser

★★★★★ “This course is amazing. I do affiliate marketing and always wanted to learn about programmatic advertising and this course me taught that. I completed this for an interview and the employer was really impressed by the knowledge I had. Hope there is another in-depth version of this course. Where he goes into ad platforms or ad servers and teaches the real world applications.” - Suryameet Singh

★★★★★ “Comprehensive overview...detailed!” - Kaithlean Crotty-Clark

Introduction to Programmatic Advertising

★★★★★ “I'm in advertising sales and have been looking for a clean easy way to explain and also test my root knowledge of the programmatic ad space. It was very helpful and simple to understand which is hard to do with this topic.” - Raul Bonilla

★★★★★ "Being an advertising agency media planner and buyer, having this hands on information helps when we face a decision to go into the digital advertising space. Your 101 and 201 was extremely informative and truly like your overviews in a very simplistic explanation. Thank you and look forward to your future courses." - Diane Tody

---------------------

 

According to recent trends by Statista, the digital marketing & advertising industry is on pace to be worth over $330B a year by 2021. If you’re not already learning about this industry, you will be soon. Get a jump start on your career, and on your co-workers and peers, by taking this intermediate-to-advanced level course.

New in Big Data: Apache HiveMall – Machine Learning with SQL

HiveMall SQL on Spark, MapReduce and Tez. Leverage your knowledge of SQL to enter Machine Learning and Big Data space.

Created by Elena Akhmatova - Data Scientist

Students: 14129, Price: Free

It is widely accepted that applying Machine Learning techniques to data is a complex task that requires knowledge of a variety of programming languages and means hours of coding, compiling and debugging.  

Not any longer!

Apache HiveMall is a Machine Learning library that allows anyone with basic knowledge of SQL to run Machine Learning algorithms. 

  • No coding
  • No compiling
  • No debugging

Apache HiveMall algorithms are hidden behind Hive UDFs. This allows end users to use SQL, and only SQL, to apply Machine Learning algorithms to a very large volume of training data.

Apache HiveMall Machine Learning Library makes training, testing, and model evaluation easy and accessible to a much wider community of business experts than ever before.

ClickHouse crash course. Conquer big data with ease

Learn how to use one of the most powerful open source OLAP databases on the market. Put new life into your big data.

Created by Viktor Dashkov - Software Developer and Fitness Geek

Students: 5819, Price: Free

Has your data grown too much?

Do you have to wait forever to get even simple answers from your system?

Do you just want to explore your data in real time while it’s actually relevant and not 30 minutes later when nobody cares anymore?

Do you want your dev team to work on features and not on the infrastructure?

Then you’ve come to the right place. ClickHouse is a new technology that addresses all of the pain points above.

ClickHouse was designed to be very, very fast. And it is.

What is more, it’s extremely robust, and it fails only in extreme circumstances.

Put ease of installation and maintenance on top, and you get a nearly ideal solution for most OLAP use cases.

How can I help?

Together we’ll explore the main functionality of ClickHouse, and we will develop the tools and skills to incorporate and manage this database in existing and future systems.

We are going to have lots of fun along the way, because technology should be fun, and with tools like ClickHouse it is.

Some of the topics we’ll cover:

  • ClickHouse Installation

  • External dictionaries

  • Arrays

  • Sampling

  • Aggregation

  • Cluster Configuration

You'll find lots of code snippets and supplementary material inside the course to help you master even the hardest topics.

At the end of the course, you’ll be able to confidently use ClickHouse in production.

You’ll get familiar with the main features and quirks of the database, as well as some edge cases you might encounter.

Is it for me?

If you are an IT pro with specific OLAP needs, or just a DevOps engineer looking for a great new technology, then my answer is yes.

All you need is basic knowledge of SQL and Docker.

ClickHouse will make the rest a breeze.

Can't wait to see you inside!

Setup Single Node Cloudera Cluster on Google Cloud

Deploy Cloudera Hadoop, Spark & Kafka Environment (on GCP) for CCA 131 Preparation using Google Cloud Free Credits

Created by Bhavuk Chawla - Authorized Instructor for GCP, Snowflake, Cloudera, Confluent

Students: 3635, Price: Free

You may also like our other courses:

  1. Big Data Crash Course | Learn Hadoop, Spark, NiFi and Kafka

  2. Big Data For Architects | Build Big Data Pipelines and Compare Key Big Data Technologies

  3. Google Data Engineer Certification Practice Exams

  4. Confluent Certified Operator for Apache Kafka Practice Test

  5. Confluent Certified Developer Apache Kafka Practice Tests

Please note that this is neither an official course of Cloudera nor Google.

The purpose of this course is to give you hands-on exposure to setting up a Big Data Engineering Lab in pseudo-distributed mode (Single Machine Cluster). The environment is built using the new Cloudera platform, i.e. Cloudera Data Platform, for deploying HDFS, YARN, Hive, Spark etc. on Google Cloud Platform.

This course leverages the free credits of Google Cloud Platform, so you do not need to pay anything to run the labs as long as free credits are available.

You may join our YouTube Channel named "DataCouch" to access interesting videos free of cost.

We have many Google certified instructors who can help your team move forward with its Google Cloud implementation in the right way.

We are also an official training delivery partner of Confluent Kafka. We conduct corporate trainings on various topics including Confluent Kafka Developer, Confluent Kafka Administration, Confluent Kafka Real Time Streaming using KSQL & KStreams and Confluent Kafka Advanced Optimization. Our instructors are well qualified and vetted by Confluent for delivering such courses.

Please feel free to reach out if you have any requirements for Confluent Kafka Training for your team. Happy to assist.

Big Data Analysis With Pandas Data Frame

Real World Projects: Data Analysis

Created by Saima Aziz - Instructor

Students: 3572, Price: Free

Welcome to Data Analysis using Python. My name is Saima Aziz and I will be the instructor for this course. I have more than 25 years of teaching experience.

In this course, you will apply your coding skills to a wide range of datasets to solve real-world projects using Pandas Data Frame, such as:

  • Covid-19 datasets
  • London housing datasets
  • Car datasets
  • Police datasets
  • Udemy courses datasets

You will increase your chances of success in data science by experimenting with Python projects. That way, you're learning by actually doing instead of just watching videos.

Building projects will help you tie together everything you are learning. Once you start building projects, you will immediately feel like you are making progress.

Where should I start? What makes a good project? What do I do when I get stuck?

I have carefully designed the content of the course to be comprehensive, easy to understand, and aligned with industry requirements.

If you get stuck, don't give up! There is enough material in the course to help you solve the problems, and your hard work will pay off.

Cloudera Hadoop | Big Data | Authentication With Kerberos

Hadoop Administrator | Cloudera | Cloudera Hadoop Secure Cluster | Kerberos Authentication | MIT Kerberos

Created by Imran Chaush - Hadoop Administrator

Students: 2253, Price: Free

Cloudera Hadoop | Big Data | Secure Cloudera Manager With Kerberos Authentication

What you will learn in this course:

  1. Hadoop 2 prerequisites
  2. Cloudera Manager deployment
  3. Adding a new node to a Cloudera cluster
  4. Kerberos authentication steps
  5. Securing a Cloudera cluster

I demonstrate the Hadoop 2 prerequisites and the Cloudera Manager installation. After installation, I enable Kerberos authentication on Cloudera Manager and run a job on the cluster to check whether Kerberos is working. I also show how to create an EC2 instance, create an image of an EC2 instance, and work with spot and on-demand instances. If you want to secure your Hadoop environment, you will learn how in this course.