Menu

maexadata - Welcome to the world of mega-exabytes!!!

HADOOP

Hadoop will probably get us from a hundred thousand data points down to, like, five thousand. So we're down to five days instead of five years.
-- Robin Sloan

Hadoop Administration

Objectives

This program will give the participants a technical competence of performing administrative tasks in Hadoop. This is a technical program which will provide a working knowledge of Hadoop Administration.

Program Outline

The program covers:

  • Hadoop Single Node Installation
  • Hadoop Cluster Installation
  • Hadoop Admin Commands
  • Security with Hadoop
  • Working with Cloudera Manager
  • Case Study: Planning Real Life Hadoop Environment

Competency Achievement

After the completion of program, the participants will be able to:

  • Manage Hadoop Environment
  • Hadoop Single Node Installation
  • Hadoop Cluster Installation
  • Work with Cloudera Manager (CDH5)

 

Hadoop Development

Objectives

This program gives the participants an exposure to the Hadoop System and Hadoop components like Pig, Hive, Sqoop. Also gives an exposure to Spark, SparkHive & Scala. This is a technical program which will provide an overview along with basic working knowledge of the said components.

Program Outline

The program covers the following

Analytics Overview

  • Analysis v/s Analytics
  • Business Analyst v/s Data Scientist
  • Types Of Analytics
  • Application Of Analytics
  • Use Cases

Hadoop Overview

  • What is Big Data?
  • What is Hadoop? Why Hadoop?
  • Hadoop Distributed File System
  • HDFS Commands
  • MapReduce Concept (Understanding Input / Output)
  • Simple Word Count program in Java using MapReduce
  • Hadoop Eco-System Components 

Pig

  • Introduction To Pig
  • Pig Architecture
  • Basic Pig Programming
  • User Defined Functions
  • Case Studies

Hive 

  • Introduction To Hive
  • Hive Architecture
  • Basic Hive Commands
  • Advanced Hive Commands
  • Hive Commands For Analytics
  • Case Studies

Sqoop 
 

  • Introduction To Sqoop
  • Sqoop Data Import
  • Sqoop Data Export
  • Case Studies

Flume 

  • Introduction To Flume
  • Flume Architecture
  • Flume Concepts
  • Working With Flume
  • Case Studies

Spark 

  • Introduction To Spark
  • Spark Architecture
  • Spark Concepts

SparkHive

  • Introduction To SparkHive
  • SparkHive Architecture & Concepts
  • Basic SparkHive Commands
  • Advanced SparkHive Commands
  • SparkHive Commands For Analytics

Scala

  • Introduction To Scala
  • Scala Architecture & Concepts
  • Basic Scala Commands
  • Case Studies

    
Competency Achievement

After the completion of program, the participants will be:

  • Able to transfer data to & from local file system & HDFS.
  • Able to transfer data to & from a RDBMS system & HDFS.
  • Familiar to Hadoop Pig environment and confidently work on Pig development.
  • Familiar to Hadoop Hive environment and execute Hive commands for BigData Analytics.
  • Understand working of Spark.
  • Familiar to SparkHive environment and execute SparkHive commands for BigData Analytics.
  • Familiar to Spark Scala environment and carry out elementary Scala development.