0% found this document useful (0 votes)
41 views3 pages

Hadoop Course Contents PDF

The document outlines the course contents for a Hadoop training program. It includes 10 chapters that cover topics such as introduction to big data and Hadoop, installing and setting up Hadoop, MapReduce, YARN, Hive, Pig, Sqoop, HBase, and Oozie. The course is designed to teach students the fundamental concepts and components of Hadoop as well as how to use related tools for working with big data. It provides hands-on exercises for students to gain experience with Hadoop and its ecosystem. The course duration is 45 hours and costs 20,000 Indian rupees per person.

Uploaded by

punitha.jagan536
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
41 views3 pages

Hadoop Course Contents PDF

The document outlines the course contents for a Hadoop training program. It includes 10 chapters that cover topics such as introduction to big data and Hadoop, installing and setting up Hadoop, MapReduce, YARN, Hive, Pig, Sqoop, HBase, and Oozie. The course is designed to teach students the fundamental concepts and components of Hadoop as well as how to use related tools for working with big data. It provides hands-on exercises for students to gain experience with Hadoop and its ecosystem. The course duration is 45 hours and costs 20,000 Indian rupees per person.

Uploaded by

punitha.jagan536
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

Hadoop Course Contents

Chapter : 1 – Introduction To Big Data Chapter: 4 – Map Reduce – 1(MR V1)

 What is Big Data?  Understanding Map Reduce


 Examples of Big Data  Job Tracker and Task Tracker
 Reasons of Big Data Generation  Architecture of Map Reduce
 Why Big Data deserves your attention  Map Function
 Use Cases of Big Data  Reduce Function
 Different options of analyzing Big Data  Data flow of Map Reduce
 Hadoop Writable, Comparable &
Chapter :2 – Introduction To Hadoop
comparision with Java data types
 What is Hadoop  Creation of local files with directories
 History of Hadoop and Hadoop API
 How Hadoop name was given  Creation of HDFS files with directories
 Problems with Traditional RDBMS and and Hadoop API
need for Hadoop  Map Function & Reduce Function
 Hadoop Architecture  How Map Reduce works
 Fundemental concepts of Hadoop  Anatomy of Map Reduce Job
 Rack Awarness  Submission & Initialization of Map
Reduce Job
 Read/Write from HDFS
 HDFS Federation & High Availability  Monitoring & Progress of Map Reduce
Job
Chapter : 3 – Starting Hadoop  Understand the difference between
Block and Input split
 Setting up single node Hadoop
 Role of Record reader, Shuffler and
Cluster(Pseudo mode)
sorter
 Understanding Hadoop configuration
 File Input Formats
files
 File Output Formats
 Hadoop Components – HDFS , Map
 Getting started with Eclipse IDE
Reduce
 Setting up Eclipse Development
 Overview of Hadoop Processes
Environment
 Overview of Hadoop Distributed File
 Creating Map Reduce projects
System
 Configuring Hadoop API on Eclipse IDE
 The building blocks of Hadoop
 Differences between the Hadoop old
 Assignment On: Using HDFS commands
API and New API
 Life cycle of the job  Difference between Cluster by &
 Identity Reducer Distribute by
 Map Reduce Program flow with word  File Input Formats
count  Text File
 Combiner & Partitioner, Custom  RC
Partitioner with examples  ORC
 Joining Multiple data sets in Map  Sequence
Reduce  Avro
 Map Side, Reduce Side Joins With  Parquet
Examples  Creating UDF’S
 Distributed Cache with practical  Optimization Techniques
example  Hands –On On
 Speculation Execution  Assignment on Hive
 Schedulers
 FIFO Schedulers Chapter : 7 – Pig
 FAIR Schedulers  Introduction to Apache Pig
 Capacity Schedulers
 Building Blocks( Bag, Tuple, Field)
Chapter : 5 – Map Reduce -2 (Yarn)  Installing Pig
 PIG Terminology & Data Types
 Limitations of Current Architecture  Different modes of execution of PIG
 Yarn Architecture  Working with various PIG commands
 Application Master, Node Manager, & covering all the functions in PIG
Resource Manager  Developing PIG Scripts
 Writing a Map Reduce using Yarn  Parameter Substitution
 Command line arguments
Chapter : 6 - Hive
 Passing parameters through a
 Introduction to Hive param file
 Architecture of Hive  Joins (Left Outer, Right Outer, Full
 Installing Hive Outer)
 Hive Data Types  Nested Queries
 Exploring Hive Meta Store Tables  Specialized Joins in PIG(Replicated,
 Types of Tables in Hive Skewed, Merge Join)
 Partitions(Static & Dynamic)  HCatalog(Getting data from hive to pig
 Buckets & Sampling & Vice versa)
 Indexes  Working with unstructured data
 Views  Working with Semi-structured data like
 Developing Hive Scripts XML, JSON
 Parameter Substitution  Optimizing techniques
 Difference between order by & Sort by  Creating UDF’s
 Hands-On On
 Assignment on PIG  CRUD operations of HBASE with
Examples
Chapter : 8 - SQOOP
 HIVE integration with HBASE
 Introduction to Sqoop & Architecture  Assignment On
 Import Data from RDBMS to HDFS Chapter :10 - OOZIE
 Importing Data from RDBMS to HIVE
 Exporting Data from Hive To RDBMS  What is OOzie
 Handling incremental load using Sqoop  Features of Oozie
 Assignment On  Job Types in Oozie
 Control Nodes & Action Nodes
Chapter : 9 - HBASE
 Oozie Workflow process flow
 Introduction To HBASE  Oozie Parametarization
 Exploring HBASE Master & Region  Oozie Command Line Examples
server  Oozie Web Console
 Exploring Zookeeper  Assignment On

Duration :45 hours Price : 20,000 INR / person

To know more click http://techlogik.in/?page_id=164 and provide your contact we will reach back to you.

Techlogik Learning Services Private Limited


1st Floor , Sigma Arcade, Marathalli, Bangalore - 560037
Phone : 080 41 606 123, 8088 9412 77, 8088 9412 78
www.techlogik.in

You might also like