0% found this document useful (0 votes)
85 views6 pages

1 Month Big Data Boot Camp

This document outlines an 11 module, 30 day training program on Hadoop and big data technologies. The program covers fundamental concepts like HDFS, MapReduce, Pig and Hive. It also covers related technologies like HBase, Zookeeper, Sqoop, Oozie and Flume. The final module involves building a web log analysis proof of concept using MapReduce. The program aims to provide both theoretical and hands-on training with a 40:60 theory to practical split over 11 sessions. Fees for participating are Rs. 6,500 per head.

Uploaded by

rashmikedia
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
85 views6 pages

1 Month Big Data Boot Camp

This document outlines an 11 module, 30 day training program on Hadoop and big data technologies. The program covers fundamental concepts like HDFS, MapReduce, Pig and Hive. It also covers related technologies like HBase, Zookeeper, Sqoop, Oozie and Flume. The final module involves building a web log analysis proof of concept using MapReduce. The program aims to provide both theoretical and hands-on training with a 40:60 theory to practical split over 11 sessions. Fees for participating are Rs. 6,500 per head.

Uploaded by

rashmikedia
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

From Strategy to Implementation

Project Based Summer Training Programme

BigData / HADOOP
Total No Of days : 30
Theory and Practical : 40:60
Total No of Session - 11

Module 1. What is Big Data & Why Hadoop?

What is Big Data?

Traditional data management systems and their limitations

What is Hadoop?

Why is Hadoop used?

The Hadoop eco-system

Big data/Hadoop use cases

Module 2. HDFS (Hadoop Distributed File System) and installing Hadoop on single node

HDFS Architecture

HDFS internals and use cases

HDFS Daemons

Files and blocks

Namenode memory concerns

Secondary namenode

HDFS access options

Installing and configuring Hadoop

Hadoop daemons

Basic Hadoop commands

Hands-on exercise

Module 3. Advanced HDFS concepts

HDFS workshop

HDFS API

How to use configuration class

Using HDFS in MapReduce and programmatically

HDFS permission and security

Additional HDFS tasks

HDFS web-interface

Hands-on exercise

Module 4. Cloud computing overview and installing Hadoop on multiple nodes

Cloud computing overview

SaaS/PaaS/IaaS

Characteristics of cloud computingSaaS/PaaS/IaaS

Cluster configurationsSaaS/PaaS/IaaS

Configuring Masters and Slaves

Module 5.Introduction to MapReduce

MapReduce basics

Functional programming concepts

List processing

Mapping and reducing lists

Putting them together in MapReduce

Word Count example application

Understanding the driver, mapper and reducer

Closer look at MapReduce data flow

Additional MapReduce functionality

Fault tolerance

Hands-on exercises

Module 6. MapReduce workshop

Hands-on work on MapReduce

Module 7. Advanced MapReduce concepts

Understand combiners & partitioners

Understand input and output formats

Distributed cache

Understanding counters

Chaining, listing and killing jobs

Hands-On Exercise

Module 8. Using Pig and Hive for data analysis

Pig program structure and execution process

Joins & filtering using Pig

Group & co-group

Schema merging and redefining functions

Pig functions

Understanding Hive

Using Hive command line interface

Data types and file formats

Basic DDL operations

Schema design

Hands-on examples

Module 9. Introduction to HBase, Zookeeper & Sqoop

HBase overview, architecture & installation

HBase admin: test

HBase data access

Overview of Zookeeper

Sqoop overview and installation

Importing and exporting data in Sqoop

Hands-on exercise

Module 10. Introduction to Oozie, Flume and advanced Hadoop concepts


Overview of Oozie and Flume

Oozie features and challenges

How does Flume work

Connecting Flume with HDFS

YARN

HDFS Federation

Authentication and high availability in Hadoop

Module 11. Building a web-log analysis POC using MapReduce

Designing structures for POC

Developing MapReduce code

Push data using Flume into HDFS

Run MapReduce code

Analyse the output

Bigdata Projects
Fees: Rs. 6,500/- per head

Contact Details:
Thanks and Regards
Sarika Sharma, Business Development Manager
TechBharat Consulting
Ph: +91-11-47106989
Mob: +91 - 9911030818
E-mail: [email protected]

You might also like