Getting An Overview of Big Data
Getting An Overview of Big Data
Hadoop Ecosystem
Hadoop Distributed File System
MapReduce
Hadoop YARN
Introducing HBase
Combining HBase and HDFS
Hive
Pig and Pig Latin
Sqoop
ZooKeeper
Flume
Oozie
Summary
Quick Revise
Background of YARN
Advantages of YARN
YARN Architecture
Working of YARN
YARN Schedulers
Backward Compatibility with YARN
YARN Configurations
YARN Commands
YARN Containers
Registry
Log Management in Hadoop 1
Summary
Quick Revise
Introducing Hive
Getting Started with Hive
Hive Services
Data Types in Hive
Built-In Functions in Hive
Hive DDL
Data Manipulation in Hive
Data Retrieval Queries
Using JOINS in Hive
Summary
Quick Revise
Introducing Pig
Running Pig
Getting Started with Pig Latin
Working with Operators in Pig
Debugging Pig
Working with Functions in Pig
Error Handling in Pig
Summary
Quick Revise
Introducing Oozie
Installing and Configuring Oozie
Understanding the Oozie Workflow
Oozie Coordinator
Oozie Bundle
Oozie Parameterization with EL
Oozie Job Execution Model
Accessing Oozie
Oozie SLA
Summary
Quick Revise
Introduction to NoSQL
Types of NoSQL Data Models
Schema-Less Databases
Materialized Views
Distribution Models
Sharding
Summary
Quick Revise
Flume Architecture
Sqoop
Importing Data
Sqoop2 vs Sqoop
Summary
Quick Revise
What is Mahout?
Machine Learning
Collaborative Filtering (Recommendation)
Clustering
Classification
Mahout Algorithms
Environment for Mahout
Summary
Quick Revise
Chapter 18: Understanding Analytics and Big Data
Analytical Approaches
History of Analytical Tools
Introducing Popular Analytical Tools
Comparing Various Analytical Tools
Installing R
Installing RStudio
Summary
Quick Revise
Using Plots
Saving Graphs to External Files
Advanced Features of R
Summary
Quick Revise