This 3-day course introduces attendees to integrating Big Data components like Hadoop to create a Data Lake, selecting appropriate data stores, processing large datasets with Hadoop, querying data with Pig and Hive, and planning a Big Data strategy. The course is suitable for managers, programmers, architects and administrators across industries wanting a foundational overview. Attendees will learn concepts but not receive deep training in tools and techniques.
Introduction Big Data With Hadoop
Duration: 3 days
You Will Learn How To:
Integrate Big Data components to create an appropriate Data Lake
Select the correct Big Data stores for disparate data sets
Process large data sets using Hadoop to extract value
Query large data sets in near real time with Pig and Hive
Plan and implement a Big Data strategy for your organization
Who Should Attend:
As an introduction to Big Data training, this course is ideal for anyone, including managers, programmers, architects and administrators, who wants a foundational overview of the key components of Big Data and how they can be integrated to provide suitable solutions for their organization. No programming experience is required. Programmers should be aware that the exercises in this course are intended to give attendees high-level exposure to the capabilities of the Big Data software tools and techniques, and not a deep dive.
Course Detail:
Introduction to Big Data
Defining Big Data
The four dimensions of Big Data: volume, velocity, variety, veracity
Introducing the Storage, MapReduce and Query Stack
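As a rough illustration of the MapReduce layer in that stack, the Python sketch below imitates the map, shuffle and reduce steps inside a single process. It is not Hadoop code: the function names (map_phase, shuffle, reduce_phase, word_count) and the sample input are assumptions made for this example only.

```python
from collections import defaultdict


def map_phase(record):
    """Emit a (word, 1) pair for every word in one input line."""
    for word in record.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group emitted values by key, mimicking the step between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Collapse all values for one key into a single count."""
    return key, sum(values)


def word_count(records):
    """Run map, shuffle and reduce over an iterable of text lines."""
    pairs = (pair for record in records for pair in map_phase(record))
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    # Hypothetical input lines, chosen only for illustration.
    print(word_count(["big data", "Big Data lake"]))
```

In a real cluster the map and reduce functions run on many machines and the framework performs the shuffle; here everything runs locally so the three steps can be read in order.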
Delivering business benefit from Big Data
Establishing the business importance of Big Data
Addressing the challenge of extracting useful data
Integrating Big Data with traditional data
Storing Big Data
Analyzing your data characteristics
Selecting data sources for analysis
Eliminating redundant data
Establishing the role of NoSQL
Overview of Big Data stores
Data models: key-value, graph, document, column-family
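To make the four data models concrete, the sketch below expresses one made-up customer record in each of them using plain Python structures. This is only an analogy for how each kind of store shapes its data, not the API of any particular product; the customer, the field names and the orders_for helper are hypothetical.

```python
import json

# Key-value: an opaque value retrieved by a single key.
key_value = {"customer:42": json.dumps({"name": "Ada", "city": "London"})}

# Document: a self-describing, nested record the store can query by field.
document = {
    "_id": 42,
    "name": "Ada",
    "address": {"city": "London"},
    "orders": [1001, 1002],
}

# Column-family: row key -> column family -> column -> value.
column_family = {
    "42": {
        "profile": {"name": "Ada", "city": "London"},
        "orders": {"1001": "shipped", "1002": "open"},
    }
}

# Graph: nodes plus labelled edges that describe relationships.
graph = {
    "nodes": {"c42": {"name": "Ada"}, "o1001": {}, "o1002": {}},
    "edges": [("c42", "PLACED", "o1001"), ("c42", "PLACED", "o1002")],
}


def orders_for(customer_node):
    """Follow PLACED edges from a customer node to its order nodes."""
    return [dst for src, rel, dst in graph["edges"]
            if src == customer_node and rel == "PLACED"]


if __name__ == "__main__":
    print(json.loads(key_value["customer:42"])["city"])
    print(document["address"]["city"])
    print(column_family["42"]["orders"]["1001"])
    print(orders_for("c42"))
```

The same facts are reachable in every model; what changes is which lookups are cheap, which is why data characteristics drive the choice of store.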