
Big Data Analytics & Technologies

This document summarizes a lesson on big data analytics and technologies. It defines big data as the analysis, processing, and storage of large data collections. It discusses how big data solutions are needed when traditional techniques are insufficient. It also describes datasets, big data analytics, Hadoop as an open-source framework, related projects like HBase and Hive, and some common big data career opportunities. The presentation concludes with a question and answer session.

Uploaded by Wong pi wen
Copyright © All Rights Reserved

Big Data Analytics & Technologies
CT047-3-M

Summary
Topic & Structure of The Lesson

• Summary
– Summary of module
– Revision of Major Topic Areas
– Assessment Discussion
– Presentations

CT047-3-M-BDAT - Big Data Analytics & Technologies Summary Slide <2> of 9


Big Data

A field dedicated to the following, applied to large collections of data:
– Analysis
– Processing
– Storage
Big data solutions and practices are typically
required when traditional data analysis,
processing and storage technologies and
techniques are insufficient.



Datasets

• Collections or groups of related data are generally referred to as datasets.
• Each group or dataset member shares the same set of attributes or properties as others in the same dataset.
• Examples:
– Tweets stored in a flat file
– A collection of image files in a directory
– An extract of rows from a database table stored in a CSV-formatted file
– Historical weather observations that are stored as XML files
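
The "same set of attributes" property can be illustrated with a minimal Python sketch. It assumes a small CSV extract held in a string; the column names (`id`, `city`, `temp_c`), the values, and the helper names `load_dataset` and `shared_attributes` are hypothetical and chosen only for illustration.

```python
import csv
import io

# Hypothetical CSV extract; column names and values are illustrative only.
CSV_EXTRACT = """id,city,temp_c
1,Kuala Lumpur,31.5
2,Penang,30.0
3,Johor Bahru,29.8
"""

def load_dataset(text):
    """Parse CSV text into a list of dict records, one per dataset member."""
    return list(csv.DictReader(io.StringIO(text)))

def shared_attributes(records):
    """Return the common attribute set if all records share it, else None."""
    if not records:
        return set()
    first = set(records[0].keys())
    if all(set(record.keys()) == first for record in records):
        return first
    return None

dataset = load_dataset(CSV_EXTRACT)
print(len(dataset), sorted(shared_attributes(dataset)))
```

Each row becomes one dataset member, and the check confirms that every member exposes the same attributes. A real dataset would usually be read from a file rather than an in-memory string.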



Big Data Analytics

• Can be used directly by enterprise applications
• Results obtained through the processing of Big Data can lead to the following benefits:
• Operational optimization
– Enterprise Resource Planning
– Actionable intelligence
• Identification of new markets
– Business analytics: analyze the environment and identify any possible changes that could benefit or pose a problem to the organization
• Fault and fraud detection
– Fraudulent claims at an insurance agency
• More detailed records: medical
• Scientific discoveries
– Medical: determining cancer risk



Hadoop

• An open-source framework for large-scale data storage and data processing that is compatible with commodity hardware.
• The Hadoop framework has established itself as a de facto industry platform for contemporary Big Data solutions.
• It can be used as an ETL engine or as an analytics engine for processing large amounts of structured, semi-structured and unstructured data.
• From an analysis perspective, Hadoop implements the MapReduce processing framework.
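
The MapReduce idea can be sketched in plain Python as a word count. This is an in-process illustration of the map, shuffle and reduce steps only; it does not use Hadoop or its API, and the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) are hypothetical.

```python
from collections import defaultdict

def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]

def shuffle_phase(pairs):
    """Shuffle: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_phase(key, values):
    """Reduce: combine the values for one key into a single result."""
    return key, sum(values)

def word_count(lines):
    """Run map, shuffle and reduce in sequence over the input lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())

lines = ["big data big ideas", "data storage"]
print(word_count(lines))
```

In a real cluster the map and reduce steps would run in parallel on different nodes, with the framework handling the shuffle between them; here everything runs sequentially in one process to keep the structure visible.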



Other projects based on Hadoop
• HBase
• Hive
• Spark



Big Data: Career Opportunities
• Big Data Cluster & Hadoop Administrators
• Hadoop Programmers & Developers
• Data & Business Analysts
• Data Scientists
• Big Data Architects



Question and Answer Session

