0% found this document useful (0 votes)
118 views2 pages

Quiz 3 Big Data

This document discusses big data analytics and related technologies. It asks to identify the process for developing conceptual models of big data analytics. It also asks to draw diagrams of Hadoop and NoSQL databases integrated with a data warehouse infrastructure. Finally, it describes Pig and Hive, two technologies used in Hadoop 2 to make programming easier using a procedural and declarative language respectively.

Uploaded by

Arsyan Syahir
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
118 views2 pages

Quiz 3 Big Data

This document discusses big data analytics and related technologies. It asks to identify the process for developing conceptual models of big data analytics. It also asks to draw diagrams of Hadoop and NoSQL databases integrated with a data warehouse infrastructure. Finally, it describes Pig and Hive, two technologies used in Hadoop 2 to make programming easier using a procedural and declarative language respectively.

Uploaded by

Arsyan Syahir
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

Quiz 3 Big Data

 Identify the process that involves in developing conceptual of Big Data Analytics
 Draw the design of Hadoop and No SQL database that linked to datawarehouse infrastructure

Operational User
System
Streaming / Alert
ECP/ Engine BI
Server
Operational
System

Dept Data
Mart

Machine
Data

Data Warehouse

Hadoop

In Memory
Sandbox

Machine
Data

User

Free Standing
Audio/Video Sandbox
Data

External Data
 Draw the realtime engine and event processing in the information architecture

Real time feeds


Streaming Processing Application
memory

disk

Storage Optional
Storage and Queris

 Hadoop 2 solutions for making programming easier using pig and Hive.Describe those them
Pig : Procedural Data Flow Language, For Programming, Mainly used by Researchers and
Programmers, Operates on the client side of a cluster, Does not have a dedicated metadata
database, Pig is SQL like but varies to a great extent, Pig supports Avro file format.

Hive : Declarative SQLish Language, For creating reports, Mainly used by Data Analysts, Operates
on the server side of a cluster, Makes use of exact variation of dedicated SQL DDL language by
defining tables beforehand, Directly leverages SQL and is easy to learn for database expert, Hive
does not support it.

You might also like