Quiz 3 Big Data
Quiz 3 Big Data
Identify the process that involves in developing conceptual of Big Data Analytics
Draw the design of Hadoop and No SQL database that linked to datawarehouse infrastructure
Operational User
System
Streaming / Alert
ECP/ Engine BI
Server
Operational
System
Dept Data
Mart
Machine
Data
Data Warehouse
Hadoop
In Memory
Sandbox
Machine
Data
User
Free Standing
Audio/Video Sandbox
Data
External Data
Draw the realtime engine and event processing in the information architecture
disk
Storage Optional
Storage and Queris
Hadoop 2 solutions for making programming easier using pig and Hive.Describe those them
Pig : Procedural Data Flow Language, For Programming, Mainly used by Researchers and
Programmers, Operates on the client side of a cluster, Does not have a dedicated metadata
database, Pig is SQL like but varies to a great extent, Pig supports Avro file format.
Hive : Declarative SQLish Language, For creating reports, Mainly used by Data Analysts, Operates
on the server side of a cluster, Makes use of exact variation of dedicated SQL DDL language by
defining tables beforehand, Directly leverages SQL and is easy to learn for database expert, Hive
does not support it.