Lecture 2&3 Database Models
Lecture 2&3 Database Models
A domain is the set of all possible values that an attribute or field may
validly contain.
NB: The relational and Object Oriented Models are currently in used.
EMERGING DATABASE MODELS
Big Data refers to voluminous data, velocity(high speed data) and Varied
data.
Hadoop
NoSQL
MapReduce
Key-Value
HADOOP
Hadoop is an open-source software framework that is designed to store
and process large amounts of data across clusters of commodity
hardware. It is an Apache project that was initially created by Doug
Cutting and Mike Cafarella in 2006, and it has since become a widely
adopted tool for big data processing and analysis.
Two Major components
Hadoop Distributed File System (HDFS)
MapReduce.
HDFS is a distributed file system that allows for the storage of large data
sets across multiple machines in a cluster. It is designed to be fault-
tolerant, meaning that it can handle hardware failures without losing
data.
MAPREDUCE
MapReduce is a programming model and processing framework that is
used to process data stored in HDFS.
MapReduce divides data processing tasks into two phases: the Map
phase and the Reduce phase.
In the Map phase, data is broken down into smaller chunks and
processed in parallel across the cluster. In the Reduce phase, the results
of the Map phase are aggregated to produce a final output.
NOSQL AND KEY-VALUE
NoSQL: A new generation of database management systems that is not
based on the traditional relational database model.