3 - Database Systems and Big Data
3 - Database Systems and Big Data
Introduction:
Learning Outcomes:
· Identify the key factors that must be considered when designing a database.
· Identify the various types of data models and explain how they are useful in planning
a database.
· Define the role of the database schema, data definition language, and data
manipulation language.
· Define the term big data and identify its basic characteristics.
· Explain why big data represents both a challenge and an opportunity.
· Define the term data management and state its overall goal.
· Define the terms data warehouse, data mart, and data lakes and explain how they
are different.
· Discuss the whole Hadoop computing environment and its various components.
· Define the term in-memory database and explain its advantages in processing big
data.
Data Fundamentals
Without data and the ability to process it, an organization cannot successfully complete its
business activities. It cannot pay employees, send out bills, order new inventory, or
produce information to assist managers in decision making. Recall that data consists of raw
facts, such as employee numbers and sales figures. For data to be transformed into useful
information, it must first be organized in a meaningful way. (Stair et al, 2018, p. 112)
Continue reading on page 112-117.
Because today’s businesses must keep track of and analyze so much data, they must keep
the data well organized so that it can be used effectively. A database should be designed to
store all data relevant to the business and to provide quick access and easy modification.
Moreover, it must reflect the business processes of the organization. When building a
database, an organization must carefully consider the following questions:
● Logical structure. How should data be arranged so that it makes sense to a given user?
Creating and implementing the right database system ensures that the database will
support both business activities and goals. But how do we actually create, implement, use,
and update a database? The answer is found in the database management system (DBMS).
As discussed earlier, a DBMS is a group of programs used as an interface between a
database and application programs or between a database and the user. Database
management systems come in a wide variety of types and capabilities, ranging from small
inexpensive software packages to sophisticated systems costing hundreds of thousands of
dollars. Continue reading on page 124-133.
Big Data
Big data is the term used to describe data collections that are so enormous (terabytes or
more) and complex (from sensor data to social media data) that traditional data
management software, hardware, and analysis processes are incapable of dealing with
them. Continue reading on page 134-140.
The raw data necessary to make sound business decisions is typically stored in a variety of
locations and formats. This data is initially captured, stored, and managed by transaction-
processing systems that are designed to support the day-to-day operations of an
organization. For decades, organizations have collected operational, sales, and financial
data with their online transaction processing (OLTP) systems. These OLTP systems put data
into databases very quickly, reliably, and efficiently, but they do not support the types of
big data analysis that today’s businesses and organizations require. Through the use of
data warehouses and data marts, organizations are now able to access the data gathered
via OLTP system and use it more effectively to support decision making. Continue reading
on page 140-146.