Unit 1

You are on page 1of 22

18CSE360T-INFORMATION STORAGE

MANAGEMENT

ABINAYA.G/CSE-SRMIST

Module 1: Introduction to Information Storage 1


Introduction to Information Storage

• Define data and information


• Describe types of data
• Describe the evolution of storage architecture
• Describe the core elements of a data center
• List the key characteristics of data center
• Provide an overview of virtualization and cloud computing

2
Why Information Storage and Management?
• Information is the knowledge derived from data
• Growth of digital information has resulted in information
explosion
• We live in an on-command, on-demand world
 We need information when and where required
• Increasing dependency on fast and reliable access to information
• Businesses seek to store, protect, optimize, and leverage the
information
 To gain competitive advantage
 To derive new business opportunity

3
What is Data?
Data

It is a collection of raw facts from which conclusions may be drawn.

• Data is converted into more


convenient form − digital data Digital Movie
Movie
• Factors for digital data growth 10101011010
00010101011
are: Photo
Digital Photo
01010101010
10101011010
 Increase in data-processing
00010101011
capabilities Book
e-Book
01010101010
10101010101
 Lower cost of digital storage
01010101010
email
 Affordable and faster Letter 10101010101

communication technology Digital Data

 Proliferation of applications and


smart devices

4
Types of Data
• Data can be classified as:
 Structured
 Unstructured
email Attachments PDFs
• Majority of data being Unstructured (90%)
X-rays
created is unstructured
Manuals Instant Messages
Images Documents

Forms
Web Pages
Contracts Rich Media

Invoices

Audio, Video
Structured (10%)
Database

5
Big Data
Big Data
It refers to data sets whose sizes are beyond the ability of commonly used
software tools to capture, store, manage, and process within acceptable
time limits.

• Includes both structured and unstructured data generated by


variety of sources
• Big data analysis in real time requires new techniques and tools
that provide:
 High performance
 Massively parallel processing (MPP) data platforms
 Advanced analytics
• Big data analytics provide an opportunity to translate large
volumes of data into right decisions

6
Storage
• Stores data created by individuals and organizations
 Provides access to data for further processing
• Examples of storage devices are:
 Media card in a cell phone or digital camera
 DVDs, CD-ROMs
 Disk drives
 Disk arrays
 Tapes

7
Evolution of Storage Architecture

Department 1 Department 2 Department 3


Server Server Server

Department 1 Department 2 Department 3


Server Server Server

Storage
Network

Server-centric Storage Architecture

Storage Device
Information-centric Storage Architecture

8
Data Center
Data Center

It is a facility that contains storage, compute, network, and other IT


resources to provide centralized data-processing capabilities.

• Core elements of a data center


 Application
 Database management system (DBMS)
 Host or Compute
 Network
 Storage
• These core elements work together to address data-processing
requirements

9
Data Center: Online Order Transaction System Example

Host/ Storage Array


Compute

Client

Storage
LAN/WAN Network

User
Interface OS and DBMS

10
Key Characteristics of a Data Center

Availability

Data Integrity Security

Manageability

Performance Capacity

Scalability

11
Managing Data Center
• Key management activities include
 Monitoring
 Continuous process of gathering information on various elements
and services running in a data center
 Reporting
 Details on resource performance, capacity, and utilization
 Provisioning
 Configuration and allocation of resources to meet the capacity,
availability, performance, and security requirements
• Virtualization and cloud computing have changed the way data
center infrastructure resources are provisioned and managed

12
Virtualization: An Overview
• Virtualization is a technique of abstracting physical resources and
making them appear as logical resources
 For example partitioning of raw disks
• Pools physical resources and provides an aggregated view of
physical resource capabilities
• Virtual resources can be created from pooled physical resources
 Improves utilization of physical IT resources

13
Cloud Computing: An Overview
• Enables individuals and organizations to use IT resources as a
service over network
• Enables self-service requesting and automates request-
fulfillment process
 Enables users to scale up or scale down the usage of computing
resources quickly
• Enables consumption-based metering
 Consumers pay only for the resources they use
 Example: CPU hours used, amount of data transferred, and
Gigabytes of data stored

14
Key Challenges in Managing Information
• Exploding digital universe
 Multifold increase of information growth

• Increasing dependency on information


 The strategic use of information plays

• Changing value of information


 Information that is valuable today may become less important
tomorrow

15
Some Constraints to Meeting the Requirements
Constraints include:
• Cost
• Physical environment
• Maintenance and support
• Compliance – regulatory and legal
• Hardware and software infrastructure
• Interoperability and compatibility

16
Data Center Environment –Application
• An application is a computer program that provides the logic for
computing operations
• The application sends requests to the underlying operating
system to perform read/write (R/W) operations on the storage
devices
• Some examples of these applications are e-mail, enterprise
resource planning (ERP), decision support system (DSS), resource
management, backup, authentication and antivirus applications,
and so on.

17
Database Management System (DBMS)

• Database Management System, is a complex set of software


program that controls the organization, storage,  management,
and retrieval of data from a database

• Software for creating and maintaining databases


• Acts as interface between application programs and physical
data files

• Separates logical and design views of data


• Solves many problems of the traditional data file approach

18
The Contemporary Database Environment

Click icon to add picture

19
Components of a DBMS
• Modeling language to define schema of each database hosted
according to the data model
• Data structures optimized to deal with huge data stored on a
permanent data storage device
• Query language and report writer
• Transaction mechanism to ensure data integrity

20
Functions of a DBMS
• Manages the data dictionary, or the definition and structure of
data
• Manages data storage
• Responsible for data transformation & presentation
• Manages data integrity & security
• Controls multi-user access
• Manages backup & recovery
• Manages database language & API (source code)
• Manages database communication interface

21
Importance of a DBMS
• DBMS facilitate effective & efficient data management

• DBMS facilitate access to large, organized data

• DBMS promote integrated view of an organization’s operations

• DBMS reduce chance of data inconsistency

22

You might also like