0% found this document useful (0 votes)
104 views22 pages

Unit 1

This document provides an introduction to information storage and management. It defines key concepts like data, structured and unstructured data, and big data. It describes the evolution of storage architecture from server-centric to information-centric. It also outlines the core elements of a data center including applications, database management systems, hosts/compute, networks and storage. It discusses characteristics of data centers like availability, security and scalability.

Uploaded by

abi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
0% found this document useful (0 votes)
104 views22 pages

Unit 1

This document provides an introduction to information storage and management. It defines key concepts like data, structured and unstructured data, and big data. It describes the evolution of storage architecture from server-centric to information-centric. It also outlines the core elements of a data center including applications, database management systems, hosts/compute, networks and storage. It discusses characteristics of data centers like availability, security and scalability.

Uploaded by

abi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
You are on page 1/ 22

18CSE360T-INFORMATION STORAGE

MANAGEMENT

ABINAYA.G/CSE-SRMIST

Module 1: Introduction to Information Storage 1


Introduction to Information Storage

• Define data and information


• Describe types of data
• Describe the evolution of storage architecture
• Describe the core elements of a data center
• List the key characteristics of data center
• Provide an overview of virtualization and cloud computing

2
Why Information Storage and Management?
• Information is the knowledge derived from data
• Growth of digital information has resulted in information
explosion
• We live in an on-command, on-demand world
 We need information when and where required
• Increasing dependency on fast and reliable access to information
• Businesses seek to store, protect, optimize, and leverage the
information
 To gain competitive advantage
 To derive new business opportunity

3
What is Data?
Data

It is a collection of raw facts from which conclusions may be drawn.

• Data is converted into more


convenient form − digital data Digital Movie
Movie
• Factors for digital data growth 10101011010
00010101011
are: Photo
Digital Photo
01010101010
10101011010
 Increase in data-processing
00010101011
capabilities Book
e-Book
01010101010
10101010101
 Lower cost of digital storage
01010101010
email
 Affordable and faster Letter 10101010101

communication technology Digital Data

 Proliferation of applications and


smart devices

4
Types of Data
• Data can be classified as:
 Structured
 Unstructured
email Attachments PDFs
• Majority of data being Unstructured (90%)
X-rays
created is unstructured
Manuals Instant Messages
Images Documents

Forms
Web Pages
Contracts Rich Media

Invoices

Audio, Video
Structured (10%)
Database

5
Big Data
Big Data
It refers to data sets whose sizes are beyond the ability of commonly used
software tools to capture, store, manage, and process within acceptable
time limits.

• Includes both structured and unstructured data generated by


variety of sources
• Big data analysis in real time requires new techniques and tools
that provide:
 High performance
 Massively parallel processing (MPP) data platforms
 Advanced analytics
• Big data analytics provide an opportunity to translate large
volumes of data into right decisions

6
Storage
• Stores data created by individuals and organizations
 Provides access to data for further processing
• Examples of storage devices are:
 Media card in a cell phone or digital camera
 DVDs, CD-ROMs
 Disk drives
 Disk arrays
 Tapes

7
Evolution of Storage Architecture

Department 1 Department 2 Department 3


Server Server Server

Department 1 Department 2 Department 3


Server Server Server

Storage
Network

Server-centric Storage Architecture

Storage Device
Information-centric Storage Architecture

8
Data Center
Data Center

It is a facility that contains storage, compute, network, and other IT


resources to provide centralized data-processing capabilities.

• Core elements of a data center


 Application
 Database management system (DBMS)
 Host or Compute
 Network
 Storage
• These core elements work together to address data-processing
requirements

9
Data Center: Online Order Transaction System Example

Host/ Storage Array


Compute

Client

Storage
LAN/WAN Network

User
Interface OS and DBMS

10
Key Characteristics of a Data Center

Availability

Data Integrity Security

Manageability

Performance Capacity

Scalability

11
Managing Data Center
• Key management activities include
 Monitoring
 Continuous process of gathering information on various elements
and services running in a data center
 Reporting
 Details on resource performance, capacity, and utilization
 Provisioning
 Configuration and allocation of resources to meet the capacity,
availability, performance, and security requirements
• Virtualization and cloud computing have changed the way data
center infrastructure resources are provisioned and managed

12
Virtualization: An Overview
• Virtualization is a technique of abstracting physical resources and
making them appear as logical resources
 For example partitioning of raw disks
• Pools physical resources and provides an aggregated view of
physical resource capabilities
• Virtual resources can be created from pooled physical resources
 Improves utilization of physical IT resources

13
Cloud Computing: An Overview
• Enables individuals and organizations to use IT resources as a
service over network
• Enables self-service requesting and automates request-
fulfillment process
 Enables users to scale up or scale down the usage of computing
resources quickly
• Enables consumption-based metering
 Consumers pay only for the resources they use
 Example: CPU hours used, amount of data transferred, and
Gigabytes of data stored

14
Key Challenges in Managing Information
• Exploding digital universe
 Multifold increase of information growth

• Increasing dependency on information


 The strategic use of information plays

• Changing value of information


 Information that is valuable today may become less important
tomorrow

15
Some Constraints to Meeting the Requirements
Constraints include:
• Cost
• Physical environment
• Maintenance and support
• Compliance – regulatory and legal
• Hardware and software infrastructure
• Interoperability and compatibility

16
Data Center Environment –Application
• An application is a computer program that provides the logic for
computing operations
• The application sends requests to the underlying operating
system to perform read/write (R/W) operations on the storage
devices
• Some examples of these applications are e-mail, enterprise
resource planning (ERP), decision support system (DSS), resource
management, backup, authentication and antivirus applications,
and so on.

17
Database Management System (DBMS)

• Database Management System, is a complex set of software


program that controls the organization, storage,  management,
and retrieval of data from a database

• Software for creating and maintaining databases


• Acts as interface between application programs and physical
data files

• Separates logical and design views of data


• Solves many problems of the traditional data file approach

18
The Contemporary Database Environment

Click icon to add picture

19
Components of a DBMS
• Modeling language to define schema of each database hosted
according to the data model
• Data structures optimized to deal with huge data stored on a
permanent data storage device
• Query language and report writer
• Transaction mechanism to ensure data integrity

20
Functions of a DBMS
• Manages the data dictionary, or the definition and structure of
data
• Manages data storage
• Responsible for data transformation & presentation
• Manages data integrity & security
• Controls multi-user access
• Manages backup & recovery
• Manages database language & API (source code)
• Manages database communication interface

21
Importance of a DBMS
• DBMS facilitate effective & efficient data management

• DBMS facilitate access to large, organized data

• DBMS promote integrated view of an organization’s operations

• DBMS reduce chance of data inconsistency

22

You might also like