0% found this document useful (0 votes)

29 views5 pages

Searchstorage Techtarget Com Definition Data-Deduplication

Data deduplication is a method of reducing storage needs by eliminating redundant data. Only a single copy of duplicate data is retained, while other instances reference the unique copy. This can significantly reduce storage demands. For example, 100 instances of a 1MB file could be reduced to just 1MB. Data deduplication offers benefits like lower storage costs, longer retention periods, and reduced backup data. It can operate at the file or block level, with block level generally being more efficient. Potential issues include hash collisions where the same hash is generated for different data.

Uploaded by

Brijesh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views5 pages

Searchstorage Techtarget Com Definition Data-Deduplication

Uploaded by

Brijesh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

9 Se archSt o rage

g
Ho m e > Ente rprise sto rage , planning and m anage m e nt > > data de duplicatio n (Inte llige nt co m pre ssio n o r single -
instance sto rage ) de finitio n

Da t a De d u p l i c a t i o n ( I n t e l l i g e n t
Co m p r e s s i o n O r Si n g l e -I n s t a n c e
St o r a g e ) De f i n i t i o n
P oste d by Margare t Rouse
WhatIs.co m

c s o n
2 C on tribu tor(s): S te ph e n J . B ig e low, J e ff Ha wkin s

Data deduplication (often called "intelligent compression" or "single-instance storage") is a

method of reducing storage needs by eliminating redundant data. Only one unique instance of
the data is actually retained on storage media, such as disk or tape. Redundant data is
replaced with a pointer to the unique data copy. For example, a typical email system might
contain 100 instances of the same one megabyte (MB) file attachment. If the email platform is
backed up or archived, all 100 instances are saved, requiring 100 MB storage space. With data
deduplication, only one instance of the attachment is actually stored; each subsequent
instance is just referenced back to the one saved copy. In this example, a 100 MB storage
demand could be reduced to only one MB.

2 0 1 5 P lanning : T he T o p 1 0 D at a St o rag e
D e f init io ns Yo u N e e d T o K no w

Whether you’re a seasoned IT expert or a relative newcomer, the jargon surrounding data
storage technologies can be overwhelming. Before you finalize your 2015 planning, refer
to this Special Report to find out the top 10 most important storage terms you need to
know today.
E- mail
Ad d r e s s :

D o wn l o a d N o w

B y sub mittin g yo ur p e rso n al in fo rmatio n , yo u ag re e to re ce ive e mails re g ard in g re le van t p ro d ucts

an d sp e cial o ffe rs fro m Te ch Targ e t an d its p artn e rs. Yo u also ag re e th at yo ur p e rso n al in fo rmatio n
may b e tran sfe rre d an d p ro ce sse d in th e U n ite d S tate s, an d th at yo u h ave re ad an d ag re e to th e
Te rms o f U se an d th e Privacy Po licy.

Data deduplication offers other benefits. Lower storage space requirements will save money
on disk expenditures. The more efficient use of disk space also allows for longer disk
retention periods, which provides better recovery time objectives (RTO) for a longer time and
reduces the need for tape backups. Data deduplication also reduces the data that must be
sent across a WAN for remote backups, replication, and disaster recovery.

Data deduplication can generally operate at the file or block level. File deduplication eliminates
duplicate files (as in the example above), but this is not a very efficient means of
deduplication. Block deduplication looks within a file and saves unique iterations of each block.
Each chunk of data is processed using a hash algorithm such as MD5 or SHA-1. This process
generates a unique number for each piece which is then stored in an index. If a file is updated,
only the changed data is saved. That is, if only a few bytes of a document or presentation are
changed, only the changed blocks are saved; the changes don't constitute an entirely new
file. This behavior makes block deduplication far more efficient. However, block deduplication
takes more processing power and uses a much larger index to track the individual pieces.

Hash collisions are a potential problem with deduplication. When a piece of data receives a
hash number, that number is then compared with the index of other existing hash numbers. If
that hash number is already in the index, the piece of data is considered a duplicate and does
not need to be stored again. Otherwise the new hash number is added to the index and the
new data is stored. In rare cases, the hash algorithm may produce the same hash number for
two different chunks of data. When a hash collision occurs, the system won't store the new
data because it sees that its hash number already exists in the index.. This is called a false
positive, and can result in data loss. Some vendors combine hash algorithms to reduce the
possibility of a hash collision. Some vendors are also examining metadata to identify data and
prevent collisions.

In actual practice, data deduplication is often used in conjunction with other forms of data
reduction such as conventional compression and delta differencing. Taken together, these
three techniques can be very effective at optimizing the use of storage space.

m CC oo nmtpi nr ue es sRi oe na dOi nr gS iAnbgol eu-tInDsattaanDc ee dSutpolri ac ga tei)oDn e( In t e llig e nt

f init io n
DRAM (d ynamic r and o m has hing

2 ac c e s s me mo r y)

2
s t o r ag e hyp e r vis o r s alt

2 2
s t o r ag e r e s o ur c e Re mo t e b ac kup s e r vic e s

2 manag e me nt (S RM)

2 FAQ

z 0 c o mme nt s O ld e s t 5

Share yo ur co mme nt

Send me notifications when other members comment.

Re gis t e r o r Lo gin
E- Mail

[email protected]

Us e r name / P as s wo r d

Username

Password

Comment

By subm itting yo u agre e to re ce ive e m ail fro m Te chTarge t and its partne rs. If yo u re side o utside o f the Unite d State s,
yo u co nse nt to having yo ur pe rso nal data transfe rre d to and pro ce sse d in the Unite d State s. Privacy

-ADS B Y G OOG LE

SOLID STATE STORAGE VIRTUAL STORAGE CLOUD STORAGE DISASTER RECOVERY DATA BACKUP

5
Se a rc h So lid St a t e St o ra ge

Dat r ium DVX s e r ve r - s id e f las h s t o r ag e c o me s o ut o f s t e alt h

Datrium built its DVX software and NetShelf system for customers that want to use low-cost flash in servers
to separately scale ...

Valle y He alt h S ys t e m p r e s c r ib e s all- f las h Vio lin s t o r ag e

Valley Health System is shifting to an all-flash storage model to improve the performance of its medical
records app and data ...

Abo ut Us Co ntact Us Privacy Po licy Vide o s Pho to Sto rie s

Guide s

Adve rtise rs Busine ss Partne rs Me dia Kit Co rpo rate Site Expe rts

Re prints Archive Site Map Eve nts E-Pro ducts

All Rights Re se rve d,
Co pyright 2000 - 2015, Te chTarge t

BIM-Based Collaborative Building Process Management
No ratings yet
BIM-Based Collaborative Building Process Management
192 pages
THE STEP BY STEP GUIDE FOR SUCCESSFUL IMPLEMENTATION OF DATA LAKE-LAKEHOUSE-DATA WAREHOUSE: "THE STEP BY STEP GUIDE FOR SUCCESSFUL IMPLEMENTATION OF DATA LAKE-LAKEHOUSE-DATA WAREHOUSE"
From Everand
THE STEP BY STEP GUIDE FOR SUCCESSFUL IMPLEMENTATION OF DATA LAKE-LAKEHOUSE-DATA WAREHOUSE: "THE STEP BY STEP GUIDE FOR SUCCESSFUL IMPLEMENTATION OF DATA LAKE-LAKEHOUSE-DATA WAREHOUSE"
AJIT DASH
2/5 (2)
BIG DATA 1 Unit
100% (1)
BIG DATA 1 Unit
17 pages
IBM Storage Discover Level 2 Quiz
100% (1)
IBM Storage Discover Level 2 Quiz
13 pages
Cs8592-Object Oriented Analysis and Design
No ratings yet
Cs8592-Object Oriented Analysis and Design
8 pages
Chapter - 06 Financial Accounting and Accounting Standard
100% (1)
Chapter - 06 Financial Accounting and Accounting Standard
33 pages
A Study On Data Deduplication Techniques For Optimized Storage
No ratings yet
A Study On Data Deduplication Techniques For Optimized Storage
7 pages
ExaGrid Systems Straight Talk About Disk Backup With Deduplication
100% (1)
ExaGrid Systems Straight Talk About Disk Backup With Deduplication
34 pages
Notes Ism Unit 1
No ratings yet
Notes Ism Unit 1
25 pages
Overview of Storage in Windows Server 2016
No ratings yet
Overview of Storage in Windows Server 2016
49 pages
Quiz SVC L2 Attempt Review PDF
100% (2)
Quiz SVC L2 Attempt Review PDF
11 pages
DATA Archival
0% (1)
DATA Archival
42 pages
Data Storage and AI
No ratings yet
Data Storage and AI
29 pages
Quantitative Test Bank Chapter 9
No ratings yet
Quantitative Test Bank Chapter 9
67 pages
Bosch Protocol Technical Information: en Application Note
No ratings yet
Bosch Protocol Technical Information: en Application Note
10 pages
Group Assignment Logistic and Supply Chain Management ALS2023
No ratings yet
Group Assignment Logistic and Supply Chain Management ALS2023
29 pages
SLP Comand
No ratings yet
SLP Comand
14 pages
Netbackup Firewall Requirements
No ratings yet
Netbackup Firewall Requirements
10 pages
Storage Networking Unit Wise Notes
No ratings yet
Storage Networking Unit Wise Notes
164 pages
Infoscale Presentation
0% (1)
Infoscale Presentation
35 pages
Netbackup Command 23
No ratings yet
Netbackup Command 23
18 pages
Netbackup Command 23
No ratings yet
Netbackup Command 23
18 pages
Oracle: Protect Your Data
From Everand
Oracle: Protect Your Data
Floribert TCHOKO
No ratings yet
SAN M32 New
No ratings yet
SAN M32 New
42 pages
Storage Lect III
No ratings yet
Storage Lect III
13 pages
Intro - Types of Machine Learning
No ratings yet
Intro - Types of Machine Learning
24 pages
Data Storage and AI
No ratings yet
Data Storage and AI
29 pages
Data-Archiving Definition PDF
No ratings yet
Data-Archiving Definition PDF
9 pages
Wipro RPS Assignment Day 6
No ratings yet
Wipro RPS Assignment Day 6
15 pages
Day 3 Storage
No ratings yet
Day 3 Storage
62 pages
Cambridge - English Vocabulary in Use (Pre-Intermediate & Intermediate) (1997) PDF
No ratings yet
Cambridge - English Vocabulary in Use (Pre-Intermediate & Intermediate) (1997) PDF
269 pages
Storage Management System Using Block Level Deduplication Technique in Cloud Computing
No ratings yet
Storage Management System Using Block Level Deduplication Technique in Cloud Computing
5 pages
4 2008 Snia
No ratings yet
4 2008 Snia
26 pages
DAY 6 Data Deduplication Windows Server 2019
No ratings yet
DAY 6 Data Deduplication Windows Server 2019
9 pages
Data Deduplication
No ratings yet
Data Deduplication
11 pages
Data Archiving
No ratings yet
Data Archiving
21 pages
IDC WP Data-Deduplication
No ratings yet
IDC WP Data-Deduplication
16 pages
Amol PCX - Report
No ratings yet
Amol PCX - Report
15 pages
#CH-2 2 4
No ratings yet
#CH-2 2 4
15 pages
Overview of Storage in Windows Server 2016
No ratings yet
Overview of Storage in Windows Server 2016
49 pages
Backup & Physical Security (HW Security)
No ratings yet
Backup & Physical Security (HW Security)
44 pages
AFF FAS8300 and FAS8700 Install and Setup
No ratings yet
AFF FAS8300 and FAS8700 Install and Setup
13 pages
h17072 Data Reduction With Dell Emc Powermax
No ratings yet
h17072 Data Reduction With Dell Emc Powermax
19 pages
AST-0012427 DCIG-Symantec Deduplication August 2009 Final
No ratings yet
AST-0012427 DCIG-Symantec Deduplication August 2009 Final
10 pages
Dell Course Storage Consolidation
No ratings yet
Dell Course Storage Consolidation
7 pages
BestPractice Ebook Storage (OK)
No ratings yet
BestPractice Ebook Storage (OK)
8 pages
A Study On Data Deduplication in HPC Storage Systems.
No ratings yet
A Study On Data Deduplication in HPC Storage Systems.
11 pages
SA Change Management Procedures
No ratings yet
SA Change Management Procedures
10 pages
Valuejet 1604
No ratings yet
Valuejet 1604
456 pages
EScholarship UC Item 9qn752v6
No ratings yet
EScholarship UC Item 9qn752v6
11 pages
Data Deduplication
No ratings yet
Data Deduplication
8 pages
Evaluating Deduplication Solutions?: What You Really Should Consider
No ratings yet
Evaluating Deduplication Solutions?: What You Really Should Consider
9 pages
Bengal College of Engineering and Technology: Report On Storage Strategies
No ratings yet
Bengal College of Engineering and Technology: Report On Storage Strategies
15 pages
Understanding Data Deduplication
No ratings yet
Understanding Data Deduplication
4 pages
Deduplication School
No ratings yet
Deduplication School
61 pages
OneFS SmartDedupe PDF
No ratings yet
OneFS SmartDedupe PDF
17 pages
HP Simply Storageworks: Introduction To Storage Technologies
No ratings yet
HP Simply Storageworks: Introduction To Storage Technologies
16 pages
Data Deduplication For Dummies: Submitted by 1.shashank Shekhar (11609052) 2.manisha (11609026)
No ratings yet
Data Deduplication For Dummies: Submitted by 1.shashank Shekhar (11609052) 2.manisha (11609026)
22 pages
Gene Nagle Adv Data Reduction Concepts 09-25-13
No ratings yet
Gene Nagle Adv Data Reduction Concepts 09-25-13
23 pages
LPIC-3 Exam 306-300 Mastery: 500 Practice Questions on High Availability & Storage Clusters
From Everand
LPIC-3 Exam 306-300 Mastery: 500 Practice Questions on High Availability & Storage Clusters
Steve Brown
No ratings yet
SAQA - 115431 - Learner Guide
No ratings yet
SAQA - 115431 - Learner Guide
21 pages
Storage Overview and Architecture: Arvind Shrivastava
No ratings yet
Storage Overview and Architecture: Arvind Shrivastava
57 pages
Create A Subclient - CommVault Step by Step Tutorial - Backup Master
No ratings yet
Create A Subclient - CommVault Step by Step Tutorial - Backup Master
27 pages
Unit 2 and 3 (2 Part)
No ratings yet
Unit 2 and 3 (2 Part)
9 pages
Is Storage Management Overload Making IT Less Relevant?
No ratings yet
Is Storage Management Overload Making IT Less Relevant?
3 pages
Apple Storage Tech For Small Business
No ratings yet
Apple Storage Tech For Small Business
12 pages
How Storage Helps Reduce The Total Cost of Ownership For Mysap Solutions
No ratings yet
How Storage Helps Reduce The Total Cost of Ownership For Mysap Solutions
3 pages
Netapp Storage Efficiency: Author Srisuba Selvachamy
No ratings yet
Netapp Storage Efficiency: Author Srisuba Selvachamy
45 pages
MSDP
No ratings yet
MSDP
11 pages
Protected Steadfast Deduplication in Crossbreed Cloud Technique
No ratings yet
Protected Steadfast Deduplication in Crossbreed Cloud Technique
5 pages
Beginner's Guide for Cybercrime Investigators
From Everand
Beginner's Guide for Cybercrime Investigators
Nicolae Sfetcu
5/5 (1)
An Efficient Framework and Techniques of Data Deduplication in Data Center
No ratings yet
An Efficient Framework and Techniques of Data Deduplication in Data Center
5 pages
Techno Various Fonts Dafont - Com 3
No ratings yet
Techno Various Fonts Dafont - Com 3
1 page
Ii:Rrftsmrn'I: Owner's Manual
No ratings yet
Ii:Rrftsmrn'I: Owner's Manual
8 pages
File Sharing and Data Duplication Removal in Cloud Using File Checksum
No ratings yet
File Sharing and Data Duplication Removal in Cloud Using File Checksum
3 pages
Tarsnap Mastery: IT Mastery, #6
From Everand
Tarsnap Mastery: IT Mastery, #6
Michael W. Lucas
No ratings yet
Learn Hadoop in 24 Hours
From Everand
Learn Hadoop in 24 Hours
Alex Nordeen
No ratings yet
Office Manager - Admin Assistant Job Description1
No ratings yet
Office Manager - Admin Assistant Job Description1
2 pages
Idap 2019 8875953
No ratings yet
Idap 2019 8875953
6 pages
PLSQL and SQL Coding Guidelines
No ratings yet
PLSQL and SQL Coding Guidelines
196 pages
Storage Technologies Steven Singletary CIS 512: Enterprise Architechture Professor Wei Huang October 31, 2011
No ratings yet
Storage Technologies Steven Singletary CIS 512: Enterprise Architechture Professor Wei Huang October 31, 2011
5 pages
File Converter - Requirements
No ratings yet
File Converter - Requirements
6 pages
Gettingstartedwithmech Mindvisionsystem
No ratings yet
Gettingstartedwithmech Mindvisionsystem
34 pages
Lab 12 (1) Zoom
No ratings yet
Lab 12 (1) Zoom
5 pages
Heba Compiler Design Book - 2025
No ratings yet
Heba Compiler Design Book - 2025
133 pages
TYBSC (CS) - CS-3511 Blockchain Technology
No ratings yet
TYBSC (CS) - CS-3511 Blockchain Technology
2 pages
Introduction To Programming
No ratings yet
Introduction To Programming
11 pages
Opera Compatibility Matrix
No ratings yet
Opera Compatibility Matrix
2 pages
Apihackingin 90 Minutes 1660919248744
No ratings yet
Apihackingin 90 Minutes 1660919248744
51 pages
Assignment 01 Logika Matematika
No ratings yet
Assignment 01 Logika Matematika
14 pages
Octnov 23
No ratings yet
Octnov 23
3 pages
Contoh Resume Jurnal Pendidikan
No ratings yet
Contoh Resume Jurnal Pendidikan
4 pages
Data Compression: Unlocking Efficiency in Computer Vision with Data Compression
From Everand
Data Compression: Unlocking Efficiency in Computer Vision with Data Compression
Fouad Sabry
No ratings yet
Audio Visual Speech Recognition: Advancements, Applications, and Insights
From Everand
Audio Visual Speech Recognition: Advancements, Applications, and Insights
Fouad Sabry
No ratings yet
Data Mining: Fundamentals and Applications
From Everand
Data Mining: Fundamentals and Applications
Fouad Sabry
No ratings yet
Human Visual System Model: Understanding Perception and Processing
From Everand
Human Visual System Model: Understanding Perception and Processing
Fouad Sabry
No ratings yet
ISO 7886-3 2005 (E) - Character PDF Document
No ratings yet
ISO 7886-3 2005 (E) - Character PDF Document
4 pages

Searchstorage Techtarget Com Definition Data-Deduplication

Uploaded by

Searchstorage Techtarget Com Definition Data-Deduplication

Uploaded by

9 Se archSt o rage

Data deduplication (often called "intelligent compression" or "single-instance storage") is a

B y sub mittin g yo ur p e rso n al in fo rmatio n , yo u ag re e to re ce ive e mails re g ard in g re le van t p ro d ucts

m CC oo nmtpi nr ue es sRi oe na dOi nr gS iAnbgol eu-tInDsattaanDc ee dSutpolri ac ga tei)oDn e( In t e llig e nt

Send me notifications when other members comment.

Dat r ium DVX s e r ve r - s id e f las h s t o r ag e c o me s o ut o f s t e alt h

Valle y He alt h S ys t e m p r e s c r ib e s all- f las h Vio lin s t o r ag e

Abo ut Us Co ntact Us Privacy Po licy Vide o s Pho to Sto rie s

Re prints Archive Site Map Eve nts E-Pro ducts

You might also like