BFC Project
BFC Project
Project By : Arun K
Roll No . 07
M.Tech – M2 CSE
09-May-2019
Contents
• Introduction
• Current system limitations
• An Efficient Storage System
Introduction
• Cloud storage systems are widely in use nowadays which allow
common people to store large files in distributed storage systems.
• Many companies are providing such facilities, they include Google
Drive, ZingMe, DropBox etc.
Limitations
• Efficiently storing, retrieving and managing big files in the system
• Data duplication which incur wastage of storage space which is due to storing same static
data from different users.
• A lightweight metadata design for big file. Every file has approximately same
size of metadata
• The file is uploaded in chunks so that parallel and resumable upload is
possible.
• Before uploading, the MD5 value of the file is calculated by the client. The
information is then sent to the server.
An Efficient Storage System
• The server compares the value with other
MD5 values of files it has.
• If a match is found, the file will not be
saved to server space. Only a reference is
kept.
Activities
• Uploading a file
• All the chunks which are generated from a file, except the last chunk
will have the same size (except the last chunk which may have an
equal or small size).
• After that, the ID generator will generate id for the file and the first
chunk with auto-increment mechanism.
Activities
• Data deduplication