Design Google Drive/Dropbox

The document outlines the design of a cloud storage service like Google Drive or Dropbox. It discusses requirements like file uploads, sharing, and synchronization. It describes the data storage needs of billions of users storing petabytes of data across multiple data centers. The document then describes the major components needed like the uploader service, metadata service, sync service, and clients.

Uploaded by

kirankaranth

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

24 views3 pages

Design Google Drive/Dropbox

Uploaded by

kirankaranth

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Design Google Drive/Dropbox

FR:
- Upload files/media/
- Upload limit per file
- Shareable
- User Auth
- CRUD operations on uploaded file
- Syncronization

NFR:
- Data availbility
- Data integrity
- Fast downloads

Data storage:
- 1B users, average 15GB/user
- 15PB*3 replication = 45PB
- Need CRUD - Need ACID for sync
- SQL for file storage and S3 for blob storage

Components:
- Uploader service
- Updates to storage in chunks
- Updates Metadata servicewith with what was uploaded - this would probably
be a gRPC call
- Receives individual chunk from client and uploads to Blob

- Metadataservice
- Tallks to metadata DB
- Gets input from clients about the chunk and metadata they have
- Gets input from Uploader service about the chunks and metadata uploaded
from client
- Offloads syncing to all devices with updated metadata via sync service
- Sync service
- Powered by a messages queue
- Communicate only the diff
- Push pull model for all types of documents
- If file, push the metadata change and client can pull what it doesn’t
have
- If large file pull the entire file.
- Metadata DB
- Replication service
- Offline replication of all shards in all zones
- Clients
- Store some metadata of state of file on that client
- List of all files
- Chunk info of each file
- Locations
- Versions
- Last updated time
- Has a chunker that does the actual chunking work
- Has and indexer to store which chunk goes where and re-create index when
there a change in local client - talks to sync component
- Sends data to indexer whenever there is a change in local client
- Inline deduplication to avoid storing same files in server
- Metadata partitioning for scale
- Caching of hot files

System Design
No ratings yet
System Design
56 pages
GCP ACE Notes
No ratings yet
GCP ACE Notes
127 pages
GCP Architecture
No ratings yet
GCP Architecture
5 pages
Unit - 4-Cloud
No ratings yet
Unit - 4-Cloud
122 pages
Design Netflix
No ratings yet
Design Netflix
10 pages
GMM1
No ratings yet
GMM1
120 pages
Memoire de Cost Ergh I La in
No ratings yet
Memoire de Cost Ergh I La in
172 pages
Intro To Google Cloud Platform
No ratings yet
Intro To Google Cloud Platform
86 pages
05 Storage and Database Services
No ratings yet
05 Storage and Database Services
74 pages
But Can You Really Run Your App On 2 Clouds at The Same Time
No ratings yet
But Can You Really Run Your App On 2 Clouds at The Same Time
60 pages
Question Bank Unit-3-4-5-Answers
No ratings yet
Question Bank Unit-3-4-5-Answers
69 pages
CS6065 CCA 3b Cloud Infrastructure
No ratings yet
CS6065 CCA 3b Cloud Infrastructure
25 pages
UNIT-5 Cloud Web Design Approaches
No ratings yet
UNIT-5 Cloud Web Design Approaches
41 pages
System Design
No ratings yet
System Design
56 pages
Lecture 5 Distributed Storage Systems
No ratings yet
Lecture 5 Distributed Storage Systems
26 pages
Deepseek 3fs Webinar Part1 f2dd6949
No ratings yet
Deepseek 3fs Webinar Part1 f2dd6949
24 pages
CLOUD Ia2
No ratings yet
CLOUD Ia2
17 pages
CC - Lecture 8-Final
No ratings yet
CC - Lecture 8-Final
51 pages
DS Exam Prep
No ratings yet
DS Exam Prep
23 pages
Seafile Cloud Storage Platform TEACHING
No ratings yet
Seafile Cloud Storage Platform TEACHING
31 pages
Sender Module Design Specification First Release
No ratings yet
Sender Module Design Specification First Release
8 pages
Unit 3 IOT Programming
No ratings yet
Unit 3 IOT Programming
18 pages
15 Gfs
No ratings yet
15 Gfs
40 pages
UNIT V Cloud Platforms in Industry
No ratings yet
UNIT V Cloud Platforms in Industry
10 pages
Ccomputing Madurya
No ratings yet
Ccomputing Madurya
20 pages
Educative System Design Part2
No ratings yet
Educative System Design Part2
37 pages
5 Designing Dropbox
No ratings yet
5 Designing Dropbox
15 pages
Solve Paper
No ratings yet
Solve Paper
13 pages
Designing Dropbox - Grokking The System Design Interview
No ratings yet
Designing Dropbox - Grokking The System Design Interview
15 pages
U2
No ratings yet
U2
18 pages
Lec 22
No ratings yet
Lec 22
29 pages
Analysis of Six Distributed File Systems: Benjamin Depardon, Gaël Le Mahec, Cyril Séguin
No ratings yet
Analysis of Six Distributed File Systems: Benjamin Depardon, Gaël Le Mahec, Cyril Séguin
45 pages
Cloud Unit-4-2
No ratings yet
Cloud Unit-4-2
32 pages
Cloud Computing Module-5
No ratings yet
Cloud Computing Module-5
5 pages
Unit-4 DFS-1
No ratings yet
Unit-4 DFS-1
9 pages
BFC Project
No ratings yet
BFC Project
18 pages
Rapid Application Development and Short-Time To The Market Low Latency Scalability High Availability Consistent View of The Data
No ratings yet
Rapid Application Development and Short-Time To The Market Low Latency Scalability High Availability Consistent View of The Data
21 pages
GCD - Entwicklertag Presentation PDF
No ratings yet
GCD - Entwicklertag Presentation PDF
24 pages
Cloud Computing Unit-4
No ratings yet
Cloud Computing Unit-4
5 pages
The Google File System: Alexandru Costan
No ratings yet
The Google File System: Alexandru Costan
38 pages
Design Dropbox - Google Drive. Overview
No ratings yet
Design Dropbox - Google Drive. Overview
9 pages
Report
No ratings yet
Report
11 pages
Cloud Computing and Google Cloud Platform
No ratings yet
Cloud Computing and Google Cloud Platform
6 pages
Google App Engine and Google File System
No ratings yet
Google App Engine and Google File System
5 pages
CHPT 4 Ques
No ratings yet
CHPT 4 Ques
5 pages
GCP Overview
No ratings yet
GCP Overview
5 pages
Netflix Uber
No ratings yet
Netflix Uber
3 pages
Notes For OCI
No ratings yet
Notes For OCI
13 pages
FALLSEM2024-25 BCSE408L TH VL2024250101820 2024-10-22 Reference-Material-II
No ratings yet
FALLSEM2024-25 BCSE408L TH VL2024250101820 2024-10-22 Reference-Material-II
7 pages
System Design
No ratings yet
System Design
8 pages
Vineet Gupta - GM - Software Engineering - Directi: Intelligent People. Uncommon Ideas
No ratings yet
Vineet Gupta - GM - Software Engineering - Directi: Intelligent People. Uncommon Ideas
73 pages
Iot Unit5
No ratings yet
Iot Unit5
3 pages
Core Services
No ratings yet
Core Services
5 pages
Probo
No ratings yet
Probo
2 pages
ICS 408 Exam A
No ratings yet
ICS 408 Exam A
5 pages
Sys Design
No ratings yet
Sys Design
3 pages
5 Designing Dropbox - Grokking The System Design Interview
No ratings yet
5 Designing Dropbox - Grokking The System Design Interview
10 pages
Chapter 20 CDN
No ratings yet
Chapter 20 CDN
9 pages
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
From Everand
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
Wei Liu
No ratings yet
Kafka Up and Running for Network DevOps: Set Your Network Data in Motion
From Everand
Kafka Up and Running for Network DevOps: Set Your Network Data in Motion
Eric Chou
No ratings yet

Design Google Drive/Dropbox

Uploaded by

Design Google Drive/Dropbox

Uploaded by

Design Google Drive/Dropbox

You might also like