0% found this document useful (0 votes)

41 views31 pages

Seafile Cloud Storage Platform TEACHING

The document summarizes Seafile, an open source scalable cloud storage system. It describes Seafile's features like fast file syncing between devices, scalability to large storage capacities, and collaborative features. The system design uses a lightweight database and object storage for metadata and file contents. This allows Seafile to scale horizontally and provide high performance. The document also outlines Seafile's roadmap including file locking and improved authorization.

Uploaded by

user.22x6

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

41 views31 pages

Seafile Cloud Storage Platform TEACHING

Uploaded by

user.22x6

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 31

Seafile - Scalable Cloud Storage System

Johnathan Xu
Seafile Ltd.
Agenda
Seafile Introduction
Feature Overview
System Design & Performance
Roadmap
What is Seafile?

Seafile is a FAST, SCALABLE, and PRIVATE

file sync & share solution
What can Seafile do?
• Fast and reliable file sync between cloud and
devices
• Scales to millions of files, PB class storage
• High performance, light weight
• Productive file collaboration
– Groups
– File prview, discussion
– Message and notification
Who are using Seafile?
• https://github.com/haiwen/seafile
2400+ stars
• Estimated at least 100K users worldwide, most
in Europe

Universities in belgian royal institute of

Rhineland-Palatine (Germany) natural sciences
Agenda
Seafile Introduction
Feature Overview
System Design & Performance
Roadmap
File Sync and Share
• Files are organized into Libraries
• Selective sync library to devices
• Sync with existing folder
• Client-side end-to-end data encryption
• Full platform support: Win, OSX, Linux, mobile
• Share to a person or a group
• Read-write and read-only share
• LDAP/AD integration
View all your libraries in the home page
All libraries shared to a group
Desktop Client

Selective sync library

Cloud file browser
Starred files
Notifications
Desktop Client
Collaboration
• File activities
• Group discussion
• File discussion
• Message notifications
File Activities
Message Notifications
Agenda
Seafile Introduction
Feature Overview
System Design & Performance
Roadmap
Server Architecture

Seafile is a “file system” built on top of object storage

Non-POSIX, User space, Light weight

File System Design
Head Commit ID

Relational DB

SHA-1 ID

Object Storage

Data model similar to Git

Design Advantage
• Object storage is more scalable than file system
– Heavy DB + Filesystem v.s. Light DB + Object Storage
• No database bottleneck
– Metadata is in object storage
– Filesystem level versioning v.s. File-level versioning
• File system designed for syncing
– Storage/Network deduplication
– No upload/download limit, fast upload
• Backend daemons implemented in C
Deduplication
Dedup with Content Defined Chunking (CDC) algorithm
Only store/send delta between file system snapshots

Back link
Commit 1 Commit 2

Dir Dir

File 1 v1 File 2 File 1 v2

Block 1 Block 2 Block 3 Block 4 Block 5

Cluster Architecture

MySQL cluster
Load Balancer
Ceph/Swift/S3
Seafile Servers

• Seafile server is stateless, scales horizontally

• Head commit ID and user-library mapping in MySQL cluster
• All data and metadata in object storage
Fast and Reliable File Syncing
• Detect file changes with OS mechanisms
• Low CPU usage on client and server side
• Sync 100K files easily and quickly
• No data transfer after rename/move
• Don’t send duplicate files. Delta dection.
• Handles conflicts
– Concurrent updates
– Case conflict: sync ABC.txt and abc.txt to Windows
• Never remove a file unless user does
Devil is in the details
How Syncing Works
2:Write objects
Relational DB Object Storage

3: Update head commit ID

after objects are saved

1: Client uploads commit,

dir, file, and block objects 4: Client download objects
and check out to folder

Almost looks like Git

Syncing Performance
• Keep version info for the whole fs tree
– Combine many file updates into 1 commit
– A few database writes for a few K files
• Results
– 1 core, 1GB memory VM server
– 40K small files, ~20 files/s upload and download;
single TCP connection; server CPU 2% - 5%
– Big file, ~8MB/s upload and download in 100bps
network; server CPU 50%
Agenda
Seafile Introduction
Feature Overview
System Design & Performance
Roadmap
Roadmap
• Sync & Share
– File locking for better collaboration
– Hierarchical access control within a library
• Auth integration
– OAuth
– Shibboleth
• Improve GUI responsibility with backbone.js
Conclusion
• Do one thing and do one thing well
– Reliablity
– Scalability Choose any three ;-)
– Performance
• Lightweight DB + Object Storage
• Git like data model, no client-side history
• Syncing model similar to Git, redesigned for
auto syncing
Thanks！
File Syncing Algorithm
• Client data 3 stages: worktree, index, repo
– Worktree: user visible folder, one worktree per library
– Index file: last modification time of each file in worktree
– Repo: Internal representation of the latest fs tree for the
library. Only have delta blocks.

commit commit
worktree index repo
checkout checkout
File Syncing Algorithm

Sync State Machine

File Syncing Algorithm
• Upload
– Client creates new commit from batch of local changes
– Diff between local repo and the cached server fs tree
– After objects are uploaded, update server head commit ID
in database
– Server do merge on concurrent updates, resolve conflicts

Commit from client A

HEAD commit on server

Commit from client B

File Syncing Algorithm
• Version Check（init）
– Client caches server’s head commit ID
– Compare with server every 30s, if not the same trigger
download
• download
– Server calculate update list with diff
– Client download and apply the update to worktree
– Update cached server head commit ID

This Study Resource Was
33% (18)
This Study Resource Was
9 pages
PDF SSTT X Jamal Browner Deadlift Specialization Vol 2 Y6y6sg Compress
No ratings yet
PDF SSTT X Jamal Browner Deadlift Specialization Vol 2 Y6y6sg Compress
147 pages
Seafile Server Manual
100% (1)
Seafile Server Manual
339 pages
WINSEM2012-13 CP0029 06-Mar-2013 RM01 DFT 2
No ratings yet
WINSEM2012-13 CP0029 06-Mar-2013 RM01 DFT 2
46 pages
Distributed File Systems
No ratings yet
Distributed File Systems
42 pages
3distributed File System
No ratings yet
3distributed File System
42 pages
Repli
No ratings yet
Repli
38 pages
The Google File System: Kenneth Chiu
No ratings yet
The Google File System: Kenneth Chiu
40 pages
Distributed File Systems
No ratings yet
Distributed File Systems
18 pages
Andrew - Cmu.edu: Let's Start With A Familiar Example: Andrew 10,000s of People Terabytes of Disk
No ratings yet
Andrew - Cmu.edu: Let's Start With A Familiar Example: Andrew 10,000s of People Terabytes of Disk
7 pages
Distributed File Systems (DFS) : A Resource Management Component of A Distributed Operating System
No ratings yet
Distributed File Systems (DFS) : A Resource Management Component of A Distributed Operating System
16 pages
Educative System Design Part2
No ratings yet
Educative System Design Part2
37 pages
Distributed File Systems
No ratings yet
Distributed File Systems
28 pages
AFS Presentation
No ratings yet
AFS Presentation
36 pages
Other File Systems: LFS, NFS, and Afs
No ratings yet
Other File Systems: LFS, NFS, and Afs
37 pages
Designing Dropbox - Grokking The System Design Interview
No ratings yet
Designing Dropbox - Grokking The System Design Interview
15 pages
Presentation ON Distributed File System: Institute of Engineering and Technology Bundelkhand University
No ratings yet
Presentation ON Distributed File System: Institute of Engineering and Technology Bundelkhand University
51 pages
5 Designing Dropbox
No ratings yet
5 Designing Dropbox
15 pages
P2P File Sharing
No ratings yet
P2P File Sharing
43 pages
L8 DFS
No ratings yet
L8 DFS
35 pages
Distributed File System
No ratings yet
Distributed File System
68 pages
A Case Study On Different Applications and Security Issues in Distributed Systems
No ratings yet
A Case Study On Different Applications and Security Issues in Distributed Systems
10 pages
Unit-4 DFS-1
No ratings yet
Unit-4 DFS-1
9 pages
CC U3
No ratings yet
CC U3
40 pages
Seafile Presentation Slides
100% (1)
Seafile Presentation Slides
23 pages
GMM1
No ratings yet
GMM1
120 pages
Lecture 08
No ratings yet
Lecture 08
25 pages
Distributed File System Implementation
100% (1)
Distributed File System Implementation
30 pages
L6 DFS
No ratings yet
L6 DFS
27 pages
Design and Implementation of The Sun Network Filesystem: R. Sandberg, D. Goldberg S. Kleinman, D. Walsh, R. Lyon
No ratings yet
Design and Implementation of The Sun Network Filesystem: R. Sandberg, D. Goldberg S. Kleinman, D. Walsh, R. Lyon
34 pages
04 en Network File Systems
No ratings yet
04 en Network File Systems
57 pages
Alfresco 3 Records Management
From Everand
Alfresco 3 Records Management
Dick Weisinger
No ratings yet
Distributed-File Systems Background
No ratings yet
Distributed-File Systems Background
9 pages
Distributed File Systems
No ratings yet
Distributed File Systems
31 pages
Unit-3 Part1
No ratings yet
Unit-3 Part1
57 pages
Requirements For Distributed File Systems
No ratings yet
Requirements For Distributed File Systems
4 pages
DFS Design and Implementation: Brent R. Hafner
No ratings yet
DFS Design and Implementation: Brent R. Hafner
40 pages
DFS Design and Implementation
No ratings yet
DFS Design and Implementation
40 pages
Caching in Distributed File System: Ke Wang CS614 - Advanced System Apr 24, 2001
No ratings yet
Caching in Distributed File System: Ke Wang CS614 - Advanced System Apr 24, 2001
56 pages
Oschapter 8
No ratings yet
Oschapter 8
27 pages
XtreemFS-A Cloud File System
No ratings yet
XtreemFS-A Cloud File System
27 pages
CSCI319 Distributed Systems
No ratings yet
CSCI319 Distributed Systems
26 pages
Dynamo: Amazon'S Highly Available Key-Value Store: Csci 8101: Advanced Operating Systems Presented By: Chaithra KN
No ratings yet
Dynamo: Amazon'S Highly Available Key-Value Store: Csci 8101: Advanced Operating Systems Presented By: Chaithra KN
23 pages
Distributed File Systems & Name Services: UNIT-4
No ratings yet
Distributed File Systems & Name Services: UNIT-4
70 pages
NFS
No ratings yet
NFS
27 pages
Distributed File Systems: Arvind Krishnamurthy Spring 2001
No ratings yet
Distributed File Systems: Arvind Krishnamurthy Spring 2001
3 pages
Ccs 3ra Ia
No ratings yet
Ccs 3ra Ia
17 pages
GFS_Architecture M5 GFS_Architecture M5
No ratings yet
GFS_Architecture M5 GFS_Architecture M5
25 pages
Lecture 4.1 - Hadoop - MapReduce - Hbase
No ratings yet
Lecture 4.1 - Hadoop - MapReduce - Hbase
94 pages
Gfs Google File System 13331
No ratings yet
Gfs Google File System 13331
28 pages
SIT102 Lecture 8.2
No ratings yet
SIT102 Lecture 8.2
32 pages
Discrete Computing
No ratings yet
Discrete Computing
25 pages
Networked File System: CS 537 - Introduction To Operating Systems
No ratings yet
Networked File System: CS 537 - Introduction To Operating Systems
23 pages
UNIT5
No ratings yet
UNIT5
34 pages
@klwks - Bot Os Co-4 Ha-4
No ratings yet
@klwks - Bot Os Co-4 Ha-4
17 pages
He-Phan-Bo - Thoai-Nam - Distributedsystem - 16 - Fileservice - (Cuuduongthancong - Com)
No ratings yet
He-Phan-Bo - Thoai-Nam - Distributedsystem - 16 - Fileservice - (Cuuduongthancong - Com)
28 pages
Gytha John Harikrishnan Hridya S7Cse: Presented by
No ratings yet
Gytha John Harikrishnan Hridya S7Cse: Presented by
17 pages
Distributed File Systems
No ratings yet
Distributed File Systems
38 pages
Ds Part C Sanjana
No ratings yet
Ds Part C Sanjana
15 pages
Pond: The Oceanstore Prototype
No ratings yet
Pond: The Oceanstore Prototype
14 pages
DS Lecture 5
No ratings yet
DS Lecture 5
28 pages
Lecture 25: Distributed File Systems: Indranil Gupta (Indy)
No ratings yet
Lecture 25: Distributed File Systems: Indranil Gupta (Indy)
27 pages
Database Notes
No ratings yet
Database Notes
15 pages
Forensic Analysis in Cybersecurity
No ratings yet
Forensic Analysis in Cybersecurity
17 pages
Kofax Analytics For Capture
No ratings yet
Kofax Analytics For Capture
21 pages
Unit III - Storage Fundamentals
No ratings yet
Unit III - Storage Fundamentals
97 pages
Hellcat Pilot Instructions
No ratings yet
Hellcat Pilot Instructions
22 pages
Banking Management
100% (1)
Banking Management
17 pages
PySpark SQL Cheat Sheet Python
100% (2)
PySpark SQL Cheat Sheet Python
1 page
How To Download Exported Data File Using API - Cloud Customer Connect
No ratings yet
How To Download Exported Data File Using API - Cloud Customer Connect
8 pages
Mysql Assignment 1
No ratings yet
Mysql Assignment 1
2 pages
Database List
No ratings yet
Database List
3 pages
11g Rac To Rac Data Guard Setup
No ratings yet
11g Rac To Rac Data Guard Setup
7 pages
PL-SQL Assignment-2 - 101803108-Coe6
100% (1)
PL-SQL Assignment-2 - 101803108-Coe6
4 pages
Chapter 11: Mass-Storage Systems: Silberschatz, Galvin and Gagne ©2018 Operating System Concepts - 10 Edition
No ratings yet
Chapter 11: Mass-Storage Systems: Silberschatz, Galvin and Gagne ©2018 Operating System Concepts - 10 Edition
49 pages
Automated College Timetable Generator - S126
No ratings yet
Automated College Timetable Generator - S126
11 pages
ADF Code Corner: 106. Drag-And-Drop Reordering of Table Rows
No ratings yet
ADF Code Corner: 106. Drag-And-Drop Reordering of Table Rows
8 pages
F2FS
No ratings yet
F2FS
2 pages
Gr11 Nov Prac 2018 Final
No ratings yet
Gr11 Nov Prac 2018 Final
6 pages
Different Database Software Used in Bank
No ratings yet
Different Database Software Used in Bank
18 pages
Oracle RMAN Interview Questions
No ratings yet
Oracle RMAN Interview Questions
3 pages
Azure Services Overview
No ratings yet
Azure Services Overview
2 pages
SQL Programs For Elementary Practice
No ratings yet
SQL Programs For Elementary Practice
8 pages
SNIA Data Protection Best Practice 2025-01-27
No ratings yet
SNIA Data Protection Best Practice 2025-01-27
47 pages
Bugreport WDT 2024 12 01 17 09 43 - Log
No ratings yet
Bugreport WDT 2024 12 01 17 09 43 - Log
2 pages
IBM I 7.1 System MGMT - Performance Ref Info
No ratings yet
IBM I 7.1 System MGMT - Performance Ref Info
278 pages
Vaibhav Resume
No ratings yet
Vaibhav Resume
1 page
Jeremiah Curtin PDF
No ratings yet
Jeremiah Curtin PDF
599 pages
Unit 2 - Object and Object-Relational Databases
No ratings yet
Unit 2 - Object and Object-Relational Databases
86 pages
Chapter
No ratings yet
Chapter
33 pages

Seafile Cloud Storage Platform TEACHING

Uploaded by

Seafile Cloud Storage Platform TEACHING

Uploaded by

Seafile - Scalable Cloud Storage System

Seafile is a FAST, SCALABLE, and PRIVATE

Universities in belgian royal institute of

Selective sync library

Seafile is a “file system” built on top of object storage

Non-POSIX, User space, Light weight

Data model similar to Git

File 1 v1 File 2 File 1 v2

Block 1 Block 2 Block 3 Block 4 Block 5

• Seafile server is stateless, scales horizontally

3: Update head commit ID

1: Client uploads commit,

Almost looks like Git

Sync State Machine

Commit from client A

Commit from client B

You might also like