0% found this document useful (0 votes)

821 views10 pages

Understanding Advanced Data Compression: F5 White Paper

WAN optimization appliances store and use network data to achieve high compression ratios. How they achieve these gains, and the limitations of certain routines, vary widely. Many WAN optimization solutions are focused wholly on network-layer optimizations.

Uploaded by

Stroke Lovingly

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

821 views10 pages

Understanding Advanced Data Compression: F5 White Paper

Uploaded by

Stroke Lovingly

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

F5 White Paper

Understanding Advanced
Data Compression
Nearly all WAN optimization appliances store and use
previously transferred network data to achieve high
compression ratios, while leveraging advanced compression
routines to improve application performance. How they
achieve these gains, and the limitations of certain
routines, vary widely and can significantly impact the
improvements and benefits associated with WAN
application delivery services.

by Lori MacVittie
Senior Technical Marketing Manager, Application Services
White Paper
Understanding Advanced Data Compression

Contents
Inroduction 3

Implementation Approaches 3
Packets versus Sessions 3

Dictionary Size 4

Heaps’ Law 5

Zipf’s Law 5

Blocks versus Bytes 6

Static versus Adaptive Compression 7

Application versus Network 8

Does Throughput Matter? 9

Conclusion 9

2
White Paper
Understanding Advanced Data Compression

Inroduction “…within 12 months roughly

half of [168 enterprise IT
professionals who were
The increasingly distributed nature of users and the prevalence of teleworkers, surveyed] will be using WAN
optimization technology to
coupled with emerging application deployment models that leverage external
help them to successfully
cloud computing, introduce additional stress on existing network connections deliver applications to branch
in the form of more data being exchanged more often. Employee productivity offices. The technologies
that they will use include
can be dramatically impacted by slow networks that result in poorly performing
techniques such as compression,
applications. Business continuity plans—no matter how carefully thought out deduplication, caching, quality
and implemented—can go awry when backups fail to complete, take more time of service, and protocol
acceleration.”
than expected, and cause some applications to go unprotected.

Organizations have turned to WAN optimization as a way to combat the challenges Source: “Keys to Unlocking
IT Value Through WAN
of assuring application performance and help ensure timely transfer of large Optimization,” Dr. Jim Metzler
data sets across constrained network links. Many WAN optimization solutions
are focused wholly on network-layer optimizations and operate based on rigid
configurations. Not only are these solutions inflexible, but they also fail to
include optimizations that can further enhance the performance of applications
commonly delivered over WAN links.

Implementation Approaches
Packets versus Sessions
To date, most network compression systems have been packet-based. Packet-
based compression systems buffer packets destined for a remote network with a
decompressor. These packets are then compressed either one at a time or
as a group and then sent to the decompressor where the process is reversed
(See Figure 1). Packet-based compression has been available for many years and
can be found in routers, VPN clients, and Juniper Networks WX and WXC
application acceleration appliances.

Packet-based compression systems have additional problems. When compressing

packets, these systems must choose between writing small packets to the network
and performing additional work to aggregate and encapsulate multiple packets.
Neither option produces optimal results. Writing small packets to the network
increases TCP/IP header overhead, while aggregating and encapsulating packets
adds encapsulation headers to the stream.

3
White Paper
Understanding Advanced Data Compression

Packet
Compressor

Figure 1: Packet-based compression

Unlike previous compression solutions, the F5® BIG-IP® Local Traffic Manager™
(LTM) product with BIG-IP® WAN Optimization Module™ (WOM) operates at the
session layer (Figure 2). This enables BIG-IP LTM with the WAN Optimization Module
to apply compression across a completely homogenous data set while addressing
all application types, resulting in higher compression ratios than comparable
packet-based systems.

Session
Compressor

Figure 2: Session-based compression

Furthermore, by operating at the session layer, packet boundary and repacketization

problems are eliminated. Session layer compression enables a WOM-enabled BIG-IP
LTM device to easily find matches in data streams which at layer 3 might be many
bytes apart, but at layer 5 are contiguous. System throughput is also increased
when compression is performed at the session layer through the elimination of the
encapsulation stage.

Dictionary Size
One limitation all compression routines have in common is limited storage space.
Some routines, such as those used by GNUzip (gzip), store as little as 64 kilobytes
(KBs) of data. Others techniques, such as disk-based compression systems, can store

4
White Paper
Understanding Advanced Data Compression

as much as 1 terabyte of data. In order to understand the impact of dictionary size, Zipf’s and Heaps’ laws are
a basic understanding of cache management is required. linguistics-derived mathematical
equations used to predicting
the repetitiveness of a
Similar to requests to a website, not all bytes transferred on the network repeat with
vocabulary subset in a finite text.
the same frequency. Some byte patterns occur with great frequency because they Both laws are applicable outside
are part of a popular document or common network protocol. Other byte patterns linguistics to describe observed
patterns of repetitiveness in
occur only once and are never repeated again. The relationship between frequently data. Both are often used in data
repeating byte sequences and less frequently repeating ones is seen in both Zipf’s deduplication and compression
and Heaps’ laws. algorithms as aids to predict
and optimize the elimination of
repeating byte patterns.
Heaps’ Law
Heaps’ law states that the number of unique words V in a collection with N words
is approximately Sqrt[N]. A plot graph of data that exhibits Heaps’ Law will have a
slope of approximately 0.5.

Zipf’s Law
Zipf’s law provides a mathematical formula for determining the frequency
distribution of words in a language.

r = rank of a word

N = total number of words in the collection (not number of unique words

r * freq(r) = A * N

Zipf’s law states that the frequency of any word in a collection is inversely proportional
to its rank in the frequency table. The most frequent word will occur twice as often
as the second most frequent, and so on. A plot graph of data that exhibits Zipf’s law
will have a slope of -1.

All modern dictionary-based compression systems leverage uneven distribution

by storing more frequently accessed data and discarding less frequently accessed
data. Through this type of optimization, a dictionary that stores less than 10 percent
of all the byte patterns can achieve a hit ratio well in excess of 50 percent. The
effect of this uneven distribution of byte patterns is evident in the effectiveness
of common compression programs. For example, while gzip stores only 64KB of
history, it averages approximately 64 percent compression. However, bzip2 stores
between 100KB and 900KB of history and averages 66 percent compression. The

5
White Paper
Understanding Advanced Data Compression

reason gzip and bzip2 perform so well despite lacking a substantial data store is
that the most frequently occurring sequences of bytes represent the majority of
bytes on a network.

Blocks versus Bytes

Block-based systems, such as Juniper Networks WXC and Riverbed Technology’s
Steelhead appliances, store segments of previously transferred data flowing across
the WAN. When these blocks are encountered a second time, references to the blocks
are transmitted to the remote appliance, which then reconstructs the original data.

A critical shortcoming of block-based systems is that repetitive data almost never

is exactly the length of a block. As a result, matches are almost always only partial
matches, which leave some of the repetitive data uncompressed. Figure 3 illustrates
what happens when a system using a 256-byte block size attempts to compress
512 bytes of data.

512 Bytes of Network Data

392 Bytes of Previously Transferred Data

256 Bytes Cached Block 256 Bytes Cached Block

1 Block Matched = 256 Bytes Saved

Figure 3: Block-based data reduction

Similar to Riverbed and Juniper’s approach of using previously transferred data to

reduce network utilization, BIG-IP LTM with WOM builds a dictionary of previously
transferred bytes using the F5 Transparent Data Reduction™ (TDR) feature. Unlike
the WXC and Steelhead appliances, though, BIG-IP LTM with WOM matches and
sends references with byte-level granularity. Figure 4 illustrates how BIG-IP LTM
with WOM addresses the same 512 bytes of data.

Unlike block-based systems, the entire repeating pattern is matched and compressed
by BIG-IP LTM with WOM. In the previous examples, instead of matching only

6
White Paper
Understanding Advanced Data Compression

256 bytes of data, BIG-IP LTM with WOM is able to match and reduce all 392 bytes
of repetitive data. This level of granularity enables BIG-IP LTM with WOM to achieve
greater levels of compression than competing block-based systems—not only on
documents, but also on application layer protocol headers.

512 Bytes of Network Data

392 Bytes of Previously Transferred Data

392 Bytes Segment Matched

392 Bytes Matched = 392 Bytes Saved

Figure 4: Transparent data reduction

Static versus Adaptive Compression

Most compression capabilities on WAN optimization devices are statically configured.
This means the algorithm, whether optimal for the network link and conditions
or not, is always applied to the data being transferred across the WAN. Unique
to F5 devices is symmetric adaptive compression, which automatically picks the
right compression algorithm to maximize compression while maintaining high
throughput. This feature is native to F5 TMOS® architecture and is part of a larger
symmetric optimization feature set known as iSessions.

As noted in Figure 5, the performance of compression algorithms varies greatly;

furthermore, performance is highly dependent on the type of data being exchanged.
Symmetric adaptive compression automatically selects a high-compression codec for
slow link speeds; it will never select a compression codec that is too slow for the
link. It also includes a CPU saver mode for data that is known not to compress

7
White Paper
Understanding Advanced Data Compression

2870
2740

2080

1212

509
323

Deflate LZO Adaptive

Multiple Connections (64) Single Connection

Figure 5: Comparison of compression algorithm throughput performance

with BIG-IP version 8900

well. This feature is advantageous to organizations that have multiple WAN links
with varying speeds: CPU saver mode minimizes concern over less-than-ideal WAN
optimization that can result from differences in WAN characteristics.

Application versus Network

By virtue of their beginnings as network-focused solution sets, WAN optimization
solutions have traditionally focused on the network. These solutions optimize a few
application layer protocols, but those protocols are generally focused on the transfer
of large data sets from shared file systems such as Common Internet File System
(CIFS), Microsoft’s file access protocol, and Samba.

BIG-IP LTM with WOM provides specific policies for file sharing across CIFS, to
optimize traffic between servers running Microsoft Exchange Server and clients running
Microsoft Office Outlook, and for optimizing web applications. These optimization
policies reduce chattiness of the protocols and add web-application-specific
acceleration options that can improve response time and overall performance of
applications delivered via the WAN. These optimizations and acceleration techniques

8
White Paper
Understanding Advanced Data Compression

are possible because of TMOS, which enables WAN optimization and application
acceleration solutions to share a unified internal architecture. This architecture
enhances the ability to apply multiple techniques to the same data, ensuring it
performs as well as possible.

Does Throughput Matter?

While achieving a high compression ratio is vital to improving application
performance on networks with limited bandwidth, system throughput also plays
an important role. The performance gains from a given compression technology can
be assessed by considering the technology’s expected compression ratio, the
device’s peak compression throughput, and the network bandwidth. If the
compression ratio is too low, the network will remain saturated and performance
gains will be minimal. Similarly, if compression speed is too low, the compressor
will become the bottleneck.

TDR, as implemented in BIG-IP LTM with WOM, has been optimized to maintain
high throughput. While the Riverbed Steelhead 5520 peaks at 540 Mbps,
BIG-IP LTM with WOM can sustain speeds of up to 10,000 Mbps with a single
appliance (BIG-IP version 8900). When TDR is coupled with symmetric adaptive
compression capabilities, BIG-IP LTM with WOM can sustain up to 10,600 Mbps
with the same single appliance.

Conclusion
Achieving substantial application performance gains through compression requires
a good compression algorithm and a system architecture that is designed for
performance. The compression system must precisely match repetitive patterns
to achieve high compression ratios. When possible, the most efficient compression
algorithm based on the network link should be applied automatically. This
system must manage stored data and incoming application traffic to maximize
effectiveness, and it should optimize and accelerate the performance of applications
commonly accessed via a WAN link (see Figure 6). Finally, this system must do all
this quickly to minimize latency and continue to fill the network.

9
White Paper
Understanding Advanced Data Compression

Raw Data

Step 1 Step 2 Step 3 Step 4 Step 5 Step 6

Application Layer Data De-duplication Symmetric Adaptive SSL Encryption TCP Optimization Bandwith Allocation
Acceleration Compression

Optimized Data

Figure 6: How BIG-IP LTM with WOM optimizes applications and data transfers

BIG-IP LTM with WOM and the TDR feature were designed from the ground up
to meet these demands the requirements that a system not only provide significant
compression to improve data transfer rates but simultaneously accelerate and
optimize applications delivered over the WAN. By leveraging the capabilities
afforded by deployment on a unified application delivery platform, BIG-IP LTM
with WOM is able to apply compression algorithms dynamically, optimize
and accelerate web application and email access, reduce bandwidth utilization,
and minimize the time required to transfer large data sets across constrained
WAN links.

F5 Networks, Inc. 401 Elliott Avenue West, Seattle, WA 98119 888-882-4447 www.f5.com

F5 Networks, Inc. F5 Networks F5 Networks Ltd. F5 Networks

Corporate Headquarters Asia-Pacific Europe/Middle-East/Africa Japan K.K.
[email protected] [email protected] [email protected] [email protected]

© 2010 F5 Networks, Inc. All rights reserved. F5, F5 Networks, the F5 logo, BIG‑IP, FirePass, iControl, TMOS, and VIPRION are trademarks
or registered trademarks of F5 Networks, Inc. in the U.S. and in certain other countries. CS01-00009 0510

Content-Based Textual Big Data Analysis and Compression: Fei Gao Ananya Dutta Jiangjiang Liu
No ratings yet
Content-Based Textual Big Data Analysis and Compression: Fei Gao Ananya Dutta Jiangjiang Liu
6 pages
Data Compression UNIT1
No ratings yet
Data Compression UNIT1
74 pages
Improvised GZIP Published Eai.1!10!2019.160599
No ratings yet
Improvised GZIP Published Eai.1!10!2019.160599
8 pages
Computer Networks - A Top Down Approach: Notes (Chapters 1, 2, 3, 9)
No ratings yet
Computer Networks - A Top Down Approach: Notes (Chapters 1, 2, 3, 9)
69 pages
12 Computer Science Notes CH08 Communication and Open Source Concepts
50% (2)
12 Computer Science Notes CH08 Communication and Open Source Concepts
16 pages
Computer Network Notes
No ratings yet
Computer Network Notes
58 pages
Literature Survey
No ratings yet
Literature Survey
5 pages
High Speed Networks Two Marks Questions and Answers 2MARKS-Libre
No ratings yet
High Speed Networks Two Marks Questions and Answers 2MARKS-Libre
27 pages
Foundation: Larry L. Peterson and Bruce S. Davie
No ratings yet
Foundation: Larry L. Peterson and Bruce S. Davie
60 pages
Module 1: Networking Today: Introduction To Networks v7.0 (ITN)
No ratings yet
Module 1: Networking Today: Introduction To Networks v7.0 (ITN)
59 pages
E-Commerce MCQ
No ratings yet
E-Commerce MCQ
24 pages
Seminar Report On Internet: Submitted in Partial Fulfillement For The Degree of Masters of Business Administration
No ratings yet
Seminar Report On Internet: Submitted in Partial Fulfillement For The Degree of Masters of Business Administration
24 pages
Computer Networks Questions & Answers - Basics - 1
No ratings yet
Computer Networks Questions & Answers - Basics - 1
219 pages
Hobbes Internet Timeline - The Definitive Arpanet Internet History
0% (1)
Hobbes Internet Timeline - The Definitive Arpanet Internet History
33 pages
M.SC Computer Science: Mother Teresa Women'S University
No ratings yet
M.SC Computer Science: Mother Teresa Women'S University
96 pages
Computer Networking: A Top Down Approach: A Note On The Use of These PPT Slides
No ratings yet
Computer Networking: A Top Down Approach: A Note On The Use of These PPT Slides
75 pages
Internet
No ratings yet
Internet
33 pages
CN MCQ
No ratings yet
CN MCQ
43 pages
p22 - 0x04 - A Novice's Guide To Hacking (1989 Edition) - by - The Mentor
No ratings yet
p22 - 0x04 - A Novice's Guide To Hacking (1989 Edition) - by - The Mentor
13 pages
Module 1: Introduction To Network Basics and Administration
100% (1)
Module 1: Introduction To Network Basics and Administration
40 pages
Packet Radio
No ratings yet
Packet Radio
2 pages
Advances in Computer Architecture
No ratings yet
Advances in Computer Architecture
6 pages
Data Link Layer
No ratings yet
Data Link Layer
121 pages
WWW Infosectrain Com Blog Cisa Domain 4 Information Systems Operations Maintenan
No ratings yet
WWW Infosectrain Com Blog Cisa Domain 4 Information Systems Operations Maintenan
43 pages
Network and Network Types
No ratings yet
Network and Network Types
31 pages
Transmission and Switching Techniques
No ratings yet
Transmission and Switching Techniques
22 pages
Lecture 1
No ratings yet
Lecture 1
15 pages
Wan Introduction: © 2006 Cisco Systems, Inc. All Rights Reserved. Cisco Public ITE I Chapter 6
No ratings yet
Wan Introduction: © 2006 Cisco Systems, Inc. All Rights Reserved. Cisco Public ITE I Chapter 6
37 pages
Network Lesson 6
No ratings yet
Network Lesson 6
12 pages
Computer Network and Data Communication: Course Instructor: Engr. Bilal Hasan
No ratings yet
Computer Network and Data Communication: Course Instructor: Engr. Bilal Hasan
32 pages
Chapter-1: 1.1 Introduction To Digital Telephony
No ratings yet
Chapter-1: 1.1 Introduction To Digital Telephony
27 pages
Virtual Circuit - Computer Networks Questions & Answers
No ratings yet
Virtual Circuit - Computer Networks Questions & Answers
7 pages
History of Internet: Accounting Information System of Technology Faculty of Economic Gunadarma University 2019/2020
No ratings yet
History of Internet: Accounting Information System of Technology Faculty of Economic Gunadarma University 2019/2020
16 pages
MOCK EXAM 1-I
No ratings yet
MOCK EXAM 1-I
5 pages
InfiniBand Architecture and Implementation: Definitive Reference for Developers and Engineers
From Everand
InfiniBand Architecture and Implementation: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Bytewax for Pythonic Stream Processing: The Complete Guide for Developers and Engineers
From Everand
Bytewax for Pythonic Stream Processing: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
Mastering System Programming with C: Files, Processes, and IPC
From Everand
Mastering System Programming with C: Files, Processes, and IPC
Larry Jones
No ratings yet
Cerebras GPT: Wafer-Scale Architectures for Large Language Models
From Everand
Cerebras GPT: Wafer-Scale Architectures for Large Language Models
William Smith
No ratings yet
Software Defined Networking (SDN) - a definitive guide
From Everand
Software Defined Networking (SDN) - a definitive guide
Rajesh Kumar Sundararajan
2/5 (2)
Software-Defined Networks: A Systems Approach
From Everand
Software-Defined Networks: A Systems Approach
Larry Peterson
5/5 (1)
Rust for Network Programming and Automation, Second Edition
From Everand
Rust for Network Programming and Automation, Second Edition
Gilbert Stew
No ratings yet
NATS Architecture and Implementation Guide: Definitive Reference for Developers and Engineers
From Everand
NATS Architecture and Implementation Guide: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Crafting Data-Driven Solutions: Core Principles for Robust, Scalable, and Sustainable Systems
From Everand
Crafting Data-Driven Solutions: Core Principles for Robust, Scalable, and Sustainable Systems
Peter Jones
No ratings yet
Optimized Caching Techniques: Application for Scalable Distributed Architectures
From Everand
Optimized Caching Techniques: Application for Scalable Distributed Architectures
Peter Jones
No ratings yet
Rust for Network Programming and Automation, Second Edition: Work around designing networks, TCP/IP protocol, packet analysis and performance monitoring using Rust 1.68
From Everand
Rust for Network Programming and Automation, Second Edition: Work around designing networks, TCP/IP protocol, packet analysis and performance monitoring using Rust 1.68
Gilbert Stew
No ratings yet
Mastering the Art of Network Programming: Unraveling the Secrets of Expert-Level Programming
From Everand
Mastering the Art of Network Programming: Unraveling the Secrets of Expert-Level Programming
Steve Jones
No ratings yet
TinyGo for Embedded Systems and WebAssembly: The Complete Guide for Developers and Engineers
From Everand
TinyGo for Embedded Systems and WebAssembly: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
DENT Network Operating System in Practice: The Complete Guide for Developers and Engineers
From Everand
DENT Network Operating System in Practice: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
Liftbridge Streams over NATS: The Complete Guide for Developers and Engineers
From Everand
Liftbridge Streams over NATS: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
RisingWave for Real-Time Data Processing: The Complete Guide for Developers and Engineers
From Everand
RisingWave for Real-Time Data Processing: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
Liftbridge Message Streams for Distributed Systems: The Complete Guide for Developers and Engineers
From Everand
Liftbridge Message Streams for Distributed Systems: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
Advanced Apache Tez Techniques: Definitive Reference for Developers and Engineers
From Everand
Advanced Apache Tez Techniques: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Boost.Asio Techniques and Applications: Definitive Reference for Developers and Engineers
From Everand
Boost.Asio Techniques and Applications: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Kestra Pipeline Orchestration Essentials: The Complete Guide for Developers and Engineers
From Everand
Kestra Pipeline Orchestration Essentials: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
Rsync Solutions: Definitive Reference for Developers and Engineers
From Everand
Rsync Solutions: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Red Hat AMQ Streams for Cloud-Native Messaging: The Complete Guide for Developers and Engineers
From Everand
Red Hat AMQ Streams for Cloud-Native Messaging: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
Private 5G: A Systems Approach
From Everand
Private 5G: A Systems Approach
Larry L Peterson
No ratings yet
GASNet Programming and Architecture: Definitive Reference for Developers and Engineers
From Everand
GASNet Programming and Architecture: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Efficient Data Science Workflows with Vaex: Definitive Reference for Developers and Engineers
From Everand
Efficient Data Science Workflows with Vaex: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Reliable Messaging with Nanomsg: Definitive Reference for Developers and Engineers
From Everand
Reliable Messaging with Nanomsg: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
LEMP Architecture and Administration: Definitive Reference for Developers and Engineers
From Everand
LEMP Architecture and Administration: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Efficient Parallel Computing with Dask: Definitive Reference for Developers and Engineers
From Everand
Efficient Parallel Computing with Dask: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Logstash Essentials: Definitive Reference for Developers and Engineers
From Everand
Logstash Essentials: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Comprehensive DHCP Protocol Design and Deployment: Definitive Reference for Developers and Engineers
From Everand
Comprehensive DHCP Protocol Design and Deployment: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Yarn Essentials: Definitive Reference for Developers and Engineers
From Everand
Yarn Essentials: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Kinesis Stream Processing Essentials: Definitive Reference for Developers and Engineers
From Everand
Kinesis Stream Processing Essentials: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Networking Programming with C++: Build Efficient Communication Systems
From Everand
Networking Programming with C++: Build Efficient Communication Systems
Robert Johnson
No ratings yet
Storm Systems for Real-Time Data Processing: Definitive Reference for Developers and Engineers
From Everand
Storm Systems for Real-Time Data Processing: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Network File System in Practice: Definitive Reference for Developers and Engineers
From Everand
Network File System in Practice: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Designing Resilient Distributed Systems with CAP: Definitive Reference for Developers and Engineers
From Everand
Designing Resilient Distributed Systems with CAP: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Kafka for Distributed Systems: Definitive Reference for Developers and Engineers
From Everand
Kafka for Distributed Systems: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Distributed File Systems Engineering: Definitive Reference for Developers and Engineers
From Everand
Distributed File Systems Engineering: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
SystemTap Essentials: Definitive Reference for Developers and Engineers
From Everand
SystemTap Essentials: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Practical Dataflow Engineering: Definitive Reference for Developers and Engineers
From Everand
Practical Dataflow Engineering: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Advanced Resilient Distributed Datasets in Distributed Computing: Definitive Reference for Developers and Engineers
From Everand
Advanced Resilient Distributed Datasets in Distributed Computing: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Principles of Real-Time Data Streaming: Definitive Reference for Developers and Engineers
From Everand
Principles of Real-Time Data Streaming: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
HSRP: Design and Implementation
From Everand
HSRP: Design and Implementation
Richard Johnson
No ratings yet
Debezium in Action: Definitive Reference for Developers and Engineers
From Everand
Debezium in Action: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Tcpdump in Depth: Definitive Reference for Developers and Engineers
From Everand
Tcpdump in Depth: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Stream Processing Techniques and Patterns: Definitive Reference for Developers and Engineers
From Everand
Stream Processing Techniques and Patterns: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Distributed Cluster Operations with DC/OS: Definitive Reference for Developers and Engineers
From Everand
Distributed Cluster Operations with DC/OS: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Network Address Translation Protocols and Design: Definitive Reference for Developers and Engineers
From Everand
Network Address Translation Protocols and Design: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Flannel Networking Essentials: Definitive Reference for Developers and Engineers
From Everand
Flannel Networking Essentials: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Weave Networking for Cloud-Native Infrastructure: Definitive Reference for Developers and Engineers
From Everand
Weave Networking for Cloud-Native Infrastructure: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Comprehensive Guide to Zipkin: Definitive Reference for Developers and Engineers
From Everand
Comprehensive Guide to Zipkin: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
NetFlow Protocols and Applications: Definitive Reference for Developers and Engineers
From Everand
NetFlow Protocols and Applications: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Rapid Spanning Tree Protocol for Modern Networks: Definitive Reference for Developers and Engineers
From Everand
Rapid Spanning Tree Protocol for Modern Networks: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Principles of Multiple Spanning Tree Protocol: Definitive Reference for Developers and Engineers
From Everand
Principles of Multiple Spanning Tree Protocol: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
DNP3 Protocol Engineering: Definitive Reference for Developers and Engineers
From Everand
DNP3 Protocol Engineering: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
NB-IoT Systems and Protocols: Definitive Reference for Developers and Engineers
From Everand
NB-IoT Systems and Protocols: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet

Understanding Advanced Data Compression: F5 White Paper

Uploaded by

Understanding Advanced Data Compression: F5 White Paper

Uploaded by

F5 White Paper

Blocks versus Bytes 6

Static versus Adaptive Compression 7

Application versus Network 8

Does Throughput Matter? 9

Inroduction “…within 12 months roughly

Packet-based compression systems have additional problems. When compressing

Figure 1: Packet-based compression

Figure 2: Session-based compression

Furthermore, by operating at the session layer, packet boundary and repacketization

N = total number of words in the collection (not number of unique words

All modern dictionary-based compression systems leverage uneven distribution

Blocks versus Bytes

A critical shortcoming of block-based systems is that repetitive data almost never

512 Bytes of Network Data

392 Bytes of Previously Transferred Data

256 Bytes Cached Block 256 Bytes Cached Block

1 Block Matched = 256 Bytes Saved

Figure 3: Block-based data reduction

Similar to Riverbed and Juniper’s approach of using previously transferred data to

512 Bytes of Network Data

392 Bytes of Previously Transferred Data

392 Bytes Segment Matched

392 Bytes Matched = 392 Bytes Saved

Figure 4: Transparent data reduction

Static versus Adaptive Compression

As noted in Figure 5, the performance of compression algorithms varies greatly;

Deflate LZO Adaptive

Multiple Connections (64) Single Connection

Figure 5: Comparison of compression algorithm throughput performance

Application versus Network

Does Throughput Matter?

Step 1 Step 2 Step 3 Step 4 Step 5 Step 6

F5 Networks, Inc. F5 Networks F5 Networks Ltd. F5 Networks

You might also like