Huffman Code
Associated with each program i is a length l_i, 1 ≤ i ≤ n. All programs can be stored on the tape if and only if the sum of the lengths of the programs is at most l.
We assume that whenever a program is to be retrieved from this tape, the tape is initially positioned at
the front.
Hence, if the programs are stored in the order I = i_1, i_2, ..., i_n, the time t_j needed to retrieve program i_j is proportional to ∑_{1≤k≤j} l_{i_k}.
If all programs are retrieved equally often, then the expected or mean retrieval time (MRT) is (1/n) ∑_{1≤j≤n} t_j.
In the optimal storage on tape problem, we are required to find a permutation for the n programs so
that when they are stored on the tape in this order the MRT is minimized. This problem fits the ordering
paradigm.
Example: Let n = 3 and (l_1, l_2, l_3) = (5, 10, 3). There are n! = 6 possible orderings. These orderings and their respective d values (where d = ∑_{1≤j≤n} t_j is the total retrieval time, so MRT = d/n) are:

Ordering      d
1, 2, 3       5 + (5+10) + (5+10+3) = 38
1, 3, 2       5 + (5+3) + (5+3+10) = 31
2, 1, 3       10 + (10+5) + (10+5+3) = 43
2, 3, 1       10 + (10+3) + (10+3+5) = 41
3, 1, 2       3 + (3+5) + (3+5+10) = 29
3, 2, 1       3 + (3+10) + (3+10+5) = 34

The ordering 3, 1, 2 minimizes d and hence the MRT.
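Because n is small here, the optimum can also be checked by brute force. The following Python sketch (my own illustration, not part of the original text) enumerates all 3! orderings and confirms that 3, 1, 2 gives the smallest d:

from itertools import permutations

lengths = {1: 5, 2: 10, 3: 3}

def d(order):
    # d(order) = sum of the retrieval times t_j for this ordering
    total, prefix = 0, 0
    for i in order:
        prefix += lengths[i]
        total += prefix
    return total

best = min(permutations(lengths), key=d)
print(best, d(best))   # (3, 1, 2) 29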
A greedy approach to building the required permutation would choose the next program on the basis of some optimization measure. The next program to be stored on the tape would be one that minimizes the increase in d. We observe that the increase in d is minimized if the next program chosen is the one with the least length from among the remaining programs.
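In other words, the greedy rule is simply to store the programs in nondecreasing order of their lengths. A minimal Python sketch of this rule (illustrative code, the names are my own) that also reports d and the MRT:

def optimal_tape_order(lengths):
    # Greedy rule: store programs in nondecreasing order of length.
    order = sorted(range(len(lengths)), key=lambda i: lengths[i])
    d = 0        # total retrieval time, the sum of the t_j
    prefix = 0   # running sum l_{i_1} + ... + l_{i_j}
    for i in order:
        prefix += lengths[i]
        d += prefix
    return order, d, d / len(lengths)

order, d, mrt = optimal_tape_order([5, 10, 3])
print([i + 1 for i in order], d, mrt)   # [3, 1, 2] 29 9.666...

The pseudocode that follows (Algorithm Store) extends this idea to m tapes: assuming the programs are considered in this sorted order, it assigns them to the tapes in round-robin fashion.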
Algorithm Store(n, m)
// n is the number of programs and m the number of tapes.
// The programs are assumed to be indexed in nondecreasing order of length.
{
    j := 0; // Next tape to store on
    for i := 1 to n do
    {
        write ("append program", i, "to permutation for tape", j);
        j := (j + 1) mod m;
    }
}
The optimal merge pattern problem is based on the observation that two sorted files containing n and m records, respectively, can be merged together to obtain one sorted file in time O(n + m).
When more than two sorted files are to be merged together, the merge can be accomplished by
repeatedly merging sorted files in pairs.
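For reference, here is a minimal Python sketch of the basic two-file merge (illustrative, not from the original text); each record is moved to the output exactly once, which is where the O(n + m) cost comes from:

def merge(a, b):
    # Merge two sorted lists in O(len(a) + len(b)) time.
    out, i, j = [], 0, 0
    while i < len(a) and j < len(b):
        if a[i] <= b[j]:
            out.append(a[i])
            i += 1
        else:
            out.append(b[j])
            j += 1
    out.extend(a[i:])   # at most one of these two extends is non-empty
    out.extend(b[j:])
    return out

print(merge([1, 4, 9], [2, 3, 10]))   # [1, 2, 3, 4, 9, 10]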
Thus, if files x1, x2, x3 and x4 are to be merged, we could first merge x1 and x2 to get a file y1. Then we could merge y1 and x3 to get y2. Finally, we could merge y2 and x4 to get the desired sorted file.
Alternatively, we could first merge x1 and x2 to get y1, then merge x3 and x4 to get y2, and finally merge y1 and y2 to get the desired sorted file.
Given n sorted files, there are many ways in which to pairwise merge them into a single sorted file. Different pairings require differing amounts of computing time.
The problem we address now is that of determining an optimal way to pairwise merge n sorted files.
Example: Method 1: The files x1, x2 and x3 are three sorted files of lengths 30, 20, and 10 records, respectively. Merging x1 and x2 requires 50 record moves. Merging the result with x3 requires another 60 moves. The total number of record moves required to merge the three files this way is 110.
Method 2: If we instead merge x2 and x3 first (taking 30 moves) and then merge the result with x1 (taking 60 moves), the total number of record moves is only 90.
This suggests the greedy rule: at each step, merge the two smallest files currently available. Thus, if we have five files (x1, ..., x5) with sizes (20, 30, 10, 5, 30), our greedy rule would generate the following merge pattern:
1. Merge x4 and x3 to get z1 (15 moves)
2. Merge z1 and x1 to get z2 (35 moves)
3. Merge x2 and x5 to get z3 (60 moves)
4. Merge z2 and z3 to get the answer z4 (95 moves)
The total number of record moves is 15 + 35 + 60 + 95 = 205.
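The greedy choice is naturally implemented with a min-heap of file sizes. The following Python sketch (illustrative code, the function name is my own) reproduces the 205 record moves for the sizes above:

import heapq

def merge_cost(sizes):
    # Greedy two-way merge pattern: always merge the two smallest files.
    heap = list(sizes)
    heapq.heapify(heap)
    total_moves = 0
    while len(heap) > 1:
        a = heapq.heappop(heap)   # smallest remaining file
        b = heapq.heappop(heap)   # second smallest remaining file
        total_moves += a + b      # cost (record moves) of this merge
        heapq.heappush(heap, a + b)
    return total_moves

print(merge_cost([20, 30, 10, 5, 30]))   # 205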
A merge pattern such as the one just described will be referred to as a two-way merge pattern (each merge step involves the merging of two files). Two-way merge patterns can be represented by binary merge trees.
The figure below shows a binary merge tree representing the optimal merge pattern obtained for the above five files.
The leaf nodes are drawn as squares and represent the given five files.
These nodes are called external nodes. The remaining nodes are drawn as circles and are called internal
nodes. Each internal node has exactly two children, and it represents the file obtained by merging the
files represented by its two children. The number in each node is the length (i.e., the number of records)
of the file represented by that node.
The external node x4 is at a distance of 3 from the root node z4 (a node at level i is at a distance of i - 1 from the root). Hence, the records of file x4 are moved three times, once to get z1, once again to get z2, and finally one more time to get z4. If d_i is the distance from the root to the external node for file x_i and q_i is the length of x_i, then the total number of record moves for this binary merge tree is ∑_{1≤i≤n} d_i q_i.
This sum is called the weighted external path length of the tree.
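As a check (the arithmetic is worked out here, not taken from the original text), the merge pattern above places x4 and x3 at distance 3 and x1, x2 and x5 at distance 2, so the weighted external path length is 3·5 + 3·10 + 2·20 + 2·30 + 2·30 = 15 + 30 + 40 + 60 + 60 = 205, which agrees with the 205 record moves counted earlier.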
A well-known greedy algorithm is Huffman coding. The size of the code allocated to a character depends on the frequency of that character, which is why it is referred to as a greedy algorithm. A short code is assigned to the character with the highest frequency, and longer codes to characters with lower frequencies. Huffman coding employs variable-length encoding, which means that each character in the provided data stream is given its own variable-length code.
Prefix Rule
Essentially, this rule states that the code allocated to a character must not be a prefix of another character's code. If this rule is broken, ambiguities can appear when decoding a bit stream with the Huffman tree that has been created.
Let's look at an illustration of this rule to better comprehend it: For each character, a code is provided, such
as:
1. a-0
2. b-1
3. c - 01
Assuming that the produced bit stream is 001, it may be decoded in two ways:
1. 0 0 1 = aab
2. 0 01 = ac
Because the code of a (0) is a prefix of the code of c (01), the decoding is ambiguous; the prefix rule forbids such code assignments.
What is the Huffman Coding process?
The Huffman Code is obtained for each distinct character in primarily two steps:
o Create a Huffman Tree first using only the unique characters in the data stream provided.
o Second, we must traverse the constructed Huffman Tree to assign codes to the characters; these codes are then used to encode and decode the provided text.
The steps used to construct the Huffman tree from the characters provided are as follows.
Input:
string str = "abbcdbccdaabbeeebeab"
If Huffman Coding is employed in this case for data compression, the following information must be
determined for decoding:
The frequency of each character in the provided string must first be determined.
Character Frequency
a 4
b 7
c 3
d 2
e 4
1. Sort the characters by frequency, ascending. They are kept in a priority queue Q (a min-heap).
2. For each distinct character and its frequency in the data stream, create a leaf node.
3. Remove the two nodes with the lowest frequencies from the min-heap, and create a new internal node whose frequency is the sum of these two frequencies.
o While extracting the two lowest-frequency nodes, make the first extracted node the left child and the second extracted node the right child of the new node.
o Add this new node to the min-heap.
o The left side should always hold the smaller of the two frequencies.
4. Repeat step 3 until only one node is left in the heap, i.e., until all characters are represented by nodes in the tree. The tree is finished when just the root node remains (see the sketch below).
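Before walking through the example step by step, here is a minimal Python sketch of this construction using the standard heapq module (the function name and tree representation are my own; depending on how ties between equal frequencies are broken, some sibling 0/1 assignments may differ from the tables below, but the code lengths, and hence the compressed size, are the same):

import heapq
from collections import Counter

def huffman_codes(text):
    # Build a Huffman tree with a min-heap and return {character: code}.
    # Each heap entry is (frequency, tie_breaker, tree), where a tree is
    # either a single character (a leaf) or a [left, right] pair of subtrees.
    heap = [(f, i, ch) for i, (ch, f) in enumerate(Counter(text).items())]
    heapq.heapify(heap)
    count = len(heap)
    while len(heap) > 1:
        f1, _, left = heapq.heappop(heap)    # lowest frequency
        f2, _, right = heapq.heappop(heap)   # second lowest frequency
        heapq.heappush(heap, (f1 + f2, count, [left, right]))
        count += 1
    codes = {}
    def walk(tree, code):
        if isinstance(tree, str):
            codes[tree] = code or "0"        # "or" handles a one-character alphabet
        else:                                # internal node: 0 to the left, 1 to the right
            walk(tree[0], code + "0")
            walk(tree[1], code + "1")
    walk(heap[0][2], "")
    return codes

print(huffman_codes("abbcdbccdaabbeeebeab"))
# e.g. {'a': '00', 'e': '01', 'd': '100', 'c': '101', 'b': '11'}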
Step 1: Build a min-heap containing 5 nodes (one for each unique character in the provided data stream), where each node is the root of a tree with a single node.
Step 2: Obtain the two minimum-frequency nodes from the min-heap. Add a new internal node, with frequency 2 + 3 = 5, created by joining the two extracted nodes.
o Now, there are 4 nodes in the min-heap, 3 of which are the roots of trees with a single element each,
and 1 of which is the root of a tree with two elements.
Step 3: In a similar manner, get the two minimum-frequency nodes from the heap. Add a new internal node formed by joining the two extracted nodes; its frequency in the tree is 4 + 4 = 8.
o Now the min-heap has three nodes: one node is the root of a tree with a single element, and two nodes are the roots of trees with multiple elements.
Step 4: Get the two minimum-frequency nodes. Add a new internal node formed by joining the two extracted nodes; its frequency in the tree is 5 + 7 = 12.
o When creating a Huffman tree, we must ensure that the smaller value is always on the left side and the larger value on the right side. The image below shows the tree formed so far:
Step 5: Get the next two minimum-frequency nodes. Add a new internal node formed by joining the two extracted nodes; its frequency in the tree is 12 + 8 = 20.
Continue until all of the distinct characters have been added to the tree. The Huffman tree created for the specified set of characters is shown in the image above.
Now, for each non-leaf node, assign 0 to the left edge and 1 to the right edge to create the code for each
letter.
o If we give the left edges weight 0, we should give the right edges weight 1.
o If the left edges are given weight 1, the right edges must be given weight 0.
o Either of the two conventions may be used.
o However, the same convention must be followed when decoding the tree as well.
o To obtain the Huffman code for each character from the resulting Huffman tree, we must traverse the tree until we reach the leaf node where that character is present.
o The weights along the edges must be recorded during traversal and concatenated to form the code of the character at that leaf node.
o The following example will help to further illustrate what we mean:
o To obtain the code for each character in the picture above, we must walk the entire tree (until all leaf nodes are covered).
o As a result, the tree that has been created is used to read off the code for each character. Below is a list of the codes for each character:
Character Frequency Code
a 4 01
b 7 11
c 3 101
d 2 100
e 4 00
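As a quick check of the compression achieved (this worked arithmetic is mine, not from the original text), encoding the 20-character string with these codes takes 4·2 + 7·2 + 3·3 + 2·3 + 4·2 = 45 bits, compared with 20 × 8 = 160 bits for plain 8-bit characters, or 20 × 3 = 60 bits if a fixed 3-bit code were used for the 5 distinct characters.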