TOPIC: HUFFMAN CODES
ACKNOWLEDGEMENT
First and foremost, I, KARANBIR SINGH, am very thankful to Lect. VIJAY GARG, who assigned me this term paper on HUFFMAN CODES. I am heartily thankful to the college library for providing the books, and to my roommates and classmates for helping me assemble the notes related to this topic. Last but not least, I am very thankful to my parents, who gave me the financial support to complete this term paper.
KARANBIR SINGH
Contents
1) Introduction
2) Types of Huffman coding
   a) n-ary Huffman coding
   b) Adaptive Huffman coding
   c) Huffman template algorithm
   d) Length-limited Huffman coding
   e) Huffman coding with unequal letter costs
   f) Hu-Tucker coding
   g) Canonical Huffman code
3) Properties
4) Advantages
5) Disadvantages
6) Applications
7) References
INTRODUCTION
Huffman coding is an entropy encoding algorithm used for lossless data compression. It was developed by David A. Huffman while he was a Ph.D. student at MIT, and published in the 1952 paper "A Method for the Construction of Minimum-Redundancy Codes". Huffman coding is based on the frequency of occurrence of a data item (e.g., a pixel in an image). The principle is to use fewer bits to encode the data that occur more frequently. Codes are stored in a code book, which may be constructed for each image or for a set of images; in either case, the code book plus the encoded data must be transmitted to enable decoding.

Huffman coding uses a specific method for choosing the representation for each symbol, resulting in a prefix code (sometimes called a "prefix-free code"): the bit string representing one symbol is never a prefix of the bit string representing any other symbol. The most common source symbols are expressed using shorter bit strings than the less common ones. Huffman was able to design the most efficient compression method of this type: no other mapping of individual source symbols to unique bit strings produces a smaller average output size when the actual symbol frequencies agree with those used to create the code. A method was later found to design a Huffman code in linear time if the input probabilities (also known as weights) are sorted.

For a set of symbols with a uniform probability distribution and a number of members which is a power of two, Huffman coding is equivalent to simple binary block encoding, e.g., ASCII coding. Huffman coding is such a widespread method for creating prefix codes that the term "Huffman code" is widely used as a synonym for "prefix code", even when the code is not produced by Huffman's algorithm.
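The construction described above can be sketched in a few lines: repeatedly merge the two least frequent subtrees, prefixing 0 to the codes on one side and 1 to the other. This is a minimal illustration, not a production encoder; the symbol frequencies are the classic textbook example, not data from this paper.

```python
import heapq
from itertools import count

def huffman_codes(freqs):
    """Build a prefix code from a {symbol: frequency} map."""
    tick = count()  # tie-breaker so the heap never compares dicts
    heap = [(f, next(tick), {s: ""}) for s, f in freqs.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        # Merge the two least frequent subtrees; prepend 0/1 to their codes.
        f1, _, c1 = heapq.heappop(heap)
        f2, _, c2 = heapq.heappop(heap)
        merged = {s: "0" + code for s, code in c1.items()}
        merged.update({s: "1" + code for s, code in c2.items()})
        heapq.heappush(heap, (f1 + f2, next(tick), merged))
    return heap[0][2]

codes = huffman_codes({"a": 45, "b": 13, "c": 12, "d": 16, "e": 9, "f": 5})
```

With these frequencies, the most frequent symbol "a" receives a 1-bit code while the rarest symbols "e" and "f" receive 4-bit codes, and no code is a prefix of any other.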
Variations of the basic algorithm exist. The Huffman template algorithm, for example, can solve other minimization problems, such as minimizing max_i [w_i + length(c_i)], a problem first applied to circuit design. Length-limited Huffman coding, on the other hand, is not solvable in the same manner or with the same efficiency as conventional Huffman coding.
PROPERTIES
1. Unique prefix property: no code is a prefix of any other code (all symbols sit at the leaf nodes of the code tree). This makes decoding unambiguous.
2. If prior statistics are available and accurate, Huffman coding compresses very well.
3. The frequencies used can be generic ones for the application domain, based on average experience, or they can be the actual frequencies found in the text being compressed.
4. Huffman coding is optimal when the probability of each input symbol is a negative power of two.
5. The worst case for Huffman coding can happen when the probability of a symbol exceeds 2^-1 = 0.5, making the upper limit of inefficiency unbounded. Such situations often respond well to a form of blocking called run-length encoding.
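Property 4 can be checked with a small calculation: when every probability is a negative power of two, an optimal prefix code gives each symbol a length of exactly -log2(p), so the average code length equals the source entropy. The probability distribution below is chosen purely for illustration.

```python
import math

# Every probability is a negative power of two: 1/2, 1/4, 1/8, 1/8.
probs = {"a": 0.5, "b": 0.25, "c": 0.125, "d": 0.125}

# Optimal code lengths are exactly -log2(p): 1, 2, 3, 3 bits.
lengths = {s: int(-math.log2(p)) for s, p in probs.items()}

avg_len = sum(probs[s] * lengths[s] for s in probs)          # expected bits/symbol
entropy = -sum(p * math.log2(p) for p in probs.values())     # Shannon entropy

# avg_len == entropy == 1.75 bits/symbol: no prefix code can do better.
```

When a probability is not a negative power of two, code lengths must round to whole bits, and the average length exceeds the entropy; that gap is the inefficiency referred to in property 5.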
ADVANTAGES
1. The algorithm is easy to implement.
2. It produces a lossless compression of images.
DISADVANTAGES
1. Efficiency depends on the accuracy of the statistical model used and on the type of image.
2. Compression ratios vary with the format, but few images compress better than about 8:1.
3. Compression of image files that contain long runs of identical pixels is not as efficient with Huffman coding as with RLE.
4. The Huffman encoding process is usually done in two passes: during the first pass a statistical model is built, and in the second pass the image data is encoded based on the generated model. Huffman encoding is therefore a relatively slow process, as time is required to build the statistical model in order to achieve an efficient compression ratio.
5. All codes in the encoded data are of different sizes (not of fixed length). It is therefore difficult for the decoder to know when it has reached the last bit of a code; the only way to know is to follow the paths of the upside-down code tree until reaching the end of a branch (a leaf). If the encoded data is corrupted, with additional bits added or bits missing, everything decoded after the error will be wrong, and the final image displayed will be garbage.
6. The Huffman table must be sent at the beginning of the compressed file; otherwise the decompressor will not be able to decode it. This causes overhead.
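The decoding walk and the error propagation described above can be sketched as follows. The code table here is a small hypothetical example, standing in for the table that must be transmitted with the compressed data.

```python
# Hypothetical transmitted code table (prefix-free by construction).
code_table = {"a": "0", "b": "10", "c": "11"}

def decode(bits, table):
    """Walk the bitstream; emit a symbol whenever a complete code is matched.

    Matching a growing buffer against the inverted table is equivalent to
    walking the code tree from the root to a leaf, one bit at a time.
    """
    inverse = {code: sym for sym, code in table.items()}
    out, buf = [], ""
    for bit in bits:
        buf += bit
        if buf in inverse:          # reached a leaf: a full code was read
            out.append(inverse[buf])
            buf = ""
    if buf:                         # leftover bits: stream truncated/corrupted
        raise ValueError("truncated or corrupted bitstream")
    return "".join(out)

decode("010110", code_table)   # → "abca"
```

Flipping just the first bit ("110110" instead of "010110") decodes to "caca": because the codes have no fixed length, a single bit error shifts every code boundary that follows, which is the failure mode noted in disadvantage 5.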
APPLICATIONS
1. Arithmetic coding can be viewed as a generalization of Huffman coding; indeed, in practice arithmetic coding is often preceded by Huffman coding, as it is easier to find an arithmetic code for a binary input than for a non-binary input.
2. Huffman coding is in wide use because of its simplicity, high speed, and lack of encumbrance by patents.
3. Huffman coding today is often used as a "back-end" to some other compression method. DEFLATE (PKZIP's algorithm) and multimedia codecs such as JPEG and MP3 have a front-end model and quantization step followed by Huffman coding.
REFERENCES
1. www.google.com/Huffman
2. http://en.wikipedia.org/wiki/Huffman_coding
3. A. V. Aho, J. E. Hopcroft and J. D. Ullman, The Design and Analysis of Computer Algorithms, Pearson Education Asia, 2007.
4. T. H. Cormen, C. E. Leiserson, R. L. Rivest and C. Stein, Introduction to Algorithms, PHI Pvt. Ltd., 2007.