HPC Miniproject

The document presents a mini project report on implementing Huffman Encoding using GPU technology through CUDA, aimed at enhancing performance in data compression tasks. It outlines the project's objectives, methodology, and the advantages of using parallel processing to accelerate character frequency counting and data encoding. The report concludes that the hybrid CPU-GPU approach significantly improves the efficiency of Huffman Encoding, especially for large datasets.



Department Of Computer Engineering


STES’S SINHGAD ACADEMY OF ENGINEERING
KONDHWA BK, PUNE 411048

2024-2025
“Implement Huffman Encoding on GPU”

Submitted to the

Savitribai Phule Pune University


In partial fulfillment for the award of the Degree of

Bachelor of Engineering
in

Computer Engineering
By

1) Snehal Abnave COBA03


2) Shubham Upadhyay COBA53
3) Sahil Narale COBC12
4) Shreyash Chalke COBC13

Under the guidance of

Prof. P.R. Dongre

1
CERTIFICATE

This is to certify that the mini project report entitled “Implement Huffman Encoding on
GPU” being submitted by Snehal Abnave COBA03, Shubham Upadhyay COBA53, Sahil
Narale COBC12 and Shreyash Chalke COBC13 is a record of bonafide work carried out by
them under the supervision and guidance of Prof. P.R. Dongre in partial fulfillment of the
requirement for the BE (Computer Engineering) – 2019 course of Savitribai Phule Pune
University, Pune in the academic year 2024-2025.

Date:

Place: Pune

Subject Coordinator Head of the Department

Principal

This Mini Project report has been examined by us as per the Savitribai Phule Pune University,
Pune requirements at SINHGAD ACADEMY OF ENGINEERING Pune – 411048

Internal Examiner External Examiner

2
ACKNOWLEDGEMENT

First and foremost, praises and thanks to God, the Almighty, for showers of
blessings throughout our project work that enabled us to complete it
successfully.
We would like to express our deep and sincere gratitude to our subject teacher
Prof. P.R. Dongre for giving us the opportunity to do this project and for
providing invaluable guidance throughout. Her dynamism, vision, sincerity,
and motivation have deeply inspired us. She has taught us the methodology to
carry out the work and to present it as clearly as possible. It was a great
privilege and honor to work and study under her guidance. We are extremely
grateful for what she has offered us, and also thank her for her empathy and
great sense of humor.
We are extremely grateful to all group members Snehal Abnave, Shubham
Upadhyay, Sahil Narale, and Shreyash Chalke for their dedication and
consistency towards this mini project, and for all the resources provided by
each group member, which played a crucial role in its accomplishment.

Name Sign
Snehal Abnave
Shubham Upadhyay
Sahil Narale
Shreyash Chalke

(Student's Name & Signature)

3
CONTENTS
Sr. No TITLE Page no

1. Abstract 5

2. Introduction 6

3. Problem Statement 7

4. Motivation 7

5. Objectives 8

6. Theory and Outputs 8-10

7. Conclusion 11

8. References 11

4
Abstract
Huffman Encoding is a fundamental data compression technique that reduces the size of
files without losing any data. It works by assigning shorter binary codes to frequently
occurring characters and longer codes to rare ones.

This project focuses on implementing Huffman Encoding using CUDA to leverage GPU
parallelism. The goal is to accelerate parts of the Huffman process such as character
frequency counting and data encoding. By using the parallel computation capability of the
GPU, we aim to optimize performance and reduce processing time, especially for large
inputs.

This report explains the objectives, theory, implementation strategy, output analysis, and
final conclusions based on the CUDA-based Huffman encoder we developed.

5
Introduction
Data compression plays an essential role in computer science, allowing efficient
storage and transmission of information. Huffman encoding is a popular
technique that uses variable-length codes to represent characters based on their
frequency. This project explores the GPU-accelerated version of Huffman
encoding using CUDA. The goal is to speed up parts of the process, such as
frequency counting and parallel encoding, by leveraging the parallel processing
power of modern GPUs.

6
Problem Statement

Traditional Huffman encoding is inherently sequential and can be slow
on large datasets. With the increasing demand for high-speed
compression in big data and real-time systems, there is a need to
accelerate this process using parallel computing. The project aims to
implement Huffman encoding using CUDA to harness the power of the
GPU.

Motivation

In today’s digital age, data is generated at an enormous scale, and
efficient compression techniques have become essential for storage,
transmission, and processing. Huffman Encoding is a classical and
widely used lossless compression algorithm that has been the backbone
of compression systems for decades.
However, as the volume of data continues to grow exponentially,
traditional CPU-based Huffman encoding may become a performance
bottleneck, especially in real-time and large-scale applications. This
motivated us to explore the use of GPU parallel processing with CUDA
to accelerate the Huffman encoding algorithm.
The motivation behind this mini project is to combine classical
algorithms with modern parallel computing to make data compression
faster, smarter, and future-ready.

7
Objective

➢ Implement the Huffman Encoding algorithm using GPU (CUDA).

➢ Use GPU kernels to calculate the character frequency in parallel.

➢ Generate the Huffman Tree and binary codes on the CPU.

➢ Encode the input data using GPU-based parallel processing.

➢ Compare performance and output efficiency.

Theory

Huffman Encoding – Overview

Huffman Encoding is a lossless data compression algorithm developed
by David A. Huffman in 1952. It is based on the principle of assigning
shorter binary codes to frequently occurring characters and longer codes
to rarer characters. This helps reduce the total number of bits required to
represent the data.
The key property of Huffman codes is that they are prefix-free, meaning
no code is a prefix of another. This ensures that the encoded data can be
decoded unambiguously.

Steps in Huffman Encoding

1. Frequency Calculation
Count how often each character appears in the input data.
2. Build a Min-Heap (Priority Queue)
Each node in the heap represents a character and its frequency.
3. Construct the Huffman Tree
Repeatedly combine the two lowest-frequency nodes into a new internal
node. This forms a binary tree with frequencies as weights.
4. Assign Binary Codes
Traverse the tree to assign codes to each character:
• Left edge = 0
• Right edge = 1
5. Encode the Input
Replace every character in the input string with its binary code.

Why Use CUDA and GPU?

Huffman encoding is traditionally implemented on the CPU. However,
certain parts of the process are compute-intensive and can be
parallelized, especially:
• Character frequency calculation: each thread can count frequencies in a
chunk of the input.
• Parallel encoding: each thread encodes one character using the
generated Huffman table.
CUDA (Compute Unified Device Architecture) is a parallel computing
platform by NVIDIA that allows writing GPU programs using C/C++. It
provides thousands of threads working simultaneously, which is ideal for
data-parallel operations like those in Huffman encoding.

Advantages of Huffman Encoding

• Simple and effective for text-based compression.
• Produces optimal prefix codes for known character frequencies.
• Widely used in formats like JPEG, MP3, PNG, and ZIP files.

9
OUTPUT

Fig: Code of Huffman encoding

Fig: Output of the given string/text

10
Conclusion

Huffman Encoding is a powerful algorithm for data compression. In this
project, we successfully implemented a hybrid CPU-GPU system for
encoding using Huffman’s method.
We offloaded frequency calculation and encoding to the GPU, which
makes it faster than traditional CPU-only methods, especially for large
inputs. The project proves that classical algorithms can benefit from
modern parallel processing platforms like CUDA.
This approach can be extended further by adding decoding on the GPU
or compressing actual files like .txt or .csv for practical use.

References

• GeeksforGeeks - Huffman Encoding
• Wikipedia - Huffman Coding
• NVIDIA CUDA Toolkit Documentation
• Rivera, C., Di, S., Tian, J., Yu, X., Tao, D., & Cappello, F. (2022).
Optimizing Huffman Decoding for Error-Bounded Lossy Compression on
GPUs.

11
