Assignment2

The document outlines Assignment 2 for CSCI376, focusing on multicore and GPU programming tasks. It includes tasks on OpenCL vector datatype manipulation, implementing a shift cipher in both C/C++ and OpenCL, and performing parallel image processing techniques such as luminance conversion, Gaussian blurring, and creating a bloom effect. Each task has specific requirements and marks allocation, with instructions for submission and assessment criteria.

Uploaded by

jeremiahkanhs

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

Assignment2

Uploaded by

jeremiahkanhs

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

SCIT

School of Computing & Information Technology

CSCI376  Multicore and GPU Programming

SIM Session 2 2020

Assignment 2

Task 1 – OpenCL Vector Datatype (5 marks)

Write a program that does the following:
 In the host, create two arrays as follows:
o Array 1: An 8 element array of ints with random values between 10 and 20.
o Array 2: A 16 element array of ints. Initialise the first half of the array with values
from 2 to 9 and the second half with values from -9 to -2.
(1 mark)
 Write a kernel that
o Accepts array 1 as an array of int4s, array 2 and an output array
o Reads the contents from array 1 and 2 into local memory
 Copy the contents of array 1 into an int8 vector called v
 Copy the contents (using vloadn) of array 2 into two int8 vectors called v1
and v2
o Creates an int8 vector in private memory called results. The contents of this vector
should be filled as follows:
 Check whether any of the elements in v are greater than 15
 If there are, then for elements that are greater than 15, copy the
corresponding elements from v1 into results; for elements less than or
equal to 15, copy the elements from v2 into results. (Use select).
 If not, fill the first 4 elements of results with the contents from the
first 4 elements of v1; and fill the next 4 elements of results with
contents from the first 4 elements of v2.
o Stores the contents of v, v1, v2 and results in the output array (using vstoren)
Note that the host will only have to enqueue 1 work item.
(3 marks)
 In the host, check that the results are correct and display the contents of the output array.
(1 mark)

CSCI376 – Multicore and GPU Programming -1- © 2020 University of Wollongong

Task 2 – Shift Cipher (10 marks)
A shift cipher (a.k.a. Caesar’s cipher) is a simple substitution cipher in which each letter in the
plaintext is replaced with another letter that is located a certain number, n, positions away in the
alphabet. The value of n can be positive or negative. For positive values, replace letters with letters
located n places on its right (i.e. ‘shifted’ by n positions to the right). For negative values, replace
letters with letters located n places on its left. If it reaches the end/start of the alphabets, wrap
around to the start/end.
For example: If n = -3, each letter in the plaintext is replaced with a letter 3 positions before that
letter in the alphabet list.
Plaintext: The quick brown fox jumps over the lazy dog.
Ciphertext: QEB NRFZH YOLTK CLU GRJMP LSBO QEB IXWV ALD.
Note that in the example above, c → Z, since 3 positions before ‘c’ wraps around to the end of the
alphabet list and continues from ‘Z’. Similarly, a → X and b → Y.
Leave anything that is not an alphabet as is (i.e. punctuations and spaces).1
Decrypting the ciphertext is simply a matter of reversing the shift.

Task 2a
Write a normal C/C++ program (not using OpenCL) that reads the contents from a text file called
“plaintext.txt” (a test file has been provided). The program should prompt the user to input a valid n
value, then encrypt the plaintext using the shift cipher method described above, and output the
ciphertext into an output text file called “ciphertext.txt”. To ensure that the encryption was
performed correctly, your program must also decrypt the ciphertext into a file called “decrypted.txt”
to check whether it matches the original plaintext (albeit in upper case).
(3 marks)
Task 2b
Write an OpenCL program to perform the same functionality as in Task 2a, but in parallel. Note
that it is more efficient to use OpenCL vector datatypes for processing in the kernel, as less work-
items will be required.
(3 marks)
Task 2c
Write an OpenCL program to perform parallel encryption, and decryption, by substituting
characters based on the following lookup table:
a b c d e f g h i j k l m n o p q r s t u v w x y z
G X S Q F A R O W B L M T H C V P N Z U I E Y D K J
Based on the table above, for encryption the letter a (or A) will be replaced by G, b (or B) will be
replaced by X, c (or C) will be replaced by S, etc.
(4 marks)

1
Note that to avoid leaking information (e.g., word length), by convention the ciphertext is usually converted to upper
case letters that are organised in groups of five-letter blocks and anything that is not a letter is removed. However, for
this assignment, DO NOT remove anything that is not a letter and DO NOT organise in groups of five-letters.

CSCI376 – Multicore and GPU Programming -2- © 2020 University of Wollongong

Task 3 – Parallel Image Processing (10 marks)
For this section, a test image, “peppers.bmp”, has been provided.

Task 3a (Image Luminance)

Write a parallel program to convert the RGB values (i.e. Red, Green and Blue colour channels) in
an image to luminance values (this approach is used to convert a colour image into a greyscale
image).
For each pixel, calculate:
Luminance = 0.299*R + 0.587*G + 0.114*B
Save the luminance image into a 24-bit BMP file. To do this, set the RGB values of each pixel to
the luminance value.
Note that if you use the example code from the tutorial on image processing, the R, G, and B values
range from 0 to 255 on the host (unsigned char), and 0.0 to 1.0 (float) on the device.
(1 mark)
Task 3b (Gaussian Blurring)
Gaussian blurring is a commonly used technique to image processing and graphics to create a
smooth blurring effect using a Gaussian function. The weights of the filter depend on the size of the
Gaussian filter window. The following are example weights for a 7x7 windows:
0.000036 0.000363 0.001446 0.002291 0.001446 0.000363 0.000036
0.000363 0.003676 0.014662 0.023226 0.014662 0.003676 0.000363
0.001446 0.014662 0.058488 0.092651 0.058488 0.014662 0.001446
0.002291 0.023226 0.092651 0.146768 0.092651 0.023226 0.002291
0.001446 0.014662 0.058488 0.092651 0.058488 0.014662 0.001446
0.000363 0.003676 0.014662 0.023226 0.014662 0.003676 0.000363
0.000036 0.000363 0.001446 0.002291 0.001446 0.000363 0.000036

i. Write an OpenCL program that accepts a colour image and outputs a filtered image using
Gaussian blurring based on the 7x7 window weights provided above.
(1 mark)
ii. Instead of using the 7x7 window (the naïve approach), an alternate approach is to run the
filter in 2 passes. The first pass will perform blurring in the horizontal direction; the result
will then undergo a second pass to blur it in the vertical direction (enqueue the kernel twice
to perform blurring in each direction). The result will be similar to the single window
approach, but the amount of computation will be different.
For example, using a 7x7 window approach, each pixel will have to perform a weighted sum
on 49 pixels. In the 2-pass approach, each pixel will have to perform a weighted sum on 7
pixels in each pass, processing a total of 14 pixels. This is illustrated below:

7x7 window horizontal pass vertical pass

Your task is to implement the parallel 2-pass approach. For this, use the following weights
for the horizontal pass as well as the vertical pass:
0.00598 0.060626 0.241843 0.383103 0.241843 0.060626 0.00598
(3 marks)
Task 3c (Bloom effect)
Bloom effects are commonly used in graphics, movies, video games, etc. This part combines the
work from Tasks 3a and 3b. The basic steps to create an image with a bloom effect are illustrated as
follows:

Figure 1: Bloom effect steps.

1. The image in Fig. 1(a) shows the original image
2. The image in Fig. 1(b) shows an image where the glowing pixels are kept, while the rest are
set to black2. For this assignment, allow the user to input a valid threshold luminance value.
Pixels above the threshold luminance value are kept, while pixels below this luminance
value are set to black. This step is related to Task 3a.
3. The image in Fig. 1(b) undergoes a horizontal blur pass, then a vertical blur pass to obtain
the image depicted in Fig. 1(c). This step is related to Task 3b.

2
Note that Figure 1(b) shows a down-sampled image (i.e. the image has been shrunk). It is more efficient to process a
down-sampled image during the blurring step because there are fewer pixels to process. This also works well for
blurring since referencing the pixels from the smaller image in the final step will also cause blurring (blurring caused by
effectively up-sampling back to the original image size. In fact, some approaches simply use down-sampling for
blurring). To make things easier, for this assignment you do not have to perform down-sampling/up-sampling.

4. Finally, the pixel values in the images shown in Fig. 1(a) and Fig. 1(c) are added together to
form the final image shown in Fig. 1(d). Note that the values above the maximum colour
value should be clamped to the maximum value.
Write a parallel program in OpenCL to perform the Bloom effect on an input image. For the
threshold value (in step 2), allow the user to enter a valid threshold value.
Your program should output the following images:
 an image after step 2 (i.e. image showing the glowing pixels)
 an image after the horizontal blur pass
 an image after the vertical blur pass
 the final image with the bloom effect
(5 marks)

For ALL tasks, include screenshots with your submission. The screenshots are to demonstrate that
the programs work on your computer.
For Task 3, include examples of output images obtained from your program.

Instructions and Assessment

Submit your tasks as one zip file with three folders, named Task1, Task2 and Task3, and include all
the required files (e.g., .cpp, .h, .cl) in your submission.
The assignment must be your own work. If asked, you must be able to explain what you did and
how you did it. Marks will be deducted if you cannot correctly explain your code. The marking
allocations shown above are merely a guide. Marks will be awarded based on the overall quality of
your work. Marks may be deducted for other reasons, e.g., if your code is too messy or inefficient,
is not well commented, if you cannot correctly explain your code, etc. For code that does not
compile, does not work or for programs that crash, the most you can get is half the assessment
marks or less.

References
The images were sourced from
 http://http.developer.nvidia.com/GPUGems/gpugems_ch21.html

2017MR1 BeginnersGuide Tank EN
No ratings yet
2017MR1 BeginnersGuide Tank EN
20 pages
Advanced C++ Interview Questions You'll Most Likely Be Asked
From Everand
Advanced C++ Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
Manual Pre-Implementation Steps
No ratings yet
Manual Pre-Implementation Steps
3 pages
2024_25_COL100_Lab_3_Function (4)
No ratings yet
2024_25_COL100_Lab_3_Function (4)
7 pages
EC4060 Lab3 Data-Link Layer
No ratings yet
EC4060 Lab3 Data-Link Layer
4 pages
Chapter Parallel Prefix Sum
No ratings yet
Chapter Parallel Prefix Sum
21 pages
Final - Project
No ratings yet
Final - Project
4 pages
Second Examination: Name: Netid: Lab Section (Day/Time)
No ratings yet
Second Examination: Name: Netid: Lab Section (Day/Time)
14 pages
assignment1-3
No ratings yet
assignment1-3
7 pages
Digital Signal Processing
No ratings yet
Digital Signal Processing
6 pages
National University of Computer and Emerging Sciences, Lahore Campus
No ratings yet
National University of Computer and Emerging Sciences, Lahore Campus
9 pages
DC4_lab1_py
No ratings yet
DC4_lab1_py
5 pages
MIT6 094IAP10 Assn02
No ratings yet
MIT6 094IAP10 Assn02
10 pages
ImageProcessingTutorial
No ratings yet
ImageProcessingTutorial
28 pages
Computer Applications Specimen 2020
No ratings yet
Computer Applications Specimen 2020
6 pages
Assignment # 5 Arrays-Functions (1)
No ratings yet
Assignment # 5 Arrays-Functions (1)
7 pages
(BESTFITTERS) Inverse Image Captioning Using Generative Adversarial Networks
No ratings yet
(BESTFITTERS) Inverse Image Captioning Using Generative Adversarial Networks
12 pages
ast10201-2122a-exam-question
No ratings yet
ast10201-2122a-exam-question
6 pages
Computer Applications Specimen 2020
No ratings yet
Computer Applications Specimen 2020
7 pages
ASSIGN 310SE QP APR2024 v7.15
No ratings yet
ASSIGN 310SE QP APR2024 v7.15
9 pages
C Programming Exam: Game of Life 25-04-2017, 8:45-10.30
No ratings yet
C Programming Exam: Game of Life 25-04-2017, 8:45-10.30
22 pages
SEM-5 IT137 AMP JOURNAL Removed
No ratings yet
SEM-5 IT137 AMP JOURNAL Removed
69 pages
Computer Scien PDF
No ratings yet
Computer Scien PDF
24 pages
Project 1 - ANN With Backprop
No ratings yet
Project 1 - ANN With Backprop
3 pages
Lab 2 Gnuradio Implementation
No ratings yet
Lab 2 Gnuradio Implementation
8 pages
X CA Board Question Pagenumber
No ratings yet
X CA Board Question Pagenumber
258 pages
Assign 2
No ratings yet
Assign 2
3 pages
Digital Signal Processing: Lab Manual
No ratings yet
Digital Signal Processing: Lab Manual
5 pages
File-549830172-549830172 Assignment3 6597654431802515
No ratings yet
File-549830172-549830172 Assignment3 6597654431802515
3 pages
FINAL PROJECT SYNOPSIS.PDF
No ratings yet
FINAL PROJECT SYNOPSIS.PDF
12 pages
DL unit 3
No ratings yet
DL unit 3
18 pages
CAB202 Tutorial 7 - v2 PDF
No ratings yet
CAB202 Tutorial 7 - v2 PDF
7 pages
ML Lab 11 Manual - Neural Networks (Ver4)
No ratings yet
ML Lab 11 Manual - Neural Networks (Ver4)
8 pages
As A Single PDF
No ratings yet
As A Single PDF
3 pages
Inputting A String of Data From Keyboard (Int 21h Option 0ah)
No ratings yet
Inputting A String of Data From Keyboard (Int 21h Option 0ah)
7 pages
Pattern Printing Assignment
No ratings yet
Pattern Printing Assignment
14 pages
Programming Paradigms - C++ FS 2017: Universit at Basel
100% (1)
Programming Paradigms - C++ FS 2017: Universit at Basel
4 pages
Cryptography - Exercises: 1 Historic Ciphers
No ratings yet
Cryptography - Exercises: 1 Historic Ciphers
7 pages
Lab Exercise 04 Object Oriented Programming Lab: National University of Computer and Emerging Sciences
No ratings yet
Lab Exercise 04 Object Oriented Programming Lab: National University of Computer and Emerging Sciences
8 pages
CS5242 Assignment 2
No ratings yet
CS5242 Assignment 2
12 pages
ICTD 351: Introduction To Computer Programming For The Mathematics Teacher
No ratings yet
ICTD 351: Introduction To Computer Programming For The Mathematics Teacher
37 pages
CS021 - Assessment 10 2213686117142407
100% (1)
CS021 - Assessment 10 2213686117142407
3 pages
Final_Questions_for_OOPD_23M____Rubrics
No ratings yet
Final_Questions_for_OOPD_23M____Rubrics
5 pages
DAA All-Quizzes
No ratings yet
DAA All-Quizzes
9 pages
2
No ratings yet
2
3 pages
Insem2 Scheme
No ratings yet
Insem2 Scheme
6 pages
Assignment 01
No ratings yet
Assignment 01
7 pages
problems_chaptr_3
No ratings yet
problems_chaptr_3
4 pages
CG Manual
No ratings yet
CG Manual
33 pages
Ghost Cells
No ratings yet
Ghost Cells
16 pages
Assignment 2 Help
No ratings yet
Assignment 2 Help
24 pages
MAE3456 - MEC3456 LAB 02: Due: 11:59PM (Sharp), Friday 19 March 2021 (End of Week 3)
No ratings yet
MAE3456 - MEC3456 LAB 02: Due: 11:59PM (Sharp), Friday 19 March 2021 (End of Week 3)
7 pages
Cglab
No ratings yet
Cglab
43 pages
Asgn2-20aug2024
No ratings yet
Asgn2-20aug2024
3 pages
Reverse Engineering Assignment
No ratings yet
Reverse Engineering Assignment
8 pages
Hw1 Release v11
No ratings yet
Hw1 Release v11
12 pages
2023A IP Questions
No ratings yet
2023A IP Questions
38 pages
ECE568 2010 Ts 1303709634
No ratings yet
ECE568 2010 Ts 1303709634
10 pages
Project 3
No ratings yet
Project 3
5 pages
RCA Y1 T1 CAT FOP 2023 2024 Marking Guide
No ratings yet
RCA Y1 T1 CAT FOP 2023 2024 Marking Guide
9 pages
Gd Script
From Everand
Gd Script
Marijo Trkulja
No ratings yet
C Programming
From Everand
C Programming
Netra
No ratings yet
WX25 Features
No ratings yet
WX25 Features
47 pages
Engineering Proposal: Prepared For: Prepared by
No ratings yet
Engineering Proposal: Prepared For: Prepared by
6 pages
Fundamentals of Business Writing
No ratings yet
Fundamentals of Business Writing
13 pages
Reviewer Ni Aj Potanginang Procu
No ratings yet
Reviewer Ni Aj Potanginang Procu
11 pages
OSN - Buyer's Guide To Connecting 11i - 115snwbg
No ratings yet
OSN - Buyer's Guide To Connecting 11i - 115snwbg
30 pages
DM Q Bank Ii Combinatorics
No ratings yet
DM Q Bank Ii Combinatorics
3 pages
Dr.M.vinoth Computer Networks Lab
No ratings yet
Dr.M.vinoth Computer Networks Lab
62 pages
Open Workbook of Cryptology
No ratings yet
Open Workbook of Cryptology
92 pages
32877.TOC - CHO 3rd Sem
No ratings yet
32877.TOC - CHO 3rd Sem
9 pages
Its Od 101 Networking Pearson
No ratings yet
Its Od 101 Networking Pearson
3 pages
Chapter 4 AI
No ratings yet
Chapter 4 AI
33 pages
Exam: 000-355 Titl E: Iseries System Administration V5R2
No ratings yet
Exam: 000-355 Titl E: Iseries System Administration V5R2
31 pages
Sf2460i Line 163
No ratings yet
Sf2460i Line 163
123 pages
L109_UL4600
No ratings yet
L109_UL4600
4 pages
Gcube Series Receipt Printer: User Guide
No ratings yet
Gcube Series Receipt Printer: User Guide
24 pages
NCP Computer Science and Entrepreneurship 9-12
No ratings yet
NCP Computer Science and Entrepreneurship 9-12
272 pages
Blackberry Uem: Installation and Upgrade
No ratings yet
Blackberry Uem: Installation and Upgrade
54 pages
A Step by Step Guide For Operations Orchestration-NA
No ratings yet
A Step by Step Guide For Operations Orchestration-NA
18 pages
checklist itgc 20.01
No ratings yet
checklist itgc 20.01
64 pages
Ensayo Sobre Mí Mismo - Ejemplo
100% (1)
Ensayo Sobre Mí Mismo - Ejemplo
8 pages
Installation OpenMeetings 7.1.0 On Ubuntu 22.10
No ratings yet
Installation OpenMeetings 7.1.0 On Ubuntu 22.10
21 pages
Spesifikasi Mobile Operating Table (MEERA CL)
No ratings yet
Spesifikasi Mobile Operating Table (MEERA CL)
2 pages
1-Strongly Disagree 2 - Disagree 3 - Agree 4 - Strongly Agree
No ratings yet
1-Strongly Disagree 2 - Disagree 3 - Agree 4 - Strongly Agree
3 pages
Definitive Amazon Method-1
No ratings yet
Definitive Amazon Method-1
3 pages
HP LaserJet Enterprise MFP M631, M632, M633 and HP LaserJet Managed MFP E62555, E62565, and E62575 - Control Panel Message Document (CPMD)
No ratings yet
HP LaserJet Enterprise MFP M631, M632, M633 and HP LaserJet Managed MFP E62555, E62565, and E62575 - Control Panel Message Document (CPMD)
254 pages
4_5967446226990275918
No ratings yet
4_5967446226990275918
2 pages
Free Access to Using and Understanding Mathematics 6th Edition Bennett Solutions Manual Chapter Answers
100% (13)
Free Access to Using and Understanding Mathematics 6th Edition Bennett Solutions Manual Chapter Answers
52 pages
Project Charter Template D5 2
No ratings yet
Project Charter Template D5 2
4 pages