Visual Generative AI Applications
Visual Generative AI Applications
VGAP Assignment
ASSIGNMENT
Submission Deadline:
Presentation: 5-6th Dec 2024 11:59PM
Report and Code: 7th Dec 2024 11:59PM
Students Name :
Student Number :
1
SDGAI Oct Semester 2024/25
Official Open
VGAP Assignment
1. Assignment Objectives
This assignment aims to assess your competencies in the following areas:
- Controlled Generation
Implement controlled generative processes to influence and guide the output of AI models.
2. Assignment Overview
You will use the Fashion MNIST dataset in two parts:
Part 1: Build and train unconditional generative models to generate images of fashion
items.
Part 2: Build and train conditional generative models to control the image generation
process.
3. Dataset
Download the dataset from PoliteMall, which contains a collection of images on fashion items
from 10 different categories:
• T-shirt/Top
• Trouser
• Pullover
• Dress
2
SDGAI Oct Semester 2024/25
Official Open
VGAP Assignment
• Coat
• Sandal
• Shirt
• Sneaker
• Bag
• Ankle Boot
The dataset consists of 60,000 training images across 10 categories. Each image is grayscale
with a resolution of 28x28 pixels, where the pixel intensity values range from 0 to 255. Figure
1 displays 25 random examples from this dataset.
3
SDGAI Oct Semester 2024/25
Official Open
VGAP Assignment
4. Building Generative Models
4.1. Part 1: Unconditional Generative Model
Design and implement a generative model capable of creating images of fashion items from
10 distinct categories (classes). Your model should generate visually diverse and accurate
representations for each class.
The Jupyter notebook starter file (ASG_starter.ipynb) downloads the MNIST fashion item
dataset into a dataloader.
Your task is to complete this notebook by implementing your various generative models. It is
recommended that you follow the suggested structure provided below to tackle the problem
effectively.
You will need to develop 2 models for unconditional image generation following the
recommended steps below:
4
SDGAI Oct Semester 2024/25
Official Open
VGAP Assignment
Step 3 – Evaluate your model.
- Use the model to generate new images.
- Provide a random (noise) tensor as input to generate your desired image.
ASG1_unGAN_[your_name].ipynb
ASG1_unDiffusion _[your_name].ipynb.
5
SDGAI Oct Semester 2024/25
Official Open
VGAP Assignment
4.2. Part 2: Conditional Generative Model
Design and implement a generative model capable of creating images of fashion items from
10 distinct categories (classes). Your model should generate visually diverse and accurate
representations for each class.
You will need to develop 2 models for conditional image generation following the
recommended steps:
ASG1_cGAN_[your_name].ipynb.
6
SDGAI Oct Semester 2024/25
Official Open
VGAP Assignment
4.2.2. Conditional Diffusion Model
ASG1_cDiffusion _[your_name].ipynb.
7
SDGAI Oct Semester 2024/25
Official Open
VGAP Assignment
5. Deliverables
The deliverable for this assignment includes:
- Jupyter Notebook files
- Report
- Online live presentation.
Zip the 4 files into one file, rename the file as ASG1 _[your_name].zip. Submit your zip file
on PoliteMall by 7th Dec 2024 11:59PM.
DO NOT PLAGIARIZE (please refer to Ngee Ann Polytechnic Plagiarism Policy webpage
for more information)
Suggested content description and word count are provided for each section. You are free to
include other relevant information you deem necessary in the sections.
(Note: For a page with 1-inch margins, 12-point Arial font, and minimal spacing elements, a good rule
of thumb is 500 words for a single-spaced page)
8
SDGAI Oct Semester 2024/25
Official Open
VGAP Assignment
Suggested Report Sections & Content Guidelines Word Count
1. Table of Contents NA
2. Overview
• Describe the problem, the objective and the approach. Min: 300 words
Max: 1000 words
6. Summary
• Summarize your model performance and provide Min: 100 words
suggestions for further improvements Max: 500 words
9
SDGAI Oct Semester 2024/25
Official Open
VGAP Assignment
• Evaluate AI ethics and apply ethical considerations in the
design and deployment of generative AI models,
emphasizing the role of prompt engineering in shaping ethical
outcomes.
5.3. Presentation
You are required to do an online live presentation and share your findings. The presentation
should not exceed 15 minutes, followed by a Question-and-Answer session.
You are encouraged to use your Jupyter notebook for your presentation. The Online
Presentation is scheduled in the week starting 2nd Dec 2024. The exact timing will be arranged
by your tutor. If class is affected by public holiday, alternative arrangements will be made.
6. Grading Criteria
Component
Grading Criteria
Weightage
a) Quality of work
b) Flow of presentation based on content guidelines
(see section 5.2)
Presentation 20%
c) Presentation and articulation skills
d) Clear comprehension of code and fundamental
concepts.
a) Quality of work
b) Completeness of report based on suggested report
Final Report sections and content guidelines (see section 5.2) 80%
c) Clarity of report, Quality of analysis and discussions
d) Use of proper visual aids and Use of proper grammar
10
SDGAI Oct Semester 2024/25