Proposal
Proposal
1
TABLE OF CONTENTS
2 Literature Review
5 Conclusion
3 Research Methodology
6 Reference
2
Introduction
3
Motivation
4
Objective
5
Problem Statement
6
Literature Review
Title Authors Published year Result Limitation
A Deep Learning-Based Abdul Hady Akash et al. 2022 The study proposed a The study's focus on
Approach to Image model for Bangla image image captioning,
Captioning in Bengali captioning using ResNet
as a feature extractor rather than direct
text-to-image
generation, may also
limit its relevance to
certain applications.
Zero-Shot Text-to-Image Ramesh et al. 2021 Generates high-quality, Lacks fine control,
Generation." diverse images from text exhibits biases, and is
Proceedings of the prompts with zero-shot
International capabilities. computationally
Conference on Machine expensive, with poor
Learning (ICML). Bengali text support.
7
Literature Review
Title Authors Published year Result Limitation
Toward Multimodal Zhu et al. 2017 Effectively translates GANs suffer from mode
Image-to-Image images (e.g., sketches to collapse, training
Translation." Advances photos) with multimodal instability, and require
in Neural Information outputs. large datasets, making
Processing Systems Bengali adaptation
(NeurIPS) challenging.
Photorealistic Text-to- Saharia et al. 2022 Produces highly realistic Slow inference, high
Image Diffusion Models and diverse images with computational cost,
with Deep Learning. better text-image and limited support for
alignment. Bengali-language text.
8
Research Methodology
9
Proposed Methods
10
Soft ware & Hardware Requirements
11
Flowchart
12
Work Plan
13
Conclusion
14
Conclusion
15
Reference
16
ANY
QUESTIONS?
17