Full Stack AI Image Generator Project Report With Overview
I express my sincere gratitude to Prof. [Leave space], Dept. of Computer Science &
Engineering, CGC College of Engineering, Landran, Mohali, for his stimulating
guidance, continuous encouragement, and supervision throughout the course of this
project.
The app includes a credit system in which each image generation consumes 1 credit.
Credits can be purchased through the Razorpay payment gateway, which handles the
payment flow. This adds a monetization model to the project and demonstrates its
real-world applicability.
The frontend is built using React.js and Tailwind CSS for a responsive, clean user
experience. Backend services are powered by Express.js and Node.js, handling prompt
submissions, image generation logic, payment webhook handling, and storage of image
data in a MongoDB database. Users can generate images, view past generations, and
contribute to a shared feed. Features like image downloading, prompt copying, Razorpay
transactions, and credit management are implemented.
1. Introduction
2. Literature Survey
3. Present Work
4. Result & Discussion
5. Conclusion & Future Scope
6. References
Chapter 1: INTRODUCTION
The recent advancements in artificial intelligence have enabled machines to understand
and generate images based on textual prompts. This project aims to provide a user-
friendly platform where users can interact with DALL·E, a powerful text-to-image
model, through a custom-built MERN stack application. The motivation behind this work
is to blend frontend development with AI integration and to build a product that
showcases the power of AI in creative fields.
Objectives:
- To implement a text-to-image generator using OpenAI’s DALL·E API.
- To build a full stack application using MERN stack.
- To create a community-driven image sharing platform.
- To integrate Razorpay for a credit-based generation system.
Chapter 2: LITERATURE SURVEY
Platforms and models such as Midjourney, DALL·E, and Stable Diffusion have
demonstrated the potential of AI in generating creative visual content. OpenAI's DALL·E
model uses deep learning and natural language processing to interpret prompts and
create images from scratch.
In this project, the following technologies and systems were studied in detail:
- OpenAI API documentation and integration.
- MongoDB Atlas for scalable NoSQL database usage.
- RESTful API design with Express.js for secure communication.
- Frontend frameworks and routing with React.js.
- Razorpay payment gateway for real-time credit purchasing.
- Prompt-to-image response validation and image optimization.
[Leave space for sample API diagrams, UI flowcharts, and Razorpay integration demo]
Chapter 3: PRESENT WORK
This chapter details the actual implementation of the AI Image Generator and the
integrated Razorpay payment system. The core idea is to allow users to create AI-
generated images from text prompts, and manage their usage via a credit system.
[Leave space for UI wireframes, flowcharts, screenshots of Razorpay popup and credit
deduction logic]
Chapter 4: RESULT & DISCUSSION
The final deployed web application demonstrates the power of AI-based image
generation combined with a practical monetization model. Key results and observations:
Challenges faced:
- Handling API limits and billing
- Managing image cache, file formats, and compression
- Securing payment handling (Razorpay webhooks are verified with a secret key)
- UI responsiveness and loading states
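The webhook check mentioned in the challenges above can be sketched as follows. Razorpay signs each webhook with HMAC-SHA256 over the raw request body and sends the hex digest in the X-Razorpay-Signature header. The function name and the way the secret is passed in are illustrative assumptions, not the project's actual code.

```typescript
import { createHmac, timingSafeEqual } from "node:crypto";

/**
 * Returns true when signatureHeader equals the hex-encoded HMAC-SHA256
 * of rawBody keyed with webhookSecret.
 * isValidWebhookSignature is a hypothetical helper name.
 */
function isValidWebhookSignature(
  rawBody: string,
  signatureHeader: string,
  webhookSecret: string,
): boolean {
  const expected = createHmac("sha256", webhookSecret)
    .update(rawBody, "utf8")
    .digest("hex");

  const expectedBytes = Buffer.from(expected, "utf8");
  const receivedBytes = Buffer.from(signatureHeader, "utf8");

  // timingSafeEqual throws on a length mismatch, so compare lengths first.
  if (expectedBytes.length !== receivedBytes.length) return false;
  return timingSafeEqual(expectedBytes, receivedBytes);
}
```

The signature must be computed over the raw request body exactly as received. If the body is parsed and re-serialized first, the byte sequence can change and a genuine webhook would be rejected.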
[Leave space for screenshot comparisons, prompt vs generated image, Razorpay receipts]
Chapter 5: CONCLUSION & FUTURE SCOPE
Conclusion:
The Full Stack AI Image Generator shows how a text-to-image model can be integrated
into a full stack web application, bridging the gap between creativity and technology.
It enables users to generate visuals from simple text prompts, and supports a
monetization model through credit purchases.
Future Scope:
- Extend the existing JWT-based authentication with Google OAuth sign-in
- Advanced generation settings (image size, styles)
- Support additional image models such as Stable Diffusion or Midjourney
- Integration of image upscaling or editing tools
- Leaderboard and reward system for most creative prompts
- Mobile app version with React Native
This application can serve as a base for startups and educational tools focusing on visual
creativity and generative AI.
Core Functionalities
1. User Authentication
- Secure sign-up and login system using email and password.
- Passwords are hashed before storage and are never stored in plain text.
- Logged-in users can track remaining credits, access their history, and manage
public/private generations.
- JWT (JSON Web Token) is used for secure, token-based access control to protected
routes.
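As an illustration of the password handling described above, the following is a minimal sketch of salted hashing and verification. The report does not name the hashing algorithm. The use of scrypt from Node's built-in crypto module, the "salt:hash" storage format, and the function names are all assumptions made for this example.

```typescript
import { randomBytes, scryptSync, timingSafeEqual } from "node:crypto";

const KEY_LENGTH = 64; // bytes of derived key (assumed value)

// Returns "salt:hash" (both hex) so the salt is stored next to the hash.
function hashPassword(password: string): string {
  const salt = randomBytes(16).toString("hex");
  const hash = scryptSync(password, salt, KEY_LENGTH).toString("hex");
  return `${salt}:${hash}`;
}

// Re-derives the key from the candidate password and the stored salt,
// then compares it with the stored hash in constant time.
function verifyPassword(password: string, stored: string): boolean {
  const [salt, hashHex] = stored.split(":");
  if (!salt || !hashHex) return false;

  const expected = Buffer.from(hashHex, "hex");
  if (expected.length !== KEY_LENGTH) return false;

  const actual = scryptSync(password, salt, KEY_LENGTH);
  return timingSafeEqual(actual, expected);
}
```

Because a fresh random salt is generated for every call, two users with the same password end up with different stored values.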
2. Prompt-to-Image Generation
- Users input creative prompts which are processed by the OpenAI DALL·E API.
- AI generates a unique image based on the description.
- Generated images are saved to MongoDB and displayed in the UI as soon as they are ready.
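The prompt-to-image step above might look roughly like the sketch below, which builds a request for OpenAI's image generation endpoint and reads the image out of the response. The size, image count, and response format are assumed values, and the helper names are hypothetical. The report does not show the project's actual request code, so the parameters should be checked against the current OpenAI API reference.

```typescript
interface ImageRequest {
  url: string;
  init: { method: "POST"; headers: Record<string, string>; body: string };
}

// Minimal shape of an HTTP client; the global fetch in Node 18+ fits it.
type HttpPost = (
  url: string,
  init: ImageRequest["init"],
) => Promise<{ ok: boolean; status: number; json(): Promise<unknown> }>;

// Builds the HTTP request for the image generation endpoint.
function buildImageRequest(prompt: string, apiKey: string): ImageRequest {
  const trimmed = prompt.trim();
  if (trimmed.length === 0) throw new Error("Prompt must not be empty");
  return {
    url: "https://api.openai.com/v1/images/generations",
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${apiKey}`,
      },
      // n, size and response_format are assumed settings.
      body: JSON.stringify({
        prompt: trimmed,
        n: 1,
        size: "1024x1024",
        response_format: "b64_json",
      }),
    },
  };
}

// Pulls the first base64-encoded image out of the parsed response body.
function extractImage(responseJson: unknown): string {
  const data = (responseJson as { data?: Array<{ b64_json?: string }> } | null)?.data;
  const image = data?.[0]?.b64_json;
  if (!image) throw new Error("Response did not contain an image");
  return image;
}

// Sends the request with the supplied HTTP client and returns the image.
async function generateImage(
  prompt: string,
  apiKey: string,
  post: HttpPost,
): Promise<string> {
  const { url, init } = buildImageRequest(prompt, apiKey);
  const res = await post(url, init);
  if (!res.ok) throw new Error(`Image API returned HTTP ${res.status}`);
  return extractImage(await res.json());
}
```

Passing the HTTP client in as a parameter keeps the request-building and response-parsing logic testable without network access. On the server, the call would be generateImage(prompt, apiKey, fetch), with the API key read from an environment variable rather than sent to the browser.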
3. Credit System
- Every image generation deducts 1 credit from the user’s account.
- Users receive initial free credits on signup.
- System prevents generation if credits are insufficient.
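The credit rules listed above can be sketched as a small pure function. The cost of 1 credit per image comes from the report. The number of free signup credits is not stated, so the value 5 below is an assumed placeholder, and the type and function names are hypothetical.

```typescript
interface UserAccount {
  id: string;
  credits: number;
}

const CREDITS_PER_IMAGE = 1;
const SIGNUP_BONUS_CREDITS = 5; // assumed value; the report does not state it

type SpendResult =
  | { ok: true; account: UserAccount }
  | { ok: false; reason: "INSUFFICIENT_CREDITS" };

// New users start with the free signup credits.
function createAccount(id: string): UserAccount {
  return { id, credits: SIGNUP_BONUS_CREDITS };
}

// Deducts the cost of one generation, or refuses if the balance is too low.
// Returns a new account object instead of mutating the input.
function spendCredit(account: UserAccount): SpendResult {
  if (account.credits < CREDITS_PER_IMAGE) {
    return { ok: false, reason: "INSUFFICIENT_CREDITS" };
  }
  return {
    ok: true,
    account: { ...account, credits: account.credits - CREDITS_PER_IMAGE },
  };
}
```

In a MongoDB-backed version, the check and the decrement would ideally happen in a single conditional update (for example, a filter on credits with $gte combined with $inc), so that two concurrent requests cannot both spend the last credit.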
5. Community Feed
- Users can choose to share generated images publicly.
- All public images are shown in the community gallery.
- Promotes creativity and discovery among users.
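A minimal sketch of how the community gallery could be assembled from stored generations is given below. The record fields (isPublic, createdAt, and so on), the newest-first ordering, and the default page size are assumptions for illustration. The report does not describe the actual schema or ordering.

```typescript
interface GeneratedImage {
  id: string;
  ownerId: string;
  prompt: string;
  isPublic: boolean;
  createdAt: Date;
}

// Keeps only publicly shared images, newest first, capped at `limit`.
// filter() returns a new array, so the caller's array is not reordered.
function buildCommunityFeed(
  images: readonly GeneratedImage[],
  limit = 20,
): GeneratedImage[] {
  return images
    .filter((image) => image.isPublic)
    .sort((a, b) => b.createdAt.getTime() - a.createdAt.getTime())
    .slice(0, limit);
}
```

With a database, the same filtering and sorting would normally be pushed into the query rather than done in application memory.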