Presentify Automated Presentation Slide Generation
Presentify Automated Presentation Slide Generation
1
IOE, Thapathali Campus
ABSTRACT This study presents a novel approach for automating the generation of presentation slides
from academic research papers. We leverage Natural Language Processing (NLP) techniques, particularly
a fine-tuned T5 transformer model, to streamline knowledge dissemination within the computer science
domain. The model, trained on a custom dataset of research articles, extracts key information and
transforms it into well-structured presentation slides. The T5 model’s proficiency in text-to-text transfer
tasks is enhanced through fine-tuning with domain-specific metrics. This approach enables the automatic
conversion of research articles into concise and informative presentation slides. This research has the
potential to significantly impact how technical information is summarized and communicated within the
research community, fostering greater efficiency and clarity.
INDEX TERMS NLP, T5 model, Gemini model ,Slide generation, Text extraction and Section extraction
T5 transformer model for content summarization and a summaries for each section. The T5 model has been
dedicated section extraction model (e.g., Gemini model) specifically adjusted to optimize the summarization pro-
to preserve the structure and key points of the original re- cess for research articles.
search paper.
4 Slide Generation
III METHODOLOGY The summarized content for each section is then passed
A DATASET DESCRIPTION to the presentation module, which formats the information
This research leverages a domain-specific dataset sourced into a standardized presentation slide format. This ensures
exclusively from arXiv.org, a reputable online repository the slides have a clear and coherent structure, with each
for computer science research articles. This targeted se- slide covering a specific section of the research paper.
lection ensures the dataset’s relevance to the research fo- 5 Customization Options
cus and provides a well-defined corpus for model training The system also provide options for users to customize the
and evaluation. appearance and layout of the generated slides, allowing
The dataset consists of 15,655 entries meticulously di- them to tailor the presentation to their specific needs.
vided into training (10,958), testing (3,131), and valida-
tion sets (1,566) to facilitate robust model performance as- C SYSTEM REQUIREMENTS
sessment. Notably, ”Computer Vision and Pattern Recog- Following are the basic requirements for the system:
nition” emerges as the most prevalent category, with 675 1 Hardware Requirements
instances. • Multi-core CPU (Intel Xeon or AMD Ryzen)
Each data point comprises two key features: text and sum-
mary. The ”text” column houses the extracted content • RAM (16GB or more)
from research articles, potentially including sections like
introductions, literature reviews, methodologies, results, • System Type – 64-bit Operating System
and conclusions. The corresponding ”summary” column
contains a concise representation of the extracted text, of- 2 Software Requirements
fering a succinct version of the original content. This • Operating System – Windows
dataset serves as the foundation for training and evalu-
ating the employed NLP models. Ultimately, it facilitates • RAM Programming Language – Python
the optimization of the system’s ability to generate infor- • Python Packages - Numpy, Pandas, BeautifulSoup,
mative presentation slides tailored for academic research PyMuPDF
papers within the computer science domain.
B PROPOSED METHODOLOGY Python
This research proposes a novel methodology for automat- Python is a high level, general purpose, interpreted, dy-
ing the generation of presentation slides from academic namic programming language. Python supports multiple
research papers in the computer science domain. The programming paradigms, including object-oriented, im-
methodology utilizes Natural Language Processing (NLP) perative and functional programming or procedural styles.
techniques to achieve information extraction, summariza- NumPy
tion, and presentation structure creation. NumPy is a Python library for numerical computing, of-
fering powerful data structures and tools for working with
1 Text Extraction arrays, matrices, and mathematical functions. Its efficient
The system first extracts the textual content from the PDF operations make it essential for scientific computing, data
file using PyMuPDF, a powerful Python library for work- analysis, and machine learning in Python.
ing with PDFs. This involves parsing the PDF and ex- Pandas
tracting the text, including sections, paragraphs, and other A Python library specializing in data manipulation and
relevant information. analysis, providing high performance, easy-to-use data
2 Section Extraction with Gemini structures and tools for working with structured data like
tables and time series.
The extracted text is then passed to Gemini, a tool capa-
BeautifulSoup
ble of identifying and extracting distinct sections from the
Beautiful Soup is a Python package for parsing HTML
research paper, such as Introduction, Literature Review,
and XML documents (including having malformed
Methodology, Results, and Conclusions.
markup, i.e., non-closed tags, so named after tag soup).
3 Summarization with Fine-Tuned T5 Model It creates a parse tree for parsed pages that can be used to
The sections extracted by Gemini are then fed into a fine- extract data from HTML, which is useful for web scrap-
tuned T5 model, which transforms the text into concise ing. Beautiful Soup
PythonPPTX IV RESULTS
PythonPPTX is a Python library used for creating and A User Interface
manipulating Microsoft PowerPoint (.pptx) files program-
matically. With this library, you can automate the pro-
cess of generating presentations by adding slides, insert-
ing text, images, shapes, tables, and charts, and customiz-
ing their properties such as formatting, styles, and layouts.
It enables tasks like batch processing multiple presenta-
tions, dynamically generating slides based on data, and
applying design templates for consistent styling. Over-
all, PythonPPTX streamlines the creation and modifica-
tion of PowerPoint files, making it a valuable tool for re-
port generation, data visualization, and automated presen-
tation generation.
Figure 5: Slide 1
Figure 6: Slide 2
Figure 7: Slide 3
V CONCLUSION
This research presented a novel tool that utilizes Natural
Language Processing (NLP) techniques to automate the
generation of presentation slides from academic research
papers in the computer science domain. The system
leverages a fine-tuned T5 model for content
summarization and the Gemini model for section
extraction, enabling the extraction of key information
from research articles. This information is then
transformed into well-structured and visually appealing
presentation slides. The system architecture incorporates
Figure 9: slide5 functionalities for data preparation, model training, text
extraction, summarization, and slide generation. The user
interface allows for user interaction through features like
PDF upload, URL input, slide preview, and theme
selection.
The system’s performance was evaluated using loss
curves and ROUGE scores, demonstrating the
effectiveness of the T5 model in summarizing research
articles. This evaluation ensures the accuracy and
conciseness of the generated summaries. Additionally,
the user interface offers a user-friendly experience for
researchers, educators, and professionals, empowering
them to create informative presentations with greater
efficiency.
Figure 10: slide6 A ADVANTAGE
• Demonstrates precision in extracting relevant
information from documents.
B DISADVANTAGE
• Projects success depends on Gemini’s accuracy.