AI Phase2
USING PYTHON
PRESENTED BY
SIVASUBRAMANIAN TJ
1. GPT-2 TOKENIZER
2. DATA PREPROCESSING
3. CUSTOM PYTORCH DATASET
4. TRAINING ARGUMENTS
5. FINE-TUNING & ENSEMBLE METHOD
6. CONCLUSION
GPT-2 TOKENIZER
● Tokenization is a crucial step in natural language processing (NLP): it breaks text down into smaller units, or tokens, that can be analyzed and processed further. In the provided code, we tokenize text using the GPT-2 tokenizer.
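A minimal sketch of this step, assuming Hugging Face's transformers library and the base "gpt2" checkpoint (the exact loading code in the project may differ):

from transformers import GPT2Tokenizer

# Load the pretrained GPT-2 byte-pair-encoding tokenizer.
tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
# GPT-2 ships without a pad token; reusing EOS is a common workaround.
tokenizer.pad_token = tokenizer.eos_token

encoded = tokenizer("Hello, how are you today?")
print(encoded["input_ids"])  # token IDs
print(tokenizer.convert_ids_to_tokens(encoded["input_ids"]))  # BPE tokens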
DATA PREPROCESSING
● This part of the code reads conversational data from a dataset file, splits it into individual messages, and tokenizes each message with the GPT-2 tokenizer.
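A hedged sketch of this preprocessing, where the file name conversations.txt, the one-message-per-line layout, and the maximum length of 128 are assumptions about the dataset rather than details from the original code:

from transformers import GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token  # enable padding

# Read the raw conversational dump and split it into messages,
# dropping blank lines (assumed one message per line).
with open("conversations.txt", encoding="utf-8") as f:
    messages = [line.strip() for line in f if line.strip()]

# Tokenize every message to a fixed length, returning PyTorch tensors.
encodings = tokenizer(
    messages,
    truncation=True,
    padding="max_length",
    max_length=128,
    return_tensors="pt",
)
print(encodings["input_ids"].shape)  # (num_messages, 128)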
CUSTOM PYTORCH DATASET
● By converting our data into a PyTorch dataset, we make it compatible with PyTorch's data loaders, which makes it easy to iterate through, shuffle, and batch the data while training our GPT-2 models. This dataset feeds the training loop, ensuring the data is in a format the model can consume.
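One possible shape for this wrapper, reusing the encodings produced by the preprocessing sketch above; the class name ConversationDataset is illustrative, and for causal language modeling the labels simply mirror the input IDs:

from torch.utils.data import Dataset, DataLoader

class ConversationDataset(Dataset):
    """Wraps the tokenizer output (a dict of tensors) for training."""

    def __init__(self, encodings):
        self.encodings = encodings

    def __len__(self):
        return self.encodings["input_ids"].size(0)

    def __getitem__(self, idx):
        item = {key: val[idx] for key, val in self.encodings.items()}
        item["labels"] = item["input_ids"].clone()  # causal-LM targets
        return item

# The DataLoader then handles shuffling and batching for training.
dataset = ConversationDataset(encodings)
loader = DataLoader(dataset, batch_size=8, shuffle=True)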
TRAINING ARGUMENTS