Proj 6
Project 6: Prompt Engineering for GPT-2 Medium Model (345M) on an NLP Task
Note: Your submitted code must run on the TAMU FASTER system – Ensure you test your code there before submission.
****** This project requires the Fall 2024 SIF on TAMU FASTER (see the video on D2L) ******
Objective(s): As demonstrated in class, the pre-trained 345-million-parameter GPT-2 Medium model is the largest that will
fit on a GPU with 24 GB of memory. The 345M model was one of the intermediate-sized models mentioned in the GPT-2 paper,
but was not the main model for which results were reported. In this project you will explore and report the capabilities of
this model on an NLP task you choose (one that follows the usage guidelines in the paper/literature).
Project description: The starter code and required modified files to download and use GPT-2 Medium were
demonstrated in class and have been tested on the TAMU FASTER system. Carefully follow notebook instructions.
Note: The T4 GPU on TAMU FASTER has insufficient memory for this task. Use one of the following GPUs: A10, A30,
A40, A100. Note that there are fewer of these GPUs, so plan accordingly.
Evaluate and modify aspects of this model to incorporate the following:
Ensure you download the starter code and zip file into the same directory. On the first run, execute cells one
at a time and comment out those that only need to run once.
Recall that GPT-2 was trained as a language model, without task-specific fine-tuning. Presenting prompts in the
proper format to the model can induce responses on NLP tasks for which it was not explicitly trained. In addition
to the prompts used for the various tasks listed in the GPT-2 paper, researchers and users have done considerable
work on extracting valuable output from the model. To evaluate the model:
o Do some research and experimentation on prompt engineering to get answers to a non-trivial NLP
task which you will define.
o Explore different tasks and prompts before choosing your task. Note: the exact format of many
“successful” prompts is not always clearly stated, and one’s interpretation of what was fed to the
model can at times result in poor performance. For example: is an input a single string with x
delimited components? OR is it a list or tuple composed of x strings? Ask yourself
questions: What’s the disconnect? Is there a similar prompt that works better? Is there a different
way of presenting the data that is optimal for the task? What examples are likely to result in good
model output?
o Clearly identify the sources from which you obtained ideas and submit at least 20 examples of
prompts for the task you settled on (do not submit prompts for multiple tasks).
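To make the single-string vs. list-of-strings distinction concrete, here is a minimal sketch; the antonym task and the "=>" delimiter are hypothetical choices for illustration, not taken from the starter code:

```python
# Hypothetical few-shot antonym task: one common interpretation is that the
# whole prompt is a SINGLE string, with examples separated by newlines and
# fields separated by a delimiter.
examples = [("good", "bad"), ("hot", "cold")]
query = "tall"

prompt = "\n".join(f"{w} => {a}" for w, a in examples) + f"\n{query} =>"
print(prompt)
# good => bad
# hot => cold
# tall =>

# A different interpretation -- passing ["good => bad", "hot => cold", "tall =>"]
# as a list of separate strings -- may behave very differently; experiment
# with both to see which format the model actually responds to.
```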
Modify the interact_model function in the notebook so that it is NOT keyboard-interactive.
o It should accept model prompts generated from a loop and return each output.
o The model prompts should come from an input file - Prompt the model with your samples in an
input file, so it runs without requiring user interaction at the keyboard.
o Name the file <initial of your first name><initial of your last name>_input.txt (example: sb_input.txt)
o The file’s format should include the complete input prompt (including any data) and the expected
output. The prompt and expected output should be separated by a tab.
o Only input prompts should be sent to the interact_model function. The expected responses are used
for gauging the model’s performance by comparing them with the output.
o Submit an excerpt of at least 20 samples for your chosen task in a file.
o Vary arguments to the interact_model function (length, temperature, top_k) to improve performance.
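One way to structure the non-interactive driver is sketched below; the file name, the modified interact_model signature, and the argument values are assumptions you will adapt to your own notebook:

```python
def load_samples(path):
    """Read tab-separated (input prompt, expected output) pairs, one per line."""
    samples = []
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.rstrip("\n")
            if not line:
                continue  # skip blank lines
            prompt, expected = line.split("\t", 1)
            samples.append((prompt, expected))
    return samples

# Hypothetical driver loop -- interact_model is assumed to be your modified
# notebook function, which takes a prompt string and returns generated text.
# for prompt, expected in load_samples("sb_input.txt"):
#     output = interact_model(prompt, length=64, temperature=0.7, top_k=40)
#     # ...compare output with expected to gauge the model's performance...
```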
Trim the output: GPT is a generative model and will generate the requested number of tokens. The output
should be processed to remove extra components. For example:
o If a question is being asked of the model, and question/answer pairs are provided as examples, the
model may go beyond answering the question posed and provide its own question/answer pairs (in
addition to those asked, up to max tokens). These should be removed from the output.
o Anything after the model outputs <|endoftext|> is usually outside the scope of what is expected
from the prompt, and unrelated to the expected output. This should be removed from the output.
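A sketch of such post-processing; the keep-only-the-first-line heuristic assumes a short-answer task, so adapt it to yours:

```python
def trim_output(raw, prompt=""):
    """Trim raw GPT-2 output: cut at <|endoftext|>, drop an echoed prompt,
    and keep only the first non-empty line (the model's direct answer),
    discarding extra question/answer pairs the model invents on its own."""
    text = raw.split("<|endoftext|>", 1)[0]
    if prompt and text.startswith(prompt):
        text = text[len(prompt):]
    for line in text.splitlines():
        if line.strip():
            return line.strip()
    return text.strip()

print(trim_output("A: Paris\nQ: Capital of Spain?\nA: Madrid<|endoftext|>junk"))
# A: Paris
```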
Display the input prompt and model output (on separate lines) for the samples in the input file. Separate
the input/output pairs from one another with a blank line.
Requirements: Where two weights are shown for an item, the first is for CSCI 4820 and the second for CSCI 5820 credit.
1. (85%/70%) Your program must provide the functionality listed below, in a single file named proj6.ipynb:
Modify the code per these requirements. Determine the format of the prompt to get the best performance from
the model on your task by consulting all resources at your disposal. Ensure you properly cite/attribute all
ideas/works you use in your submission.
The quality of your chosen NLP task, the prompts used, and the output obtained from the model matter. Do not
simply repeat examples covered in class (this will not earn significant credit).
2. (10%) Test your program – Test your code. Ensure your submitted model fully executes and produces results on the
TAMU system.
3. (5%) Place a Markdown cell at the top of the source file(s) with the following identifying information:
Your Name
CSCI <course number>-<section number>
Project #X
Due: mm/dd/yy
4. (CSCI 5820 Students only, 15%) Analysis – Provide an analysis (in a Markdown cell) addressing the following topics:
a) Describe at least one NLP task you considered/attempted and the reason(s) why you abandoned making that your
main task (what you tried – specific prompts and input format, why you assumed it would work, and what steered
you away from the task).
b) Provide details of the NLP task you settled on to demonstrate GPT-2 Medium's capabilities, focusing on what the task
is, the prompt you decided on, and the quality of the output. Give some examples of real-world problems that
might use this task, and how your solution would fit into that architecture.
c) Describe the process by which you arrived at your final prompt, by comparing it to at least two other prompting
methods attempted that were not as successful.
6. Document all input and output from AI and other external tools and sources in the file proj6doc.pdf.
7. Submit required files via the D2L dropbox for this project. Ensure your input file is submitted.
TAMU FASTER Troubleshooting
****** This project requires the Fall 2024 SIF on TAMU FASTER ******
If you have installed packages on TAMU FASTER prior to this assignment, you may encounter errors regarding missing
or incompatible packages. The notebook has been tested and works on the default container without problems.
To revert to default container, log on to TAMU and before starting a notebook instance, bring up a shell as shown below:
Once in the shell change to your home directory (using the cd command) and type the following command (note this
command will remove all of the contents of the .local directory, including previously installed packages):
rm -fr .local/*
After running any pip install command, an error may appear saying that one or more modules just installed were not
found. Simply restart the notebook and re-run it.
Ensure that you comment out any cells that perform a pip install or download after the first time the notebook runs, to
suppress unnecessary re-attempts and messages.