E02 Computer A Handout
Computer Assignment A
Mohammad Vali
GPT-2
[Figure: the GPT-2 pipeline. Input words ("The quick brown fox") are mapped to token IDs and embedding vectors (e.g. "quick" → 2068 → [0.007, -0.082, ..., -0.158]), passed through transformer layers 1-4, and a prediction head assigns each candidate next word a probability (p("jumps") = 0.792, p("jumped") = 0.111, ..., p("is") = 0.007, ..., p("<eos>") = 0.000). The next word is picked based on its probability, the sentence is extended with "jumps", and the process repeats.]
▶ Training works by predicting the next word in text, similar to generation.
▶ The transformer layers are the complex part [1, 2, 3]:
▶ Attention combines relationships across the text sequence into each word's representation.
▶ An MLP refines each word's representation (a sketch follows the table below).
▶ More parameters typically lead to better generation.

Model     GPT-2 S   GPT-2 M   GPT-2   GPT-3   GPT-4
#params   124M      355M      1.5B    175B    ?

Source: https://en.wikipedia.org/wiki/Large_language_model
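The attention and MLP bullets above can be made concrete in code. Below is a minimal NumPy sketch of single-head scaled dot-product attention, the mechanism the bullets name; real GPT-2 uses multiple heads, learned per-layer projections, layer normalization, and an MLP after each attention step.

```python
# Minimal sketch of single-head causal self-attention, as named in the bullets.
# Plain NumPy for clarity; real GPT-2 layers are multi-head and learned.
import numpy as np

def attention(X, Wq, Wk, Wv):
    """X: (seq_len, d) word representations -> refined (seq_len, d) outputs."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv            # queries, keys, values
    scores = Q @ K.T / np.sqrt(K.shape[-1])     # pairwise relationship strengths
    mask = np.triu(np.ones_like(scores, dtype=bool), k=1)
    scores = np.where(mask, -np.inf, scores)    # causal: no attending to future words
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # row-wise softmax
    return weights @ V                          # each word mixes in the others

# Tiny demo: four 8-dimensional word representations with random weights.
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
print(attention(X, Wq, Wk, Wv).shape)  # (4, 8)
```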
[1] Blog post: https://jalammar.github.io/illustrated-gpt2/
[2] "An Introduction to Transformers", Richard E. Turner (2023). https://arxiv.org/abs/2304.10557
[3] YouTube: https://youtu.be/wjZofJX0v4M?si=zjk7DnuEs_egGhWr
Computer Assignment A
▶ Run code in JupyterHub to generate text with GPT-2 medium (355M params).
▶ You only need a web browser; no coding is required.
▶ Log in to https://jupyter.cs.aalto.fi and click 'fetch'. This creates a folder /introtoai2025/ which holds the notebook.
▶ To reset the notebook, select Kernel → Restart.
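For reference, generating text with GPT-2 medium can look roughly like the sketch below, again assuming the Hugging Face transformers library; the notebook you fetch may differ in detail.

```python
# Rough sketch of the assignment's task: sample text from GPT-2 medium.
# "gpt2-medium" is the 355M-parameter checkpoint on the Hugging Face hub.
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2-medium")
out = generator("The quick brown fox", max_new_tokens=20, do_sample=True)
print(out[0]["generated_text"])
```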