AI2025 Lecture05 Inperson Slide
Model = Architecture + Parameters

[Figure: an input image is flattened into a vector of pixel values, 𝒙 = (123, 94, …)⊤; the model maps 𝒙 to an output label: 1 = Cat, 0 = Not a Cat.]
Learning Process

[Figure: Input → Model (many vector/matrix operations) → Output; compare the output with the label to compute the loss, then update the parameters to minimize the loss.]
Linear model:   ŷ = 𝒘⊤𝒙 + 𝑏        Parameters: 𝒘 ∈ ℝⁿ, 𝑏 ∈ ℝ
Logistic model: ŷ = σ(𝒘⊤𝒙 + 𝑏)     Parameters: 𝒘 ∈ ℝⁿ, 𝑏 ∈ ℝ

Sigmoid: σ(x) = 1 / (1 + e^(−x)),  with σ(−∞) = 0 and σ(+∞) = 1
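As a minimal sketch of the two models above; the helper names and all numbers are illustrative, not from the lecture code:

import numpy as np

def sigmoid(z):
    # sigma(z) = 1 / (1 + e^(-z)); squashes any real number into (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

def predict(w, b, x):
    # logistic model: y_hat = sigma(w^T x + b)
    return sigmoid(np.dot(w, x) + b)

# example with made-up parameters and input
w = np.array([0.5, -1.0])
b = 0.1
x = np.array([2.0, 1.0])
print(predict(w, b, x))   # a probability between 0 and 1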
Logistic Regression: Cost Function
▪ Given (𝒙^(1), 𝑦^(1)), …, (𝒙^(m), 𝑦^(m)), we want ŷ^(i) = 𝑃(𝑦^(i) = 1); roughly speaking, ŷ^(i) ≈ 𝑦^(i)
▪ Loss for a single example:
  𝐿(ŷ, 𝑦) = −𝑦 log ŷ − (1 − 𝑦) log(1 − ŷ)
  If 𝑦 = 1: 𝐿(ŷ, 𝑦) = −log ŷ
  If 𝑦 = 0: 𝐿(ŷ, 𝑦) = −log(1 − ŷ)
▪ Cost function:
  𝐽(𝒘, 𝑏) = (1/𝑚) Σ_{i=1}^{m} 𝐿(ŷ^(i), 𝑦^(i))
▪ Logistic regression model: ŷ = σ(𝒘⊤𝒙 + 𝑏)
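A minimal numpy sketch of this loss and cost; the function names and example values are illustrative, not from the lecture code:

import numpy as np

def loss(y_hat, y):
    # per-example cross-entropy: L(y_hat, y) = -y*log(y_hat) - (1-y)*log(1-y_hat)
    return -y * np.log(y_hat) - (1 - y) * np.log(1 - y_hat)

def cost(y_hat, y):
    # J(w, b): average loss over the m training examples
    return np.mean(loss(y_hat, y))

# example: predictions close to the labels give a small cost
y     = np.array([1.0, 0.0, 1.0])
y_hat = np.array([0.9, 0.1, 0.8])
print(cost(y_hat, y))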
Logistic Regression – Optimization
Optimization
▪ Logistic regression model
  ŷ = σ(𝒘⊺𝒙 + 𝑏), where σ(z) = 1 / (1 + e^(−z))
▪ Cost function
  𝐽(𝒘, 𝑏) = (1/𝑚) Σ_{i=1}^{m} 𝐿(ŷ^(i), 𝑦^(i)) = −(1/𝑚) Σ_{i=1}^{m} [𝑦^(i) log ŷ^(i) + (1 − 𝑦^(i)) log(1 − ŷ^(i))]
▪ Our goal
  ▪ Find parameters 𝒘 ∈ ℝⁿ, 𝑏 ∈ ℝ that minimize 𝐽(𝒘, 𝑏)
  ▪ Gradient Descent! Repeatedly update
    θ* = θ − η ⋅ ∇_θ 𝐽(θ)
Computing the Parameters with Gradient Descent
ŷ = σ(𝒘⊤𝒙 + 𝑏)
𝐽(𝒘, 𝑏) = (1/𝑚) Σ_{i=1}^{m} 𝐿(ŷ^(i), 𝑦^(i))
Gradient descent update: θ* = θ − η ⋅ ∇_θ 𝐽(θ), where
θ = (𝑤₁, 𝑤₂, …, 𝑤ₙ, 𝑏)⊤ and ∇_θ 𝐽(θ) = (∂𝐽/∂𝑤₁, ∂𝐽/∂𝑤₂, …, ∂𝐽/∂𝑤ₙ, ∂𝐽/∂𝑏)⊤

Two useful derivatives:
1. (d/dx) logₑ x = (d/dx) ln x = 1/x
2. (d/dx) σ(x) = σ(x)(1 − σ(x)), since for σ(z) = 1/(1 + e^(−z)):
   (d/dz) σ(z) = e^(−z) / (1 + e^(−z))² = σ(z)(1 − σ(z))
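A quick numerical check of derivative fact 2, as a hedged sketch (the evaluation point z = 0.7 and the step h are arbitrary choices):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# check d(sigma)/dz = sigma(z) * (1 - sigma(z)) numerically at one point
z, h = 0.7, 1e-6
numerical = (sigmoid(z + h) - sigmoid(z - h)) / (2 * h)   # central difference
analytic  = sigmoid(z) * (1 - sigmoid(z))
print(numerical, analytic)   # the two values should agree closely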
▪ 𝐿(ŷ, 𝑦) = −𝑦 log ŷ − (1 − 𝑦) log(1 − ŷ)
▪ For simplicity, consider two features: 𝒙 = (𝑥₁, 𝑥₂)⊤, 𝒘 = (𝑤₁, 𝑤₂)⊤

[Computation graph: 𝑥₁, 𝑥₂, 𝑤₁, 𝑤₂, 𝑏  →  𝑧 = 𝑤₁𝑥₁ + 𝑤₂𝑥₂ + 𝑏  →  ŷ = 𝑎 = σ(𝑧)  →  𝐿(𝑎, 𝑦); the label 𝑦 is the target to learn.]

Chain rule through the graph:
d𝐿/d𝑤ₖ = (d𝐿/d𝑎)(d𝑎/d𝑧)(d𝑧/d𝑤ₖ)
       = (−𝑦/𝑎 + (1 − 𝑦)/(1 − 𝑎)) ⋅ σ(𝑧)(1 − σ(𝑧)) ⋅ 𝑥ₖ
       = (−𝑦/𝑎 + (1 − 𝑦)/(1 − 𝑎)) ⋅ 𝑎(1 − 𝑎) ⋅ 𝑥ₖ
       = −𝑦(1 − 𝑎)𝑥ₖ + (1 − 𝑦)𝑎𝑥ₖ
       = (𝑎 − 𝑦)𝑥ₖ
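A small sketch that checks the result d𝐿/d𝑤ₖ = (𝑎 − 𝑦)𝑥ₖ against a finite-difference derivative; all parameter and input values are illustrative:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def loss(w, b, x, y):
    # cross-entropy loss of a single example through the graph z -> a -> L
    a = sigmoid(np.dot(w, x) + b)
    return -y * np.log(a) - (1 - y) * np.log(1 - a)

# illustrative values (not from the lecture)
w = np.array([0.3, -0.2]); b = 0.1
x = np.array([1.0, 2.0]);  y = 1.0

a = sigmoid(np.dot(w, x) + b)
analytic = (a - y) * x          # dL/dw_k = (a - y) * x_k from the derivation above

# central-difference check on w_1
h = 1e-6
w_plus, w_minus = w.copy(), w.copy()
w_plus[0] += h; w_minus[0] -= h
numerical = (loss(w_plus, b, x, y) - loss(w_minus, b, x, y)) / (2 * h)
print(analytic[0], numerical)   # should match closely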
Logistic Regression Derivative – Calculate on your own
d𝐿/d𝑤₁ = ?
d𝐿/d𝑤₂ = ?
d𝐿/d𝑏 = ?

[Computation graph: 𝑥₁, 𝑥₂, 𝑤₁, 𝑤₂, 𝑏  →  𝑧 = 𝑤₁𝑥₁ + 𝑤₂𝑥₂ + 𝑏  →  ŷ = 𝑎 = σ(𝑧)  →  𝐿(𝑎, 𝑦)]

Answers, with the corresponding gradient-descent updates:
d𝐿/d𝑤₁ = (d𝐿/d𝑎)(d𝑎/d𝑧)(d𝑧/d𝑤₁) = (𝑎 − 𝑦)𝑥₁     𝑤₁* ≔ 𝑤₁ − η ⋅ d𝐿/d𝑤₁
d𝐿/d𝑤₂ = (d𝐿/d𝑎)(d𝑎/d𝑧)(d𝑧/d𝑤₂) = (𝑎 − 𝑦)𝑥₂     𝑤₂* ≔ 𝑤₂ − η ⋅ d𝐿/d𝑤₂
d𝐿/d𝑏  = (d𝐿/d𝑎)(d𝑎/d𝑧)(d𝑧/d𝑏)  = 𝑎 − 𝑦          𝑏* ≔ 𝑏 − η ⋅ d𝐿/d𝑏
Gradient descent on m examples
𝐽(𝒘, 𝑏) = (1/𝑚) Σ_{i=1}^{m} 𝐿(ŷ^(i), 𝑦^(i))

(d/d𝑤ₖ) 𝐽(𝒘, 𝑏) = (1/𝑚) Σ_{i=1}^{m} (d/d𝑤ₖ) 𝐿(ŷ^(i), 𝑦^(i)) = (1/𝑚) Σ_{i=1}^{m} (ŷ^(i) − 𝑦^(i)) 𝑥ₖ^(i)
(d/d𝑏) 𝐽(𝒘, 𝑏)  = (1/𝑚) Σ_{i=1}^{m} (d/d𝑏) 𝐿(𝑎^(i), 𝑦^(i))  = (1/𝑚) Σ_{i=1}^{m} (𝑎^(i) − 𝑦^(i))

Per-epoch updates (d_w1, d_w2, d_b are the gradients summed over the m examples):
▪ w1 -= lr * d_w1/m
▪ w2 -= lr * d_w2/m
▪ b -= lr * d_b/m
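A vectorized numpy sketch of these per-epoch updates; the function name, learning rate, and epoch count are assumptions for illustration, not the lecture's code:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_logistic(X, y, lr=0.1, n_epochs=1000):
    # Batch gradient descent over all m examples.
    # X: (m, n) feature matrix, y: (m,) labels in {0, 1}.
    m, n = X.shape
    w = np.zeros(n)
    b = 0.0
    for _ in range(n_epochs):
        a = sigmoid(X @ w + b)          # predictions y_hat for all m examples
        dw = X.T @ (a - y) / m          # dJ/dw_k = (1/m) * sum_i (a_i - y_i) * x_k^(i)
        db = np.mean(a - y)             # dJ/db   = (1/m) * sum_i (a_i - y_i)
        w -= lr * dw                    # gradient-descent updates
        b -= lr * db
    return w, b

With a feature matrix X and label vector y, calling w, b = train_logistic(X, y) returns the fitted parameters.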
1 Epoch = one full pass over the m_train training examples

Training data, generated from the true parameters w = 2, b = 3:
m_train = 1000
w = 2
b = 3
x_data = 10*np.array(range(m_train))/m_train
y_data = x_data*w + np.random.randn(m_train) + b

Model definition: ŷ = w ⋅ x + b
Loss: MSE = (1/m_train) Σ_{n=1}^{m_train} (ŷₙ − yₙ)²
▪ Report guidelines (it doesn't have to be long; keep it short and easy to read)
  ▪ Plot the training data
  ▪ Describe the data shape in a few sentences, e.g., what does it look like?
  ▪ Guess your fitting results without coding and explain the reasoning behind your guess
  ▪ Attach your full code
  ▪ Show the training result → w, b, cost (J)
  ▪ How many times did you repeat training (number of epochs) to get a good result?
▪ Submit to iCampus
▪ Due: 3/19, 8:59 AM
import numpy as np
import matplotlib.pyplot as plt

# x_data, y_data, and m_train are defined in the data-generation step above

# Forward pass, loss, and gradients of the linear model y_hat = w*x + b
def forward_cal(w, b, x):
    return x*w + b

def loss_cal(w, b, x, y, y_pred):
    # squared error for one example
    return (y_pred - y)**2

def w_gradient(w, b, x, y, y_pred):
    # d/dw (y_pred - y)^2 = 2*x*(y_pred - y)
    return 2*x*(y_pred - y)

def b_gradient(w, b, x, y, y_pred):
    # d/db (y_pred - y)^2 = 2*(y_pred - y)
    return 2*(y_pred - y)

w = 0
b = 0
lr = 0.02
Nepoch = 400

# Training loop
for epoch in range(Nepoch):
    # initialize the accumulated gradients and loss for this epoch
    d_w = 0
    d_b = 0
    loss = 0
    for x_val, y_val in zip(x_data, y_data):
        y_pred = forward_cal(w, b, x_val)
        d_w += w_gradient(w, b, x_val, y_val, y_pred)
        d_b += b_gradient(w, b, x_val, y_val, y_pred)
        loss += loss_cal(w, b, x_val, y_val, y_pred)
    loss = loss/m_train
    w = w - lr * d_w/m_train
    b = b - lr * d_b/m_train
    if epoch % 100 == 0:
        print("epoch=%d, previous_loss=%f, w'=%f, b'=%f" % (epoch, loss, w, b))

plt.figure()
plt.plot(x_data, y_data, '*', label='Training data')
plt.plot(x_data, w*x_data + b, label='Trained model')
plt.xlabel('Input x')
plt.ylabel('Output y')
plt.legend()
plt.show()

▪ The fitted model is supposed to recover w = 2, b = 3.
▪ You may observe that training diverges if you give too large a learning rate.
▪ Launch Jupyter
▪ Select "Notebook", then select "Python 3"
▪ Press "Shift+Enter" to run each cell (including Markdown cells)
x1  x2 | y
0   0  | 0
0   1  | 0
1   0  | 0
1   1  | 1

𝑦 = 𝑥₁ AND 𝑥₂
Nepoch = 1000
cost_list = []
for epoch in range(Nepoch):
    cost = ~~                     # compute the cost J for this epoch here
    cost_list.append(cost)        # record it so the learning curve can be plotted later
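A hedged sketch of how the AND data above could be set up as numpy arrays for this loop (the array names X and y are illustrative):

import numpy as np

# the four AND training examples from the truth table above
X = np.array([[0, 0],
              [0, 1],
              [1, 0],
              [1, 1]], dtype=float)
y = np.array([0, 0, 0, 1], dtype=float)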
Testing
▪ Result analysis
▪ model.predict((0, 0)) → 7.1e-6
  ▪ This means that when you give (0, 0), the result is 7.1e-6.
  ▪ Hence, our trained model tells us the output is 0.
▪ model.predict((1, 1)) → 0.98
  ▪ Our model predicts the output as 0.98, which means 1.
▪ The model predicts the output well.
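The analysis above amounts to thresholding the predicted probability at 0.5; a minimal sketch (to_label is an illustrative helper, and model.predict's output is assumed to be a probability):

# threshold the predicted probability at 0.5 to obtain the class label
def to_label(p):
    return 1 if p >= 0.5 else 0

print(to_label(7.1e-6))   # -> 0  (prediction for input (0, 0))
print(to_label(0.98))     # -> 1  (prediction for input (1, 1))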
▪ Explain whether the logistic regression model works well for the AND, OR, and XOR data
▪ For one of these operators, logistic regression won't work. Which one?
▪ Due: 2025/3/27 (Thu) 11:59 PM (after in-person lecture07)
▪ So you have two more classes (including today) in which to ask questions before finishing.
▪ Submit to iCampus
Import numpy
Inner product
Addition
Scalar multiplication
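A minimal numpy sketch of these operations (the values are illustrative):

import numpy as np

a = np.array([1.0, 2.0, 3.0])
b = np.array([4.0, 5.0, 6.0])

print(np.dot(a, b))   # inner product: 1*4 + 2*5 + 3*6 = 32
print(a + b)          # element-wise addition: [5. 7. 9.]
print(3 * a)          # scalar multiplication: [3. 6. 9.]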