Assignment 1
Problem 1: Classification
Here are two reviews of Perfect Blue, from Rotten Tomatoes:
Rotten Tomatoes has classified these reviews as “positive” and “negative”, respectively, as indicated by the intact
tomato on the top and the splatter on the bottom. In this problem, you will create a simple text classification system
that can perform this task automatically. We’ll warm up with the following set of four mini-reviews, each labeled
positive (+1) or negative (-1):
Each review x is mapped to a feature vector ϕ(x) that records the number of occurrences of each word in the review. For example, the second review maps to the (sparse) feature vector ϕ(x) = {pretty: 1, bad: 1}.
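A minimal sketch of this featurizer in Python (the function name extract_features is my own; the assignment does not fix one):

```python
from collections import defaultdict

def extract_features(text):
    """Map a review string to a sparse word-count feature vector phi(x)."""
    phi = defaultdict(int)
    for word in text.split():
        phi[word] += 1
    return phi

# e.g. extract_features("pretty bad") == {"pretty": 1, "bad": 1}
```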
In this problem, we will use the hinge loss. Assuming a linear hypothesis space, and using the feature vector ϕ(x) as input, the loss on a single example is

Loss_hinge(x, y, w) = max{0, 1 − (w · ϕ(x)) y},

where x is the review text, y ∈ {+1, −1} is the correct label, and w is the weight vector. For example, with w = 0 the margin (w · ϕ(x)) y is 0, so the loss is max{0, 1 − 0} = 1.
1. Suppose we run stochastic gradient descent once for each of the 4 samples in the order given above, updating
the weights according to
w ← w − η ∇_w Loss_hinge(x, y, w).
After the updates, what are the weights of the six words (“pretty”, “good”, “bad”, “plot”, “not”, “scenery”)
that appear in the above reviews?
• Use η = 0.1 as the step size.
• Initialize w = [0, 0, 0, 0, 0, 0].
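A sketch of this SGD pass, assuming the standard subgradient of the hinge loss (−y ϕ(x) when the margin (w · ϕ(x)) y is below 1, and 0 otherwise). The four reviews listed are placeholders: only the second ("pretty bad") is recoverable from the feature-vector example above, and even its label is assumed.

```python
from collections import defaultdict

# Placeholder dataset over the six words; only review 2's words are
# known from the text, and all labels here are hypothetical.
data = [
    ("good plot", +1),       # hypothetical
    ("pretty bad", -1),      # words from phi(x) above; label assumed
    ("not good", -1),        # hypothetical
    ("pretty scenery", +1),  # hypothetical
]

eta = 0.1                 # step size
w = defaultdict(float)    # all six weights start at 0

for x, y in data:
    phi = defaultdict(int)
    for word in x.split():
        phi[word] += 1
    margin = sum(w[f] * v for f, v in phi.items()) * y
    if margin < 1:  # hinge loss active: subgradient is -y * phi(x)
        for f, v in phi.items():
            w[f] += eta * y * v  # w <- w - eta * (-y * phi(x))

print(dict(w))  # final weights of the words that appeared
```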
3. Prove that no linear classifier using word features (i.e., word counts) can achieve zero error on this dataset. Remember that this is a question about classifiers, not optimization algorithms; your proof should hold for any linear classifier of the form f_w(x) = sign(w · ϕ(x)), regardless of how the weights are learned.
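The problem asks for a proof, but a feasibility check can help confirm the claim numerically: zero training error for f_w(x) = sign(w · ϕ(x)) is achievable iff some w satisfies (w · ϕ(x_i)) y_i ≥ 1 for all i, since any strictly separating w can be rescaled to margin 1. A sketch of that check with scipy.optimize.linprog, on a hypothetical two-review toy set:

```python
import numpy as np
from scipy.optimize import linprog

def linearly_separable(Phi, y):
    """Return True iff some w achieves y_i * (w . phi_i) >= 1 for all i."""
    # Constraints: -y_i * (phi_i . w) <= -1; objective is irrelevant
    # because we only care about feasibility.
    A_ub = -(y[:, None] * Phi)
    b_ub = -np.ones(len(y))
    d = Phi.shape[1]
    res = linprog(c=np.zeros(d), A_ub=A_ub, b_ub=b_ub,
                  bounds=[(None, None)] * d)  # w unconstrained in sign
    return res.status == 0  # 0 = feasible, 2 = infeasible

# Hypothetical check: the same review "good" labeled both +1 and -1
# can never be separated, since one phi would need both signs.
Phi = np.array([[1.0], [1.0]])
y = np.array([1.0, -1.0])
print(linearly_separable(Phi, y))  # False
```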
4. Propose a single additional feature we could add to the feature vector that would fix this problem.
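One commonly proposed fix (offered here as an assumption, since the intended answer depends on the hidden reviews) is a single feature that captures local word order, such as a count of the bigram "not good", so that negation gets its own weight separate from "not" and "good". A sketch:

```python
from collections import defaultdict

def extract_features_v2(text):
    """Word counts plus one extra feature counting the bigram
    "not good" (a hypothetical choice; other single features work too)."""
    words = text.split()
    phi = defaultdict(int)
    for word in words:
        phi[word] += 1
    phi["not good"] = sum(
        1 for a, b in zip(words, words[1:]) if (a, b) == ("not", "good")
    )
    return phi
```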
1. Suppose that we wish to use squared loss. Write out the expression for Loss(x, y, w) for a single datapoint
(x, y).
2. Given Loss(x, y, w) from the previous part, compute the gradient of the loss with respect to w, ∇_w Loss(x, y, w). Write the answer in terms of the predicted value p = σ(w · ϕ(x)).
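For reference, a sketch of the chain-rule computation, under the assumption that the intended loss is Loss(x, y, w) = (σ(w · ϕ(x)) − y)², using the identity σ′(z) = σ(z)(1 − σ(z)):

∇_w Loss(x, y, w) = 2 (p − y) σ′(w · ϕ(x)) ϕ(x) = 2 (p − y) p (1 − p) ϕ(x), where p = σ(w · ϕ(x)).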