Lect 2 in Machine Learning For NLP

This document discusses natural language processing and machine learning techniques for NLP tasks. It covers supervised and unsupervised learning, with supervised learning involving classification using labeled training examples. Support vector machines are described as linear classifiers that find a hyperplane to separate classes of data, and can perform nonlinear separation using kernel functions.

Uploaded by

Mohamed Adel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

37 views17 pages

Lect 2 in Machine Learning For NLP

Uploaded by

Mohamed Adel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 17

Natural Language Processing

Machine learning for NLP

Supervised vs. unsupervised Learning
• Supervised learning: classification is seen as
supervised learning from examples.
– Supervision: The data (observations,
measurements, etc.) are labeled with pre-defined
classes. It is like that a “teacher” gives the classes
(supervision).
– Test data are classified into these classes too.
• Unsupervised learning (clustering)
– Class labels of the data are unknown
– Given a set of data, the task is to establish the
existence of classes or clusters in the data

CS583, Bing Liu, UIC 7

Supervised learning process: two steps
 Learning (training): Learn a model using the
training data
 Testing: Test the model using unseen test data
to assess the model accuracy
Number of correct classifica tions
Accuracy  ,
Total number of test cases

CS583, Bing Liu, UIC 8

What do we mean by learning?
• Given
– a data set D,
– a task T, and
– a performance measure M,
a computer system is said to learn from D to
perform the task T if after learning the
system’s performance on T improves as
measured by M.
• In other words, the learned model helps the
system to perform T better as compared to no
learning.
CS583, Bing Liu, UIC 9
Support vector machines
• SVMs are linear classifiers that find a hyperplane to
separate two class of data, positive and negative.
• Kernel functions are used for nonlinear separation.
• SVM not only has a rigorous theoretical foundation, but
also performs classification more accurately than most
other methods in applications, especially for high
dimensional data.
• It is perhaps the best classifier for text classification.

CS583, Bing Liu, UIC 14

Basic concepts
• Let the set of training examples D be
{(x1, y1), (x2, y2), …, (xr, yr)},
where xi = (x1, x2, …, xn) is an input vector in a real-
valued space X  Rn and yi is its class label (output value),
yi  {1, -1}.
1: positive class and -1: negative class.
• SVM finds a linear function of the form (w: weight
vector)
f(x) = w  x + b
 1 if  w  x i   b  0
yi  
 1 if  w  x i   b  0
CS583, Bing Liu, UIC 15
The hyperplane
• The hyperplane that separates positive and negative
training data is
w  x + b = 0
• It is also called the decision boundary (surface).
• So many possible hyperplanes, which one to choose?

CS583, Bing Liu, UIC 16

Maximal margin hyperplane
• SVM looks for the separating hyperplane with the largest margin.
• Machine learning theory says this hyperplane minimizes the error
bound

CS583, Bing Liu, UIC 17

Machine Learning
No ratings yet
Machine Learning
17 pages
Machine Learning IAI
No ratings yet
Machine Learning IAI
94 pages
ML 2
No ratings yet
ML 2
166 pages
Lec-1 ML Intro
No ratings yet
Lec-1 ML Intro
15 pages
Unit 1 PDF
No ratings yet
Unit 1 PDF
135 pages
Unit 1 - Machine Learning
No ratings yet
Unit 1 - Machine Learning
21 pages
MI - Unit 3
No ratings yet
MI - Unit 3
107 pages
CS583 Supervised Learning
No ratings yet
CS583 Supervised Learning
166 pages
Support Vector Machine: Prof. Subodh Kumar Mohanty
No ratings yet
Support Vector Machine: Prof. Subodh Kumar Mohanty
52 pages
Support Vector Machine: Abinas Panda
No ratings yet
Support Vector Machine: Abinas Panda
52 pages
Slide 10 Chapter9 Classification Advanced Methods
No ratings yet
Slide 10 Chapter9 Classification Advanced Methods
46 pages
AI
No ratings yet
AI
52 pages
ML Unit - 2
No ratings yet
ML Unit - 2
36 pages
SML Lecture1
No ratings yet
SML Lecture1
37 pages
Supervised Learning
No ratings yet
Supervised Learning
30 pages
SUpport Vector Machine
No ratings yet
SUpport Vector Machine
28 pages
Unit II
No ratings yet
Unit II
25 pages
Week 8. Supervised Learning. Classification
No ratings yet
Week 8. Supervised Learning. Classification
45 pages
AP For NLP-LO2
No ratings yet
AP For NLP-LO2
38 pages
Lecture 4.1 Machine Learning Deep Learning Reinforcement Learning
No ratings yet
Lecture 4.1 Machine Learning Deep Learning Reinforcement Learning
32 pages
Topic 5-Types of Machine Learning
No ratings yet
Topic 5-Types of Machine Learning
31 pages
E-Notes 34758 Content Document 20250415115803AM
No ratings yet
E-Notes 34758 Content Document 20250415115803AM
23 pages
Lecture 2
No ratings yet
Lecture 2
22 pages
Algorithm of Neural Network M4
No ratings yet
Algorithm of Neural Network M4
25 pages
Unit5 ML Introduction
No ratings yet
Unit5 ML Introduction
32 pages
Chapter 2
No ratings yet
Chapter 2
31 pages
IR - Group1
No ratings yet
IR - Group1
27 pages
Unit-1 DL
No ratings yet
Unit-1 DL
29 pages
Mod09-ppt2-ML in Image Classification
No ratings yet
Mod09-ppt2-ML in Image Classification
30 pages
Lecture 8-2 - Text Classification, Naïve Bayes, Vector Space Classification
No ratings yet
Lecture 8-2 - Text Classification, Naïve Bayes, Vector Space Classification
30 pages
ITD253 L6 TextClassificationClustering
No ratings yet
ITD253 L6 TextClassificationClustering
39 pages
SVM VS SVC
No ratings yet
SVM VS SVC
27 pages
3.unit 3 ML Part-2 Q&A
No ratings yet
3.unit 3 ML Part-2 Q&A
23 pages
Lecture 02 Supervised Learning 27102022 124322am
No ratings yet
Lecture 02 Supervised Learning 27102022 124322am
29 pages
Unit 3ML
No ratings yet
Unit 3ML
23 pages
W9 ML Overview NRG
No ratings yet
W9 ML Overview NRG
21 pages
University Institute of Engineering Department of Computer Science and Engg
No ratings yet
University Institute of Engineering Department of Computer Science and Engg
27 pages
Sde Problems
No ratings yet
Sde Problems
8 pages
Lecture#11
No ratings yet
Lecture#11
19 pages
AI Lec3
No ratings yet
AI Lec3
22 pages
Machine Learning: Dr. Windhya Rankothge (PHD - Upf, Barcelona)
No ratings yet
Machine Learning: Dr. Windhya Rankothge (PHD - Upf, Barcelona)
44 pages
Lesson 2 - Machine Learning
No ratings yet
Lesson 2 - Machine Learning
14 pages
QUESTIONS
No ratings yet
QUESTIONS
20 pages
01 Ml-Overview Notes
No ratings yet
01 Ml-Overview Notes
19 pages
Manish NTCC Presentation Sem 5
No ratings yet
Manish NTCC Presentation Sem 5
11 pages
CH 5 SVM
No ratings yet
CH 5 SVM
25 pages
Sensitivity Analysis: Lindo Input & Results
No ratings yet
Sensitivity Analysis: Lindo Input & Results
16 pages
Comparative Study of Four Supervised Machine Learning Techniques For Classification
No ratings yet
Comparative Study of Four Supervised Machine Learning Techniques For Classification
15 pages
DA Unit 3,4
No ratings yet
DA Unit 3,4
11 pages
Data Analysis ch1
No ratings yet
Data Analysis ch1
13 pages
3 Introduction To Machine Learning
No ratings yet
3 Introduction To Machine Learning
21 pages
1 Introduction To Machine Learning
No ratings yet
1 Introduction To Machine Learning
32 pages
(BI 2025-1) Lesson11
No ratings yet
(BI 2025-1) Lesson11
26 pages
Data Science Unit 3
No ratings yet
Data Science Unit 3
10 pages
Introduction of Machine Learning
No ratings yet
Introduction of Machine Learning
9 pages
Machine Learning Models For News Article Classification
No ratings yet
Machine Learning Models For News Article Classification
8 pages
Ankita
No ratings yet
Ankita
10 pages
Unit 2
No ratings yet
Unit 2
10 pages
Lec # 9
No ratings yet
Lec # 9
18 pages
Unit 4 Learning
No ratings yet
Unit 4 Learning
5 pages
Introduction. Binary Classification and Bayes Optimal Classifier
No ratings yet
Introduction. Binary Classification and Bayes Optimal Classifier
7 pages
Merton Truck Excel
No ratings yet
Merton Truck Excel
35 pages
Section 1.8 Gaussian Elimination With Pivoting
No ratings yet
Section 1.8 Gaussian Elimination With Pivoting
8 pages
Fuzzy Inference System
No ratings yet
Fuzzy Inference System
7 pages
(AI) Searching
No ratings yet
(AI) Searching
49 pages
DBSCAN Clustering Algorithm: Presented by
No ratings yet
DBSCAN Clustering Algorithm: Presented by
22 pages
Maths Project
No ratings yet
Maths Project
40 pages
K Maps - Karnaugh Maps - Solved Examples: Minimization of Boolean Expressions
No ratings yet
K Maps - Karnaugh Maps - Solved Examples: Minimization of Boolean Expressions
18 pages
Intro. To NLP
No ratings yet
Intro. To NLP
18 pages
3.0sg Systems of Linear Equations and Inequalities - Study Guide
No ratings yet
3.0sg Systems of Linear Equations and Inequalities - Study Guide
1 page
Halting Problem Presentation
No ratings yet
Halting Problem Presentation
2 pages
NLP Ambiguity
No ratings yet
NLP Ambiguity
35 pages
Semantic Report
No ratings yet
Semantic Report
24 pages
CS721 q9 Spring 2024 HW - 2
No ratings yet
CS721 q9 Spring 2024 HW - 2
5 pages
Assignment No-4 Subject: Cse-202: Object Oriented Programming
No ratings yet
Assignment No-4 Subject: Cse-202: Object Oriented Programming
9 pages
Artificial Intelligence Fundamentals Midterm Q1
No ratings yet
Artificial Intelligence Fundamentals Midterm Q1
4 pages
2019vgg Vqealgorithmhacker Dojowbg 190911191759
No ratings yet
2019vgg Vqealgorithmhacker Dojowbg 190911191759
12 pages
Data Types: in C Programming
No ratings yet
Data Types: in C Programming
11 pages
Matlab Optimization Toolbox: Most Materials Are Obtained From Matlab Website
No ratings yet
Matlab Optimization Toolbox: Most Materials Are Obtained From Matlab Website
12 pages
F16midterm1 Solution
No ratings yet
F16midterm1 Solution
8 pages
Complex Systems 535/physics 508: Homework 1
No ratings yet
Complex Systems 535/physics 508: Homework 1
2 pages
Unit 4 (Optimization)
No ratings yet
Unit 4 (Optimization)
50 pages
Emotion
No ratings yet
Emotion
42 pages
A Tabu Search Algorithm
No ratings yet
A Tabu Search Algorithm
19 pages
Data Compression (KCS-064) FIRST SESSIONAL EXAM 2020-21 EVEN SEMESTER B.TECH CSE-3RD YEAR
No ratings yet
Data Compression (KCS-064) FIRST SESSIONAL EXAM 2020-21 EVEN SEMESTER B.TECH CSE-3RD YEAR
10 pages
Tut 3
No ratings yet
Tut 3
3 pages
Char CRC5
No ratings yet
Char CRC5
6 pages
Natural Language Processing
No ratings yet
Natural Language Processing
12 pages
Decision Stump Algorithm 1
No ratings yet
Decision Stump Algorithm 1
4 pages
Coding Theorems For A Discrete Source A Criterion-: With Fidelity
No ratings yet
Coding Theorems For A Discrete Source A Criterion-: With Fidelity
1 page
Editorial - ICPC Dhaka Regional 2020 Online Preliminary
No ratings yet
Editorial - ICPC Dhaka Regional 2020 Online Preliminary
9 pages
DS Graph Assignment: DFS Program
No ratings yet
DS Graph Assignment: DFS Program
6 pages
DWH&DM Ver2
No ratings yet
DWH&DM Ver2
3 pages
Kernel Methods: Fundamentals and Applications
From Everand
Kernel Methods: Fundamentals and Applications
Fouad Sabry
No ratings yet