Ai - W6L11

Uploaded by

sajidajalil63
ARTIFICIAL INTELLIGENCE

WEEK 06
LECTURE 11
TOPICS TO COVER IN THIS LECTURE

• Features: organization of features
• Feature templates
• Sparsity in feature vectors
• Feature vector representation
FEATURES

• Features are a critical part of machine learning which often do not get as much attention as they deserve. Ideally, they would be given to us by a domain expert.
• The prediction is driven by the score w · φ(x).
• In regression, we predict the score directly; in binary classification, we predict the sign of the score.
• So far, we have fixed φ(x) and used learning to set w. Now, we will explore how φ(x) affects the prediction.
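The score-driven prediction above can be sketched in a few lines of Python. This is a minimal illustration, not from the lecture; the weights and feature names are made up for the example:

```python
def dot(w, phi):
    # Score w . phi(x): sum over the feature vector; weights absent
    # from w implicitly count as zero.
    return sum(w.get(k, 0.0) * v for k, v in phi.items())

w = {"contains_@": 2.0, "length>10": -0.5}   # illustrative learned weights
phi = {"contains_@": 1, "length>10": 1}      # feature vector phi(x)

score = dot(w, phi)              # regression: predict the score directly
label = 1 if score >= 0 else -1  # binary classification: predict the sign
```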
ORGANIZATION OF FEATURES

The organization of features directly impacts the performance, efficiency, and interpretability of machine learning models. Properly managed features help in building robust models that generalize well to unseen data.

• Task: predict whether a string is an email address.

• How would we go about creating good features?

• Here, we used our prior knowledge to define certain features (e.g., contains @) which we believe are helpful for detecting email addresses.

• But this is ad hoc: which strings should we include? We need a more systematic way to define features.
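The hand-crafted features for the email-address task can be sketched as follows. The feature names here are illustrative, not prescribed by the lecture:

```python
def extract_features(x):
    # Map a raw string to a dictionary of hand-crafted features.
    n = len(x)
    return {
        "contains_@": 1 if "@" in x else 0,
        "endsWith_.com": 1 if x.endswith(".com") else 0,
        # Fraction of alphanumeric characters (guard against empty input).
        "fracAlphanumeric": sum(c.isalnum() for c in x) / n if n else 0.0,
    }

features = extract_features("abc@gmail.com")
```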
FEATURE TEMPLATES
• Feature templates are often used in the context of
natural language processing (NLP) and other domains
where you want to define a structure for extracting
features from raw data.
• They serve as blueprints for how to transform data into a
format that a machine learning model can understand.

• A feature template specifies how to extract relevant information from input data. It outlines the rules or patterns for identifying features.
• A feature template is a group of features all computed in a similar way.
• A feature template also allows us to group a set of related features (contains @, contains a, contains b).
• This reduces the burden on the feature engineer, since we don't need to know which particular characters are useful, only that the presence of certain single characters is a useful cue to look at.
• We can write each feature template as an English description with a blank (___), which is to be filled in with an arbitrary string. Also note that feature templates are most natural for defining binary features, ones which take on value 1 (true) or 0 (false).
• Note that an isolated feature (e.g., fraction of alphanumeric characters) can be treated as a trivial feature template with no blanks to be filled.
• As another example, if x is a k × k image, then the feature template "pixel intensity at position (i, j)" yields k² features pixelIntensity_ij, one per pixel position.
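A feature template can be sketched as a function that fills in the blank from the input and emits the resulting features. The two templates below (suffix and character containment) follow the lecture's examples; the naming scheme is an assumption:

```python
def last_three_chars_template(x):
    # Template "last three characters equals ___": the blank is filled
    # with the observed suffix, producing one binary feature.
    return {"endsWith_" + x[-3:]: 1}

def contains_char_template(x):
    # Template "contains character ___": one binary feature per
    # distinct character seen in the input.
    return {"contains_" + c: 1 for c in set(x)}

f = last_three_chars_template("abc@gmail.com")
```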
SPARSITY IN FEATURE VECTORS
• Sparsity in feature vectors refers to the condition where a significant number of elements in a vector are zero (or some default value). This is common in many datasets, especially in fields like natural language processing, image processing, and recommendation systems.
• For example, in a vector of size 1000, if only 50 components are non-zero, it is a sparse vector.
• Common in high-dimensional data: the more features there are (e.g., one per word in a vocabulary), the fewer tend to be non-zero for any given input.
• Storage and efficiency: sparse data structures store only the non-zero elements and their indices.
• Impact on algorithms: linear models and support vector machines can benefit from sparsity, as they may converge faster when dealing with sparse data.
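The storage saving and the algorithmic benefit can both be seen in a small sketch (the vector size and values are illustrative):

```python
# Dense representation: 1000 slots, only two of them non-zero.
dense = [0.0] * 1000
dense[3] = 2.5
dense[977] = 1.0

# Sparse representation: store only the non-zero entries and their indices.
sparse = {3: 2.5, 977: 1.0}

def sparse_dot(a, b):
    # Iterate over the smaller map; keys missing from the other map
    # implicitly contribute zero to the sum.
    if len(a) > len(b):
        a, b = b, a
    return sum(v * b.get(k, 0.0) for k, v in a.items())
```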
FEATURE VECTOR REPRESENTATIONS
• Arrays assume a fixed ordering of the features and represent the feature values as an array. This representation is appropriate when the number of non-zeros is significant (the features are dense).
• Arrays are especially efficient in terms of space and speed (and you can take advantage of GPUs, Graphics Processing Units). In computer vision applications, features (e.g., the pixel intensity features) are generally dense, so the array representation is more common.

• Use of maps: a map (also known as a dictionary, associative array, or hash table) is a collection of key-value pairs where each key is unique and is used to access its corresponding value.
• When we have sparsity (few non-zeros), it is typically more efficient to represent the feature vector as a map from strings to doubles rather than a fixed-size array of doubles.
• The features not in the map implicitly have a default value of zero. This sparse representation is very useful in natural language processing, and is what allows us to work effectively over trillions of features.
• In Python, one would define a feature vector φ(x) as the dictionary {"endsWith_" + x[-3:]: 1}.
• Maps do incur extra overhead compared to arrays, and are therefore much slower when the features are not sparse.
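The two representations of the same feature vector can be contrasted directly. The feature names below are illustrative:

```python
# Array representation: a fixed feature ordering is assumed,
# so index i always means the same feature.
dense = [0.2, 0.0, 1.0, 0.0]

# Map representation: feature name -> value; absent features
# implicitly default to zero.
phi = {"fracAlpha": 0.2, "endsWith_com": 1.0}

missing = phi.get("contains_@", 0.0)  # not stored: implicitly 0.0
```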
QUESTIONS?
