
This document discusses active learning algorithms for multi-view learning, including Co-Testing and Co-EMT. Co-Testing uses initially labeled data to train classifiers in different views, then queries the user to label examples where the views disagree. Co-EMT improves on Co-Testing by interleaving it with Co-EM, a semi-supervised multi-view learner, allowing the algorithms to benefit from each other's labeled and unlabeled data. The document provides an example of applying these techniques to extract phone numbers from documents using forward-looking and backward-looking rules as different views.


Active Learning with Multiple Views

and Adaptive View Validation. Note that these algorithms are not specific to wrapper induction, and they have been applied to a variety of domains, such as text classification, advertisement removal, and discourse tree parsing (Muslea, 2002).

Co-Testing: Multi-View Active Learning

Co-Testing (Muslea, 2002; Muslea et al., 2000), which is the first multi-view approach to active learning, works as follows:

• First, it uses a small set of labeled examples to learn one classifier in each view.
• Then, it applies the learned classifiers to all unlabeled examples and asks the user to label one of the examples on which the views predict different labels.
• Finally, it adds the newly labeled example to the training set and repeats the whole process.

Intuitively, Co-Testing relies on the following observation: if the classifiers learned in each view predict different labels for an unlabeled example, at least one of them makes a mistake on that prediction. By asking the user to label such an example, Co-Testing is guaranteed to provide useful information for the view that made the mistake.

To illustrate Co-Testing for wrapper induction, consider the task of extracting restaurant phone numbers from documents similar to the one shown in Figure 2. To extract this information, the wrapper must detect both the beginning and the end of the phone number. For instance, to find where the phone number begins, one can use the following rule:

R1 = SkipTo( Phone:<i> )

Figure 2. The forward rule R1 and the backward rule R2 detect the beginning of the phone number. Forward and backward rules have the same semantics and differ only in terms of where they are applied from (the start or the end of the document) and in which direction they scan.
R1: SkipTo( Phone : <i> )    R2: BackTo( Cuisine ) BackTo( (Number) )
Name: <i>Gino’s </i> <p>Phone :<i> (800)111-1717 </i> <p> Cuisine : …

This rule is applied forward, from the beginning of the page, and it ignores everything until it finds the string Phone:<i>. Note that this is not the only way to detect where the phone number begins. An alternative way to perform this task is to use the following rule:

R2 = BackTo( Cuisine ) BackTo( ( Number ) )

which is applied backward, from the end of the document. R2 ignores everything until it finds “Cuisine” and then, again, skips to the first number between parentheses.

Note that R1 and R2 represent descriptions of the same concept (i.e., the beginning of the phone number) that are learned in two different views (see Muslea et al. [2001] for details on learning forward and backward rules). That is, views V1 and V2 consist of the sequences of characters that precede and follow the beginning of the item, respectively. View V1 is called the forward view, while V2 is the backward view. Based on V1 and V2, Co-Testing can be applied in a straightforward manner to wrapper induction. As shown in Muslea (2002), Co-Testing clearly outperforms existing state-of-the-art algorithms, both on wrapper induction and on a variety of other real-world domains.

Co-EMT: Interleaving Active and Semi-Supervised Learning

To further reduce the need for labeled data, Co-EMT (Muslea et al., 2002a) combines active and semi-supervised learning by interleaving Co-Testing with Co-EM (Nigam & Ghani, 2000). Co-EM, which is a semi-supervised, multi-view learner, can be seen as the following iterative, two-step process: first, it uses the hypotheses learned in each view to probabilistically label all the unlabeled examples; then, it learns a new hypothesis in each view by training on the probabilistically labeled examples provided by the other view.

By interleaving active and semi-supervised learning, Co-EMT creates a powerful synergy. On one hand, Co-Testing boosts Co-EM’s performance by providing it with highly informative labeled examples (instead of random ones). On the other hand, Co-EM provides Co-Testing with more accurate classifiers (learned from both labeled and unlabeled data), thus allowing Co-Testing to make more informative queries.

Co-EMT has not yet been applied to wrapper induction, because the existing algorithms are not probabilistic
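The forward and backward extraction rules above can be sketched in a few lines of Python. The `skip_to` and `back_to` helpers below are hypothetical simplifications of the landmark semantics described in the text (real wrapper-induction systems learn such rules rather than hand-code them), and the sample document is adapted from Figure 2:

```python
import re

# Sample document, similar to the one in Figure 2 (cuisine value is assumed).
doc = "Name: <i>Gino's </i> <p>Phone :<i> (800)111-1717 </i> <p> Cuisine : Italian"

def skip_to(doc, landmark, start=0):
    """Forward rule: scan from `start` and return the index just after `landmark`."""
    i = doc.find(landmark, start)
    return None if i == -1 else i + len(landmark)

def back_to(doc, pattern, end=None):
    """Backward rule: scan backward from `end` and return where the nearest match starts."""
    end = len(doc) if end is None else end
    matches = list(re.finditer(pattern, doc[:end]))
    return None if not matches else matches[-1].start()

# R1 = SkipTo( Phone :<i> ): the forward view V1 locates the start of the phone number.
r1 = skip_to(doc, "Phone :<i>")

# R2 = BackTo( Cuisine ) BackTo( (Number) ): the backward view V2 goes back to
# "Cuisine", then back again to the nearest "(digits)" before it.
cuisine = back_to(doc, r"Cuisine")
r2 = back_to(doc, r"\(\d+\)", end=cuisine)

# Both views describe the same concept: the beginning of the phone number.
print(doc[r2:r2 + 13])
```

Both rules land on the same item, which is exactly what makes V1 and V2 usable as redundant views for Co-Testing.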

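The Co-Testing loop described above can also be sketched directly. The toy dataset, the single-feature views, the 1-NN "classifiers", and the oracle below are all invented for illustration; the original work learns extraction rules, not nearest-neighbor classifiers:

```python
# A minimal Co-Testing sketch: two views, one toy 1-NN classifier per view.

def knn_predict(labeled, x):
    """1-NN classifier in a single view: label of the closest labeled value."""
    return min(labeled, key=lambda pair: abs(pair[0] - x))[1]

# Each example is (view1_value, view2_value); true concept: positive iff view1 >= 5.
examples = [(1.0, 0.9), (2.0, 2.2), (6.0, 1.1), (7.0, 7.2), (8.0, 8.1), (3.0, 6.9)]
oracle = {x: int(x[0] >= 5) for x in examples}        # the user we can query

labeled = {examples[0]: 0, examples[3]: 1}            # small initial labeled set
unlabeled = [x for x in examples if x not in labeled]

while unlabeled:
    # Step 1: learn one classifier per view from the labeled examples.
    view1 = [(x[0], y) for x, y in labeled.items()]
    view2 = [(x[1], y) for x, y in labeled.items()]
    # Step 2: find contention points -- unlabeled examples where the views disagree.
    contention = [x for x in unlabeled
                  if knn_predict(view1, x[0]) != knn_predict(view2, x[1])]
    if not contention:
        break                                         # the views agree everywhere
    # Step 3: ask the user to label one contention point, then retrain.
    query = contention[0]
    labeled[query] = oracle[query]
    unlabeled.remove(query)

print(len(labeled), "labels used out of", len(examples))
```

Every query goes to an example on which the views disagree, so at least one view is guaranteed to learn from it; examples the views already agree on are never sent to the user.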