0% found this document useful (0 votes)

59 views29 pages

Introduction To Multiple Instance Learning PDF

This document provides an introduction to multiple instance learning (MIL). It begins by defining MIL as a form of weakly supervised learning where training instances are arranged in bags and labels are provided for bags rather than individual instances. It then discusses some common applications of MIL including content-based image retrieval, computer-aided diagnosis from medical images, and sentiment analysis of text documents. The document outlines different types of MIL approaches including instance space methods that classify each instance and bag space methods that model distributions between instances. It concludes by describing characteristics that differentiate MIL problems such as whether the task is instance classification or bag classification, the composition of bags, and the relationship between instances.

Uploaded by

Alba Alba

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

59 views29 pages

Introduction To Multiple Instance Learning PDF

Uploaded by

Alba Alba

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 29

Introduction to Multiple

Instance Learning
Marc-André Carbonneau
Supervisors : Eric Granger and Ghyslain Gagnon
October 19th 2016
Outline of the presentation
1. Definition and formulation.
2. Applications.
3. Type of approaches
4. Characteristics of MIL problems.
What Is Multiple Instance
Learning?
Problem Formulation
Multiple Instance Learning
What it is:
• It is a form of weakly supervised learning.
• Training instances are arranged in sets, called bags.
• A label is provided for entire bags but not for instances.

What it is not:
• Supervised learning
• Unsupervised learning
• Semi-supervised learning
Illustration of a MIL problem
Can enter the secret room
Can I the secret room???

Can not enter the secret room

What is the magic key???
Why use Multiple Instance Learning?
It has been proposed because:
• Some problems are naturally formulated as MIL

It is gaining momentum in the pattern recognition community because:

• It deals with weakly annotated data.
• This reduces the annotation cost.
• Algorithms can now learn from a greater quantity of training data.
Definition of the standard MIL assumption
• Training instances are arranged in sets
generally called bags.
• A label is given to bags but not to individual
instances.
• Negative bags do not contain positive
instances.
• Positive bags may contain negative and
positive instances.
• Positive bags contain at least one positive
instance.
Image from : http://www.miproblems.org/mi-learning/
Relaxed MIL assumptions
In many applications, the standard MIL assumption is to restrictive. MIL
can alternatively formulated as:
• A bag is positive when it contains a sufficient number of positive
instances.
• A bag is positive when it contains a certain combination of positive
instances.
• Positive and negative bags differ by their instance distributions.

More on MIL assumption: J. Foulds and E. Frank, “A Review of Multi-Instance Learning Assumptions,” Knowl. Eng. Rev., vol. 25,
no. 1, pp. 1–25, Mar. 2010.
Example of
relaxed MIL
assumptions
• Both sand and water
segments are positive
instances for beach
pictures.
• However, picture of
beach must contain both
segments of sand and
water. Otherwise, they
can be pictures of desert
or sea.
Image from : J. Amores, “Multiple instance classification: Review, taxonomy and comparative study,”
Artif. Intell., vol. 201, pp. 81–105, Aug. 2013.
Tasks that can be performed in MIL
Group-based
Bag classification in Instance classification
Supervised Learning classification and set
MIL in MIL
classification

Image from: V. Cheplygina, D. M. J. Tax, and M. Loog, “On classification with bags, groups and sets,” Pattern Recognition Letters, vol. 59, pp. 11–17, Jul. 2015.
What Can I Do with Multiple
Instance Learning?
Applications
Molecule Classification
This is the first MIL application published in:
T. G. Dietterich, R. H. Lathrop, and T. Lozano-Pérez, “Solving the Multiple
Instance Problem with Axis-parallel Rectangles,” Artificial Intelligence
1997.
Objective: Predict if a molecule produces a given effect.
Bag: Collection of all conformations of the same molecule.
Instance: Conformation of a molecule.
Justification: Conformations are not observable individually.
Content Base Image Retrieval
Objective: Classify images based
on their subject.
Bag: Collection segments or
patches extracted from an image.
Instance: Image segments or
patches.
Justification: Images can
represent composite objects or
concepts.
Note: Bag-of-words methods are
MIL methods.
Image from: Y. Chen, J. Bi, and J. Z. Wang, “MILES: Multiple-Instance Learning via Embedded
Instance Selection,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 28, no. 12, pp. 1931–1947,
2006.
Object Localization in Image
Objective: Find objects in images.
Bag: Collection of candidate annotation boxes
Instance: Sub-image corresponding to
candidate windows.
Justification: A large quantity of data can be
used to learn because costly strong
annotations are not necessary.

H. O. Song, R. Girshick, S. Jegelka, J. Mairal, Z. Harchaoui, and T.

Darrell, “On learning to localize objects with minimal supervision,”
International Conference on Machine Learning, 2014
Computer Aided Diagnosis (from images)
Objective: Predict if a subject is diseased or
healthy.
Bags: Collection segments or patches
extracted from a medical image.
Instances: Image segments or patches.
Justification: A large quantity of images can Image from: M. Kandemir and F. A. Hamprecht, “Computer-aided
diagnosis from weak supervision: a benchmarking study.,” Comput. Med.
be used to train. Only a diagnosis is required Imaging Graph., vol. 42, pp. 44–50, Jun. 2015.

per image. Expert local annotation are no

longer required.
Sentiment Analysis in Text
Objective: Predict if a text/sentence
expresses positive or negative
sentiment.
Bags: Texts/paragraphs.
Instances: Sentences.
Justification: Large quantity of text Image from: D. Kotzias, M. Denil, P. Blunsom, and N. de Freitas, “Deep Multi-Instance
Transfer Learning,” CoRR, vol. abs/1411.3, 2014.

can be harvested from the web. A

sentiment is usually given to a
complete text while it may contain
positive and negative sentences.
How Can I Do Multiple Instance
Learning?
Types of Methods
Taxonomy of MIL Methods
A generally accepted taxonomy divides MIL methods based on their
reasoning space:

Taxonomy from: J. Amores, “Multiple instance classification: Review, taxonomy and comparative study,” Artificial Intelligence, vol. 201, pp. 81–105, Aug. 2013.
Instance Space Methods
These methods try to uncover the true nature of each
instance in order to make a decision on bag labels.
MI-SVM
Pros: mi-SVM
APR
• Can be directly used for instance classification tasks. RSIS
Cons: EM-DD
MIL-Boost
• Do not work when instances have no precise classes. SbMIL

• Usually less accurate than bag space methods.

Bag Space Methods
These methods embed the content of bags in a single feature
vector, thus transforming the problem into supervised learning.
Alternatively, they use set distance metrics to compare bags
directly.
Pros: MILES
Citation-kNN
• Can model distributions and relation between instances. MnID
• Deal with unclassifiable instances. Bag-of-Words
NSK-SVM
• Can be faster than instance based methods (when an CCE
embedding is used). Mi-Graph
EMD-SVM
• Often more accurate for bag classification tasks.
Cons:
• Cannot be directly used for instance classification tasks.
What Differentiates Multiple
Instance Learning Problems?
Characteristics of MIL problems
Characteristics of MIL Problems
There are characteristics that differentiates a MIL problem from other
MIL problems. These characteristics can be related to four distinctive
properties of MIL.

Image from: M.-A. Carbonneau, V. Cheplygina, E. Granger, and G. Gagnon, “Multiple Instance Learning: A Survey on Problems Characteristics and Applications,” to be
submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017.
Task
Instance and bag classification
are two different tasks.
It has been observed by many
authors that the best algorithm
for instance classification is
rarely the best for bag
classification.
G. Vanwinckelen, V. do O, D. Fierens, and H. Blockeel, “Instance-level accuracy
versus bag-level accuracy in multi-instance learning,” Data Mining Knowledge
Discovery, 2015.

The key difference is the

instance misclassifying cost. Image from: M.-A. Carbonneau, E. Granger, and G. Gagnon, “Decision Threshold Adjustment
Strategies for Increased Accuracy in Multiple Instance Learning,” in Proc. The 6th International
Conference on Image Processing Theory, Tools and Applications (IPTA), 2016.
Bag Composition
Depending on the applications, bags can differ in:
• The proportion of positive instances in positive bags (witness rate).
• The size of the bags.
Images from: M.-A. Carbonneau, V. Cheplygina, E. Granger, and G. Gagnon, “Multiple

• The relation between the instances: Instance Learning: A Survey on Problems Characteristics and Applications,” to be
submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017.

• Co-occurences
• Structure
• Intra-bags similarities
Data Distribution
The type of distribution is important when choosing a MIL algorithm.
Not all MIL algorithms easily deal with :
• Multi-modal distributions
• Unknown negative distribution

Images from: M.-A. Carbonneau, V. Cheplygina, E. Granger, and G.

Gagnon, “Multiple Instance Learning: A Survey on Problems
Characteristics and Applications,” to be submitted to IEEE Transactions
on Pattern Analysis and Machine Intelligence, 2017.
Label Ambiguity
Weak supervision implies label
ambiguity. The ambiguity can be due
to:
• Noise.
• Lack of clear classes at instance
level.
• Ambiguous representation.
• Classes can share the same type of Images from: M.-A. Carbonneau, V. Cheplygina, E. Granger, and G. Gagnon, “Multiple
Instance Learning: A Survey on Problems Characteristics and Applications,” to be
submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017.
instances.
So…
Conclusion
Conclusion
Multiple instance deal with problems where:
• Data points are grouped in sets
• Weak supervision is provided
It is used when:
• Problems are naturally formulated as MIL.
• Strong supervision is costly to obtain or a large quantity of weakly
labeled data can be leveraged.
There are several particularities inherent to this type of problem that
have to be understood in order to be successful in the application of MIL.

Rise of The Machines: Larry Wasserman
No ratings yet
Rise of The Machines: Larry Wasserman
12 pages
ML Merged
No ratings yet
ML Merged
433 pages
AML All Merged PDF Class 1 To 8
No ratings yet
AML All Merged PDF Class 1 To 8
423 pages
1309.5643v3
No ratings yet
1309.5643v3
35 pages
Classification Techniques
No ratings yet
Classification Techniques
99 pages
Bayesian PDF
No ratings yet
Bayesian PDF
67 pages
Module 1 ML
No ratings yet
Module 1 ML
78 pages
ML Mid Syllabus
No ratings yet
ML Mid Syllabus
182 pages
(Babenko2009) Multiple Instance Learning - Algorithms and Applications
No ratings yet
(Babenko2009) Multiple Instance Learning - Algorithms and Applications
19 pages
(Amores2013) Multiple Instance Classification - Review, Taxonomy and Comparative Study
No ratings yet
(Amores2013) Multiple Instance Classification - Review, Taxonomy and Comparative Study
25 pages
Visual Model Building
No ratings yet
Visual Model Building
48 pages
Intro To ML
No ratings yet
Intro To ML
107 pages
08classification I
No ratings yet
08classification I
52 pages
Fintech ML Using Azure
No ratings yet
Fintech ML Using Azure
51 pages
applsci-14-11696
No ratings yet
applsci-14-11696
21 pages
Chap 13
No ratings yet
Chap 13
68 pages
MIL Rules 4
No ratings yet
MIL Rules 4
174 pages
Machine Learning Notes
100% (3)
Machine Learning Notes
134 pages
Chapter 5 Machine Learning
No ratings yet
Chapter 5 Machine Learning
96 pages
PR Unit 1 2
No ratings yet
PR Unit 1 2
40 pages
How To Do Machine Learning With Small Data
No ratings yet
How To Do Machine Learning With Small Data
22 pages
Machine Learning Top Trumps With David Tarrant
No ratings yet
Machine Learning Top Trumps With David Tarrant
22 pages
Unit-1 MLT
No ratings yet
Unit-1 MLT
51 pages
Lin_Interventional_Bag_Multi-Instance_Learning_on_Whole-Slide_Pathological_Images_CVPR_2023_paper
No ratings yet
Lin_Interventional_Bag_Multi-Instance_Learning_on_Whole-Slide_Pathological_Images_CVPR_2023_paper
10 pages
01 Introduction 1
No ratings yet
01 Introduction 1
71 pages
ML Unit 1
No ratings yet
ML Unit 1
73 pages
s40537-020-00301-0
No ratings yet
s40537-020-00301-0
17 pages
Convolutional Sparse Coding Multiple Instance Learning For Whole Slide Image Classification
No ratings yet
Convolutional Sparse Coding Multiple Instance Learning For Whole Slide Image Classification
9 pages
Li Dual-Stream Multiple Instance Learning Network For Whole Slide Image Classification CVPR 2021 Paper
No ratings yet
Li Dual-Stream Multiple Instance Learning Network For Whole Slide Image Classification CVPR 2021 Paper
11 pages
Miforests: Multiple-Instance Learning With Randomized Trees
No ratings yet
Miforests: Multiple-Instance Learning With Randomized Trees
14 pages
Machine Learning On Big Data: Opportunities and Challenges: Version of Record
No ratings yet
Machine Learning On Big Data: Opportunities and Challenges: Version of Record
27 pages
Lecture 1
No ratings yet
Lecture 1
43 pages
MAI Lecture 01 Introduction
No ratings yet
MAI Lecture 01 Introduction
52 pages
Self-Training Multi-Sequence Learning With Transformer
No ratings yet
Self-Training Multi-Sequence Learning With Transformer
9 pages
Introd M
No ratings yet
Introd M
37 pages
Journal For Research
No ratings yet
Journal For Research
6 pages
Oligois: Scalable Instance Selection For Class-Imbalanced Data Sets
No ratings yet
Oligois: Scalable Instance Selection For Class-Imbalanced Data Sets
15 pages
ML 1 2 3
No ratings yet
ML 1 2 3
54 pages
Lecture 1 Course Introduction
No ratings yet
Lecture 1 Course Introduction
18 pages
Introduction To Machine Learning and Deep Learning
No ratings yet
Introduction To Machine Learning and Deep Learning
40 pages
Machine Learning The Way To Better Thinking
No ratings yet
Machine Learning The Way To Better Thinking
11 pages
Recent Hammers
No ratings yet
Recent Hammers
6 pages
Machine Learning On Big Data: Opportunities and Challenges
No ratings yet
Machine Learning On Big Data: Opportunities and Challenges
25 pages
(IJCST-V8I6P9) :Y Madhankumar
No ratings yet
(IJCST-V8I6P9) :Y Madhankumar
6 pages
(Amores) Notes On Multiple Instance Learning - Review, Taxonomy and Comparative Study
No ratings yet
(Amores) Notes On Multiple Instance Learning - Review, Taxonomy and Comparative Study
15 pages
Bagging Using Instance-Level Difficulty For Multi-Class Imbalanced Big Data Classification On Spark
No ratings yet
Bagging Using Instance-Level Difficulty For Multi-Class Imbalanced Big Data Classification On Spark
10 pages
bag_of_words nlp
No ratings yet
bag_of_words nlp
23 pages
7 Predicting 114
No ratings yet
7 Predicting 114
6 pages
A Study On Document Classification Using Machine Learning Techniques
No ratings yet
A Study On Document Classification Using Machine Learning Techniques
6 pages
Slides Imbalanced Learning Intro
No ratings yet
Slides Imbalanced Learning Intro
7 pages
Science Research Journal
No ratings yet
Science Research Journal
7 pages
Towards Machine Learning Guided by Best Practices
No ratings yet
Towards Machine Learning Guided by Best Practices
5 pages
Automatic Image Annotation and Retrieval Using Multi-Instance Multi-Label Learning
No ratings yet
Automatic Image Annotation and Retrieval Using Multi-Instance Multi-Label Learning
5 pages
Deep MIML Network
No ratings yet
Deep MIML Network
7 pages
1 - A Framework For Multiple-Instance Learning (Diverse Density)
No ratings yet
1 - A Framework For Multiple-Instance Learning (Diverse Density)
7 pages
Machine Learning Usefull Things
No ratings yet
Machine Learning Usefull Things
18 pages
Unit 1
No ratings yet
Unit 1
15 pages
A Survey of Machine Learning Methods For Iot and Their Future Applications
No ratings yet
A Survey of Machine Learning Methods For Iot and Their Future Applications
5 pages
Supervised Learning
No ratings yet
Supervised Learning
5 pages
Pedestrian Detection: Please, suggest a subtitle for a book with title 'Pedestrian Detection' within the realm of 'Computer Vision'. The suggested subtitle should not have ':'.
From Everand
Pedestrian Detection: Please, suggest a subtitle for a book with title 'Pedestrian Detection' within the realm of 'Computer Vision'. The suggested subtitle should not have ':'.
Fouad Sabry
No ratings yet

Introduction To Multiple Instance Learning PDF

Uploaded by

Introduction To Multiple Instance Learning PDF

Uploaded by

Introduction to Multiple

Can not enter the secret room

It is gaining momentum in the pattern recognition community because:

H. O. Song, R. Girshick, S. Jegelka, J. Mairal, Z. Harchaoui, and T.

per image. Expert local annotation are no

can be harvested from the web. A

• Usually less accurate than bag space methods.

The key difference is the

Images from: M.-A. Carbonneau, V. Cheplygina, E. Granger, and G.

You might also like