Democratic Co-Learning
Yan Zhou
School of Computer and Information Sciences
University of South Alabama
Mobile, AL 36688
[email protected]
Sally Goldman
Department of Computer Science and Engineering
Washington University
St. Louis, MO 63130-4899
[email protected]
Abstract

For many machine learning applications it is important to develop algorithms that use both labeled and unlabeled data. We present democratic co-learning, in which multiple algorithms, rather than multiple views, enable learners to label data for each other. Our technique leverages the fact that different learning algorithms have different inductive biases and that better predictions can be made by the voted majority. We also present democratic priority sampling, a new example selection method for active learning.

1. Introduction

In many practical learning scenarios there is only a small amount of labeled data (which is often costly to obtain) along with a large pool of unlabeled data. One of many example applications is content-based image retrieval, in which a user (via relevance feedback) labels a small number of images as desirable or undesirable, yet an extremely large pool of unlabeled images is available. The goal of the content-based image retrieval system is to determine which images the user finds desirable. We use semi-supervised learning to refer to settings in which unlabeled data is used to augment labeled data when the amount of labeled data is insufficient.

In a single-view semi-supervised method the learner receives a single set of attributes to use for learning. In a multi-view approach (such as co-training [1]), the learner receives two or more independent and redundant sets of attributes, where each view individually is adequate for learning. While there are applications with two such views, there are also many settings in which there are not. Nigam and Ghani [11] showed that co-training depends strongly on its assumption of an independent and redundant feature split.

The question we address in this paper is how unlabeled data can be used to improve the accuracy of supervised learning algorithms in situations when:

– only a small amount of labeled data is available,
– there is a large pool of unlabeled data, and
– there are not two independent and redundant sets of attributes.

Our work replaces the need for two attribute sets by leveraging the fact that different learning algorithms have different inductive biases even when seeing the same data. Our work is motivated, in part, by the empirical success of ensemble methods (e.g. boosting [8] or bagging [9]), in which individual classifiers are trained on different training sets obtained by re-sampling the labeled data. There are two important questions we must address:

1. How can one create the set of hypotheses to combine to obtain better accuracy given that there is not enough labeled data to apply re-sampling techniques?

2. How can one make use of the large pool of unlabeled data?

In our work, we use an ensemble-style approach, but rather than creating the classifiers by running a single algorithm on different subsets of the labeled data (which is not an option because of the limited amount of labeled data), we instead run different algorithms on the same set of data. Also, ensemble methods do not use unlabeled data as an additional source of knowledge; rather, they are designed for settings in which there is a sufficient supply of labeled data but only weak learning algorithms.
Our early work [7] demonstrates that two different algorithms can successfully label data from the unlabeled pool for each other. More recently, such an approach has been successfully applied to content-based image retrieval [19].

We present democratic co-learning, a new single-view semi-supervised technique that can be used for applications without two independent and redundant feature sets and that is applicable with a small pool of labeled data. In democratic co-learning, a set of different learning algorithms is employed to train a set of classifiers separately on the labeled data set. The output concepts are combined using weighted voting to predict a label for each unlabeled example. A newly labeled example is added to the training sets of the classifiers that predict differently than the majority. The process is repeated until no more data can be added to the training set of any classifier. We also present democratic priority sampling to select the examples for which to request labels in active learning. Finally, we obtain active democratic co-learning, which uses democratic priority sampling to select examples to be actively labeled and uses democratic co-learning to label additional examples.
2. Related Work

Like ensemble methods (e.g. boosting [8] or bagging [9]), democratic co-learning integrates a group of learners to boost the overall accuracy, exploiting differences in the bias between methods or methods that allow locally different models. However, there are fundamental differences in both design and motivation. An ensemble method improves itself by creating random subsets or purposely biased distributions from the training data, which is inapplicable when the amount of training data is small.

In general the semi-supervised learning problem has been studied in two settings: multi-view and single-view. In a single-view semi-supervised method the learner receives a single set of attributes to use for learning. In a multi-view approach (such as the co-training procedure of Blum and Mitchell [1]), the learner receives two or more independent and redundant sets of attributes where each view individually is adequate for learning. Democratic co-learning is a new single-view approach.

Expectation-Maximization (EM) [6] can be viewed as a single-view semi-supervised learning algorithm by treating the label of each unlabeled example as a hidden variable. Used in this way, EM begins with an initial classifier trained on the labeled examples. It then repeatedly uses the current classifier to temporarily label the unlabeled examples and trains a new classifier on all labeled examples (the original and the newly labeled) until it converges. While the EM algorithm works well when the assumed model of the data holds, violations of these assumptions often result in poor performance [10]. Democratic co-learning differs from other single-view algorithms such as EM [6] in that, like the statistical co-learning algorithm introduced in our early work [7], it uses multiple learning algorithms to serve a role similar to the one that multiple views provide in co-training.
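As a concrete illustration of this iterate-until-convergence loop, here is a minimal sketch. It is a hard-label variant (EM proper would carry the posterior probabilities as soft labels), and it assumes a scikit-learn-style classifier; it is not the implementation evaluated in this paper.

```python
import numpy as np
from sklearn.naive_bayes import GaussianNB

def em_style_self_labeling(X_labeled, y_labeled, X_unlabeled, max_iters=50):
    """EM-style semi-supervised loop: temporarily label the unlabeled
    pool with the current classifier, retrain on everything, and stop
    once the temporary labels no longer change."""
    clf = GaussianNB().fit(X_labeled, y_labeled)
    prev = None
    for _ in range(max_iters):
        temp = clf.predict(X_unlabeled)          # temporary labels
        if prev is not None and np.array_equal(temp, prev):
            break                                # converged
        prev = temp
        X_all = np.vstack([X_labeled, X_unlabeled])
        y_all = np.concatenate([y_labeled, temp])
        clf = GaussianNB().fit(X_all, y_all)     # retrain from scratch
    return clf
```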
Blum and Mitchell [1] introduced the multi-view semi-supervised learning approach. They make the strong assumption that the instance space can be represented using two different views (i.e. two independent and redundant sets of attributes) and that either view by itself would be sufficient for perfect classification if there were enough labeled data. They presented a co-training algorithm for this situation and gave both empirical and theoretical results evaluating it. While there are settings in which two such independent (and sufficiently redundant) views exist, there are also many settings in which redundant views are not available. Nigam and Ghani [11] have shown that co-training depends strongly on its assumption of an independent and redundant feature split. In this paper, we present a new single-view technique, democratic co-learning, that is applicable to settings that violate the assumption of independent and redundant feature sets. Our technique leverages the fact that different learning algorithms have different inductive biases and that better predictions can be made by the voted majority.

Co-EM [11] integrates co-training and EM by using the hypothesis learned in one view to probabilistically label the examples in the other view. The primary difference between co-EM and co-training is that, like EM, co-EM assigns a temporary label to each unlabeled example from scratch at each iteration, whereas co-training selects a subset of the unlabeled examples to permanently label. In both cases, the hypothesis obtained from one view is used to perform labeling for the other view.

Two-view EM (2v-EM) [12] aims to demonstrate that the strength of co-training and co-EM does not come merely from combining classifiers learned from different views. 2v-EM performs EM on each view in isolation and then combines the predictions of the hypotheses learned in each view. Using text-categorization benchmarks, its authors showed that when the requirement of two independent and redundant views is severely violated, 2v-EM can outperform co-training and co-EM.
              single-view        multi-view        single-view
              single learner     single learner    multiple learners

  Non-active  EM                 Co-Training       Statistical Co-Learning
                                 Co-EM             Democratic Co-Learning*
                                 2v-EM

  Active      QBC (+EM)          Co-Testing        Active Democratic
              Uncertainty        Co-Test(co-EM)    Co-Learning*
              Sampling (+EM)

  Table 1. Semi-supervised learning techniques classified by view and by the use of active learning (* = our new contributions).
While democratic co-learning has similarities with the statistical co-learning of our earlier work [7], there are major differences. First, statistical co-learning uses two learning algorithms and requires each to output a hypothesis that partitions the domain into equivalence classes. For example, the decision tree output by C4.5 defines one equivalence class per leaf. This assumption limits the applicability of that approach. Also, we used statistical tests to decide when one algorithm should label data for the other, yet the amount of labeled data available was insufficient for applying those tests. Democratic co-learning resolves both of these problems by using an ensemble-like method that reduces the need for statistical tests, enabling it to be applied with any three or more standard supervised learning algorithms.

Some useful insights for our work come from meta-learning. In theory, there is no single learning algorithm that is superior on all problems [2]. It has also been shown that classifiers with uncorrelated errors may reduce the error rate when combined into a single model [5]. Chan and Stolfo [3] considered learning in a distributed setting in which the labeled data is distributed over many locations, so each learning algorithm sees only a subset of the labeled data. While the setting for their research is quite different from ours, they showed that since different learning algorithms use different representations for their hypotheses and have different inductive biases, the underlying strategies embodied by different learning algorithms may complement each other by effectively reducing the space of incorrect classifications of a learned concept [3]. In their multi-algorithm meta-learning strategy [4], Chan et al. provided only a fraction of the labeled data to each base classifier, yet the resulting combined classifier obtained a better overall accuracy than a classifier trained from all the available data. One key difference from our work is that they assume each learner sees only a small amount of labeled data because the data is distributed. As in their work, we expect different algorithms to infer different patterns in the data. Another difference is that we use the classifiers not only to boost performance but also to label data in order to increase the pool of labeled data for the learning algorithms that did not infer the same patterns.

We briefly review work on active learning. Uncertainty sampling [13, 14] repeatedly selects the unlabeled example with the most "uncertain" membership and asks the oracle to provide the correct label. The learning algorithm then rebuilds its hypothesis based on the new training set. Query-by-committee (QBC) [13, 8] measures the degree to which a group of classifiers disagree rather than using a single classifier to measure the certainty of its classification. In QBC, committee members can be generated on different subsets of the training data, or randomly chosen according to the posterior distribution of possible models given the training data. Instead of basing priorities on the number of disagreements, we consider a variant [15] of QBC in which the priority of example x is computed using the entropy of the classifications voted by the committee members:

    priority(x) = -\sum_{j=1}^{c} \frac{V_j}{k} \log \frac{V_j}{k},

where k is the number of committee members, c the total number of labels, and V_j the number of votes for label c_j. Examples with the highest entropy are selected for labeling.
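For illustration, the vote entropy of a single example can be computed directly from the committee's predicted labels (a sketch of our own, not code from [15]):

```python
from collections import Counter
from math import log

def vote_entropy(votes):
    """Vote entropy of one unlabeled example: `votes` is the list of
    labels predicted by the k committee members.  Higher entropy means
    more disagreement, hence higher priority for labeling."""
    k = len(votes)
    return -sum((v / k) * log(v / k) for v in Counter(votes).values())

# Example: a 5-member committee split 3/2 is a better query than 5/0.
assert vote_entropy(["a", "a", "a", "b", "b"]) > vote_entropy(["a"] * 5)
```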
Co-testing [12] is an active multi-view learning approach that repeatedly trains one hypothesis for each view and selects as a query an unlabeled example on which the two hypotheses predict differently (a contention point). The contention point on which the combined prediction of the two classifiers is least confident is selected. Co-Test(co-EM) [16] combines co-testing and co-EM to obtain an active multi-view semi-supervised learning algorithm. Their experiments show that co-Test(co-EM) outperforms other non-active multi-view algorithms without using more labeled data and is better able to overcome violations of the assumption of two independent and redundant views.

Table 1 classifies semi-supervised techniques based on whether they use a single-view or multi-view approach and on whether active learning is used. Our new contributions are marked with an asterisk.

3. Democratic Co-Learning

We now present democratic co-learning. Let L be the set of labeled data, U the set of unlabeled data, and A_1, ..., A_n (for n >= 3) the provided supervised learning algorithms.¹ Democratic co-learning begins by training all learners on the original labeled data set L. For every example x in the unlabeled data set U, each learner predicts a label for x. Let c be the majority prediction. In Section 3.1, we introduce several labeling criteria that must be satisfied before example x is labeled with c for the learners that did not predict c for x.

¹ While we describe democratic co-learning for any number of supervised learning algorithms, in our empirical work we only consider n = 3.
All learners are then re-trained using the updated training data, and this process is repeated until no more data is selected for labeling. The final hypothesis makes predictions using a variant of a weighted majority vote among the n learners (see Section 3.2). The detailed democratic co-learning procedure is shown in Figure 1.

    L is the labeled data, U is the unlabeled data
    A_1, ..., A_n are the different learning algorithms
    For i = 1, ..., n
        L_i = L        /* labeled data for A_i */
        e_i = 0        /* estimate for # mislabeled exs in L_i */
    Repeat until none of L_1, ..., L_n change
        For i = 1, ..., n
            Run learner A_i with data L_i to compute hypothesis H_i
        For each unlabeled example x in U
            For possible labels c_1, ..., c_k
                tally the votes of H_1, ..., H_n for label c_j on x
            let c be the majority prediction for x
        /*-- Choose which exs to propose for labeling --*/
        For i = 1, ..., n
            Use L to compute a 95%-conf. int. [l_i, h_i] for H_i
        For each x in U whose majority label c satisfies the labeling
        criteria of Section 3.1
            add (x, c) to L_i for each learner A_i whose H_i did not
            predict c, and update the estimate e_i

    Figure 1. The democratic co-learning procedure.

3.1. Labeling Criteria

The first criterion requires that the sum of the mean confidence values of the learners in the majority group is greater than the sum of the mean confidence values of the learners in the minority groups, where the mean confidence of learner A_i is w_i = (l_i + h_i)/2, defined by the 95%-confidence interval [l_i, h_i] of its accuracy. We have performed experiments with 90% and 99% confidence intervals and the results were very similar. Using a vote weighted by a measure of confidence eliminates the possibility that a majority of learners make the same wrong prediction, each with very low confidence. For example, suppose there are three co-learners in a binary classification problem. One learner predicts "positive" for unlabeled example x with 99% confidence and the other two predict that x is "negative", each with a confidence of 30%. In this case, we would not want to let the two learners that predict x is negative label x for the learner predicting that x is positive.

In order to balance the benefit of adding more labeled examples to the training data against the increase in the noise rate that may occur in the labels, we use the same tests as in our earlier work to estimate whether the increase in the amount of labeled data is sufficient to compensate for the increase in the number of mislabeled examples. The details can be seen in Figure 1.
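As an illustration of the control flow in Figure 1 and the confidence test above, the following is a minimal sketch. It assumes scikit-learn-style base learners and a cross-validated normal-approximation confidence interval for accuracy, and it omits the test involving the mislabeling estimates e_i, so it is not the full procedure evaluated in this paper.

```python
import numpy as np
from sklearn.base import clone
from sklearn.model_selection import cross_val_score

def mean_confidence(learner, X, y, z=1.96):
    """w_i = (l_i + h_i) / 2 for a 95%-confidence interval [l_i, h_i]
    on the accuracy of the learner.  The interval here is a
    cross-validated normal approximation clipped to [0, 1] -- an
    assumption of this sketch, not a detail taken from the paper."""
    acc = cross_val_score(clone(learner), X, y, cv=5).mean()
    half = z * np.sqrt(max(acc * (1 - acc), 1e-12) / len(y))
    l, h = max(acc - half, 0.0), min(acc + half, 1.0)
    return (l + h) / 2

def democratic_co_learning(learners, X_lab, y_lab, X_unlab):
    """Simplified loop of Figure 1: each learner keeps its own labeled
    pool; an unlabeled example is pushed to the dissenting learners
    when the majority group's summed mean confidence exceeds that of
    the minority groups."""
    pools = [(X_lab.copy(), y_lab.copy()) for _ in learners]
    labeled = set()           # unlabeled examples already proposed
    changed = True
    while changed:
        changed = False
        fitted = [clone(a).fit(X, y) for a, (X, y) in zip(learners, pools)]
        w = [mean_confidence(a, X, y) for a, (X, y) in zip(learners, pools)]
        for idx, x in enumerate(X_unlab):
            if idx in labeled:
                continue
            votes = [h.predict(x.reshape(1, -1))[0] for h in fitted]
            conf = {}
            for v, wi in zip(votes, w):
                conf[v] = conf.get(v, 0.0) + wi
            c = max(conf, key=conf.get)  # most-confident label
                                         # (head-count majority in the paper)
            # labeling criterion: majority confidence beats the minorities
            if conf[c] > sum(wc for lab, wc in conf.items() if lab != c):
                labeled.add(idx)
                changed = True
                for i, v in enumerate(votes):
                    if v != c:           # dissenting learner gets (x, c)
                        Xi, yi = pools[i]
                        pools[i] = (np.vstack([Xi, [x]]), np.append(yi, c))
    return fitted, w
```

The returned hypotheses and weights would then be fed to the combining procedure of Section 3.2.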
3.2. Combining

The Combine procedure used to form the final hypothesis is shown in Figure 2: the classifiers are grouped by the label they predict for an example, the average mean confidence of each group is computed, and the label of the most confident group is returned.

    Combine(H_1, ..., H_n)
    For i = 1, ..., n
        Use L to compute a 95%-conf. int. [l_i, h_i] for H_i
    For each example x in the instance space
        For i = 1, ..., n
            If H_i predicts c_j for x and ...
                Allocate H_i to group G_j
        For j = 1, ..., k
            /* compute group average mean confidence */
            conf(G_j) = (sum of w_i = (l_i + h_i)/2 over H_i in G_j) / |G_j|
        Predict for x the label c_j of the group G_j with the
        highest average mean confidence
    Return the resulting classifier

    Figure 2. Combine procedure.
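A compact sketch of this grouping rule (following the reconstructed Figure 2 above; the condition lost from the figure fragment is omitted, so treat this as an approximation):

```python
from collections import defaultdict

def combine(predictions, mean_confidences):
    """predictions[i] is H_i's label for x; mean_confidences[i] is
    w_i = (l_i + h_i) / 2.  Classifiers are grouped by predicted label
    and the label of the group with the highest average mean
    confidence wins."""
    groups = defaultdict(list)
    for label, w in zip(predictions, mean_confidences):
        groups[label].append(w)
    return max(groups, key=lambda c: sum(groups[c]) / len(groups[c]))

# Three learners: one confident "pos" beats two unconfident "neg"s.
print(combine(["pos", "neg", "neg"], [0.99, 0.30, 0.30]))  # -> pos
```

This weighted rule reproduces the behavior wanted in the Section 3.1 example: a single high-confidence learner is not outvoted by several low-confidence ones.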
4. Democratic Priority Sampling

Democratic priority sampling uses the labeled data to train the n different learners to obtain the classifiers H_1, ..., H_n. One possible way to then select the example to actively label would be to use the vote entropy as in QBC. However, we also want to incorporate the confidence of each individual classifier in the priority estimate. Hence we define a confidence-weighted vote entropy, computing the vote entropy weighted by the mean confidence of the classifiers. We did test using an unweighted majority but obtained better results using a weighted majority vote.

More formally, let k be the number of different labels and let S_j contain the set of classifiers among H_1, ..., H_n that predict label c_j for x. We define the priority of unlabeled example x as

    priority(x) = -\sum_{j=1}^{k} \frac{W_j}{W} \log \frac{W_j}{W},
    where W_j = \sum_{H_i \in S_j} w_i and W = \sum_{i=1}^{n} w_i,

and w_i = (l_i + h_i)/2 is the mean of the 95%-confidence interval [l_i, h_i] of H_i. The example with the highest priority is given to an expert for labeling. Then the hypotheses are recomputed using the larger pool of labeled data and the process is repeated.

While there are many similarities between democratic priority sampling and QBC, there are two key differences. First, the committee members are obtained by using different learning algorithms rather than the same learning algorithm trained on different data. Second, we use a weighted variant of vote entropy to incorporate the confidence estimates into the priorities.
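Under the reconstruction of the formula above, the only change from the plain vote entropy of Section 2 is that each classifier's vote is weighted by its mean confidence w_i rather than counted. A short sketch:

```python
from math import log

def weighted_vote_entropy(predictions, mean_confidences):
    """Confidence-weighted vote entropy: W_j sums the w_i of the
    classifiers predicting label c_j, and the entropy is taken over
    the distribution W_j / W."""
    W = sum(mean_confidences)
    W_j = {}
    for label, w in zip(predictions, mean_confidences):
        W_j[label] = W_j.get(label, 0.0) + w
    return -sum((wj / W) * log(wj / W) for wj in W_j.values())

# The unlabeled example with the highest priority is sent to the expert.
pool = {"x1": (["a", "a", "b"], [0.9, 0.8, 0.7]),
        "x2": (["a", "a", "a"], [0.9, 0.8, 0.7])}
query = max(pool, key=lambda x: weighted_vote_entropy(*pool[x]))
print(query)  # -> x1 (more disagreement)
```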
5. Empirical Results

In this section we present our empirical results. As the three base learning algorithms we use naive Bayes (NB), C4.5 [17], and 3-nearest neighbor (3-NN) [18]. In all of our experiments, we compute the reported accuracy using a test set that is roughly the same size as the unlabeled pool.

5.1. Non-active Co-learning

We present results for the non-active setting. In the left plot of Figure 3, we compare democratic co-learning with naive Bayes, C4.5, 3-NN, and the results of using combining alone on the DNA data set. Democratic co-learning outperforms the three individual algorithms, and the gain was not achieved by simply combining the predictions made by the three learners. In the right plot of Figure 3, we compare democratic co-learning with naive Bayes, C4.5, and 3-NN when each is combined with EM to use the unlabeled data. In each of these plots |L| varies between 35 and 100, with an independent run of each algorithm performed for each integer in this range. The purpose of these experiments is to evaluate how the performance of these methods is affected by varying the size of the pool of labeled data. Across all values of |L| we tested, democratic co-learning outperforms the three individual algorithms when they are combined with EM to make use of the unlabeled data. Notice that EM may have a negative impact on poor classifiers trained over insufficient labeled data.

We now consider a single value of |L| over a variety of data sets. We show the performance of each base algorithm, as well as the performance when we just use the combining method of democratic co-learning, to demonstrate that we are making use of the unlabeled data as opposed to having our gains come from the ensemble of the three base algorithms. We also compared our work to other semi-supervised learning algorithms. For statistical co-learning we use naive Bayes and C4.5 since they generally perform better than 3-NN. To create a hypothesis from naive Bayes that partitions the input domain as required by statistical co-learning, we take all of the data in U, label it according to the naive Bayes hypothesis, and then use C4.5 to create the equivalence classes (one per leaf). We use eight of the UCI³ benchmark data sets. For all data sets … except for the adult data set, where … . Table 2 shows other statistics about the data sets. We created 20 different data sets by randomly partitioning the data into L, U, and the test data. In addition, we picked random partitions in which democratic co-learning labeled at least one example in U.

³ http://www.ics.uci.edu/~mlearn/MLRepository.html
Figure 3. Results on DNA data. The x-axis is |L| and the y-axis is the accuracy. Left plot: Naive Bayes, C4.5, 3-NN, DemoCoL, and Combine Only; right plot: DemoCoL, EM-Naive Bayes, EM-C4.5, and EM-3NN.
A key contribution of our work is a semi-supervised learning technique that can be applied when there is no such independent and redundant set of attributes. Since the work on two-view approaches generally reports results only on data sets that naturally have two appropriate feature sets, comparing our work to those approaches requires that we re-implement their work. We have elected to do this for the Blum and Mitchell co-training procedure [1], which we refer to as two-view co-training. In order to create two views, we randomly partition the features into two sets and then treat these as our two views, as done by Nigam and Ghani [11]. We also tested how sensitive the performance of two-view co-training was to the random choice of the partition of the features. For each of the UCI data sets we fixed the choice of which examples to place in L, U, and the test set and then randomly picked 20 different partitions of the features into two sets. For these we found a standard deviation of anywhere from 0.03 to 0.06. Finally, we present results obtained by using EM with each of the three base algorithms. To create a measure of the best performance one could expect for the given data sets, the column labeled "data in U labeled" shows the best result obtained among any of the base algorithms (naive Bayes, C4.5, and 3-NN) when all examples in U are correctly labeled and placed in L.

Due to the small size of L, and therefore the considerable variation in performance, a paired t-test is used to determine the statistical significance of the difference made by democratic co-learning. Our results are shown in Table 3. The value in parentheses is the paired t-test value between democratic co-learning and that method. A positive value indicates that democratic co-learning performed better. Any value greater than 2.093 is statistically significant at the 95% confidence level or higher. All values greater than 2.861 are also statistically significant at the 99% level, and all values greater than 3.8834 are statistically significant at the 99.9% level. Due to space constraints the standard deviation is only shown for democratic co-learning.

As compared to combining alone, democratic co-learning performs better at the 95% confidence level for 6 of the 8 data sets and at the 90% confidence level for the other two data sets. So democratic co-learning is making use of the unlabeled data and not just benefiting from the use of an ensemble method of combining.
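For reference, paired t-test values such as those in Tables 3 and 4 can be computed with a standard routine; the accuracy vectors below are hypothetical stand-ins for the 20 per-partition accuracies:

```python
import numpy as np
from scipy.stats import ttest_rel

# Accuracies over the 20 random partitions (hypothetical numbers).
demo_col = np.array([0.81, 0.79, 0.84, 0.80] * 5)
combine_only = np.array([0.78, 0.77, 0.80, 0.79] * 5)

t, p = ttest_rel(demo_col, combine_only)
# With 19 degrees of freedom, t > 2.093 is significant at the 95%
# level (two-sided), t > 2.861 at 99%, and t > 3.883 at 99.9%.
print(f"t = {t:.3f}, p = {p:.4f}")
```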
  Algorithm                  flare      monk2      vote       DNA
  Demo. Co-Learning          …          …          …          …
  Combining Only             (1.997)    (4.798)    (2.042)    (7.025)
  Data in U Labeled          …          …          …          …
  Statistical Co-Learning    (2.112)    (0.993)    (2.879)    (3.566)
  Two-View Co-Training       (5.841)    (4.854)    (9.959)    (-1.966)
  EM-NB                      (4.930)    (6.210)    (8.285)    (13.852)
  EM-C4.5                    (2.612)    (10.846)   (1.630)    (6.740)
  EM-3NN                     (5.303)    (1.000)    (12.157)   (8.693)
  NB                         (5.714)    (6.425)    (7.546)    (13.350)
  C4.5                       (2.612)    (10.846)   (1.630)    (6.740)
  3NN                        (5.333)    (2.836)    (9.259)    (8.584)

  Table 3. Our non-active learning results.
As compared to the other 5 semi-supervised methods, democratic co-learning performs statistically significantly better at the 95% level in 32 of the 40 tests we performed. (In fact, in 27 of the 40 tests, our improvements are statistically significant at the 99% confidence level.) Of the 8 tests in which the difference in performance was not statistically significant, democratic co-learning performed better in all but two.

5.2. Active Co-learning

Table 4 shows our active learning results. For uncertainty sampling we use naive Bayes, where the normalized probability measure of naive Bayes is used to give an uncertainty value. For QBC we use … different committee members, each trained with naive Bayes on a random subset (without replacement) of … examples from L. The active learning is used to select 40 additional examples to have labeled.

We first show the best result obtained among the base algorithms when all data in U is properly labeled. Next we compare democratic priority sampling (with no use of the unlabeled data except to serve as a pool of data from which labels may be requested) with QBC and uncertainty sampling. For QBC and uncertainty sampling, we show the paired t-test value with respect to democratic priority sampling. Finally, we compare the following active semi-supervised algorithms: active democratic co-learning, co-testing, and co-Test(co-EM), showing the paired t-test values with respect to active democratic co-learning.

For the active approaches in which the unlabeled data is used only as a pool for the active learner, democratic priority sampling performed better than each of QBC and uncertainty sampling in 5 of the 8 data sets, but only 2 of these 5 cases (for each) were statistically significant at the 95% level. We are currently repeating these experiments using 20 different random choices for L, U, and the test data, and we believe that we will find statistically significant improvements in more cases. For the active semi-supervised algorithms, active democratic co-learning performed better than each of co-testing and co-Test(co-EM) in 5 of the 8 data sets, with 4 of the 5 (for each) being statistically significant at the 95% level.

We also ran a paired t-test between democratic priority sampling and active democratic co-learning. For the 3-of-9 and DNA data sets the improvement of active democratic co-learning was statistically significant at the 95% level, and for the vote and XD6 data sets the improvement was statistically significant at the 90% level.
  Algorithm                  flare      monk2      vote       DNA
  Data in U Labeled          …          …          …          …
  Demo. Priority Samp.       …          …          …          …
  Query By Committee         (4.654)    (1.165)    (2.190)    (-2.631)
  Uncertainty Sampling       (0.246)    (0.631)    (2.120)    (0.703)
  Active Demo. Co-Learn.     …          …          …          …
  CoTesting                  (-0.714)   (0.000)    (2.581)    (5.160)
  Co-Test(co-EM)             (-0.805)   (-0.690)   (4.493)    (2.189)

  Algorithm                  cancer     adult      3-of-9     xd6
  Data in U Labeled          …          …          …          …
  Demo. Priority Samp.       …          …          …          …
  Query By Committee         (0.753)    (-0.708)   (1.616)    (3.105)
  Uncertainty Sampling       (-3.903)   (-0.717)   (2.175)    (2.933)
  Active Demo. Co-Learn.     …          …          …          …
  CoTesting                  (-0.051)   (5.086)    (4.563)    (8.610)
  Co-Test(co-EM)             (-0.941)   (4.695)    (7.260)    (14.529)

  Table 4. Our active learning results.
For the flare and monk2 data sets there really is not much room for improvement. Similarly, in comparing the performance of democratic co-learning and active democratic co-learning, the use of active learning generally improved performance on the data sets in which the performance of democratic co-learning was not already close to that obtained when all data in U is given the proper label.

6. Concluding Remarks

We have demonstrated that democratic co-learning, a single-view, multiple-algorithm, semi-supervised learning technique, is statistically superior to many semi-supervised learning approaches when there are not two sufficiently independent and redundant sets of attributes. Using data from the UCI repository, we have compared the performance of democratic co-learning to combining alone (without using the unlabeled data) and to other single-view and multi-view semi-supervised learning algorithms. Democratic co-learning performed better at the 95% confidence level in 38 of the 48 tests that we performed in the non-active learning setting. For the other 10 tests there was no significant difference in performance between democratic co-learning and the other approaches studied.

In general, co-learning works well if the estimated mean confidence reflects which learner is better and when the multiple classifiers are good in different regions, enabling them to classify data for each other. Finally, there must be room for at least one of the supervised learning algorithms to improve if it received more correctly labeled data. Democratic co-learning also outperformed each of the three individual algorithms when they were combined with EM. By picking learners that work in very different ways, we can increase the diversity needed for them to be able to label data for each other.

References

[1] Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: Proc. of the 11th Annual Conf. on Computational Learning Theory. (1998) 92–100.
[2] Schaffer, C.: A conservation law for generalization performance. In: Proc. of the 11th Int. Conf. on Machine Learning, San Mateo: Morgan Kaufmann (1994) 259–265.
[3] Chan, P.K., Stolfo, S.: On the accuracy of meta-learning for scalable data mining. Journal of Intelligent Integration of Information 8(1) (1998) 5–28.
[4] Chan, P.K., Stolfo, S.: Scaling learning by meta-learning over disjoint and partially replicated data. In: Proc. of the 9th Florida AI Research Symposium. (1996) 151–155.
[5] Ali, K., Pazzani, M.: Error reduction through learning multiple descriptions. Machine Learning 24 (1996) 173–202.
[6] Dempster, A., Laird, N., Rubin, D.: Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, Series B 39 (1977) 1–38.
[7] Goldman, S., Zhou, Y.: Enhancing supervised learning with unlabeled data. In: Proc. of the 17th Int. Conf. on Machine Learning, San Francisco: Morgan Kaufmann (2000) 327–334.
[8] Freund, Y., Seung, H., Shamir, E., Tishby, N.: Selective sampling using the query by committee algorithm. Machine Learning 28 (1997) 133–168.
[9] Breiman, L.: Bagging predictors. Machine Learning 24(2) (1996) 123–140.
[10] Nigam, K., McCallum, A.K., Thrun, S., Mitchell, T.: Text classification from labeled and unlabeled documents using EM. Machine Learning 39 (2000) 103–134.
[11] Nigam, K., Ghani, R.: Analyzing the effectiveness and applicability of co-training. In: Proc. of the 9th Int. Conf. on Information and Knowledge Management. (2000) 86–93.
[12] Muslea, I., Minton, S., Knoblock, C.: Selective sampling with redundant views. In: Proc. of AAAI-2000. (2000) 621–626.
[13] Seung, H.S., Opper, M., Sompolinsky, H.: Query by committee. In: Proc. of the ACM Workshop on Computational Learning Theory. (1992) 287–294.
[14] Lewis, D.D., Gale, W.A.: A sequential algorithm for training text classifiers. In: Proc. of the Special Interest Group on Info. Retrieval, AAAI Press and MIT Press (1994) 3–12.
[15] Dagan, I., Engelson, S.: Committee-based sampling for training probabilistic classifiers. In: Proc. of the 12th Int. Conf. on Machine Learning, San Francisco: Morgan Kaufmann (1995) 150–157.
[16] Muslea, I., Minton, S., Knoblock, C.: Selective sampling + semi-supervised learning = robust multi-view learning. In: IJCAI-01 Workshop on Text Learning: Beyond Supervision. (2001).
[17] Quinlan, R.: Induction of decision trees. Machine Learning 1 (1986) 81–106.
[18] Cover, T., Hart, P.: Nearest neighbor pattern classification. IEEE Transactions on Information Theory 13 (1967) 21–27.
[19] Zhou, Z., Chen, K., Jiang, Y.: Exploiting unlabeled data in content-based image retrieval. In: Proc. of the 15th European Conf. on Machine Learning. (2004).