
Advances in AI

Module-1
Co-Training

Co-Training is a semi-supervised learning technique in machine learning, where two or more models (or learners) are trained simultaneously using different views (or subsets) of the data. The key idea behind co-training is that each model is trained on a different subset of features, and the models help each other improve by providing additional labeled data.
Co-Training

• Multiple Views: Co-training works under the assumption that the data can be
represented by multiple "views," where each view provides sufficient information
to make predictions. For example, in a text classification task, one view might be
based on the words in the text, and another view might be based on the
metadata (e.g., author, publication date).

• Semi-Supervised Learning: Co-training is especially useful in scenarios where labeled data is scarce but unlabeled data is abundant. The models start with a small amount of labeled data and gradually expand their labeled dataset by labeling the unlabeled examples they are most confident about.
Co-Training

• Iterative Process: The models are trained iteratively. Initially, they are trained on
a small labeled dataset. Then, each model makes predictions on the unlabeled
data. The most confident predictions are added to the labeled dataset, and the
models are retrained on this expanded dataset. This process is repeated until the
models converge.

• Assumptions: Co-training assumes that the views are conditionally independent given the class label, meaning that if you know the class label, knowing one view doesn’t give you additional information about the other view. This assumption allows the models to correct each other’s mistakes and improve overall accuracy.
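
A minimal sketch of this iterative procedure, written with scikit-learn, is shown below. It assumes the two views arrive as two pre-split feature matrices (X_view1, X_view2) over the same examples; the labeled-pool size, number of rounds, confidence-based selection, and the choice of logistic regression are illustrative assumptions, not a standard recipe.

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    def co_train(X_view1, X_view2, y, n_labeled=50, n_rounds=10, n_add=10):
        # y holds true labels for the first n_labeled rows; the rest are treated as unlabeled.
        labeled = np.arange(n_labeled)
        unlabeled = np.arange(n_labeled, len(y))
        y_work = y.copy()

        clf1 = LogisticRegression(max_iter=1000)
        clf2 = LogisticRegression(max_iter=1000)
        for _ in range(n_rounds):
            if len(unlabeled) == 0:
                break
            # Retrain each learner on its own view of the current labeled pool.
            clf1.fit(X_view1[labeled], y_work[labeled])
            clf2.fit(X_view2[labeled], y_work[labeled])
            # Each learner pseudo-labels the unlabeled pool and keeps its most confident picks.
            newly_labeled = []
            for clf, X in ((clf1, X_view1), (clf2, X_view2)):
                proba = clf.predict_proba(X[unlabeled])
                top = unlabeled[np.argsort(-proba.max(axis=1))[:n_add]]
                y_work[top] = clf.predict(X[top])
                newly_labeled.extend(top)
            # Move the pseudo-labeled examples into the labeled pool and repeat.
            newly_labeled = np.unique(newly_labeled)
            labeled = np.union1d(labeled, newly_labeled)
            unlabeled = np.setdiff1d(unlabeled, newly_labeled)
        return clf1, clf2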
Multi-Task Learning

Multi-task learning (MTL) is an approach in artificial intelligence and machine learning where a model is trained on multiple tasks simultaneously. Instead of focusing on a single objective, the model learns to perform several related tasks, sharing knowledge across them. This shared knowledge often leads to better performance, especially in tasks with limited data, as the model can generalize better by leveraging information from related tasks.
Multi-Task Learning
• In MTL, tasks share a common representation (e.g., shared layers in a neural
network). This allows the model to learn features that are useful across multiple
tasks, leading to better generalization.

• The effectiveness of MTL depends on the relatedness of the tasks. If the tasks are
too dissimilar, sharing representations might not be beneficial and could even
degrade performance.

• MTL can act as a form of regularization. By requiring the model to perform well
on multiple tasks, it discourages overfitting to a single task and promotes learning
more general features.
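
As a concrete illustration of this shared-representation idea, the PyTorch sketch below uses a shared trunk with two task-specific heads trained on one joint loss. The layer sizes, the two classification tasks, and the equal loss weighting are placeholder assumptions for the example, not a prescribed architecture.

    import torch
    import torch.nn as nn

    class SharedMTLNet(nn.Module):
        def __init__(self, in_dim=128, hidden=64, n_classes_a=5, n_classes_b=3):
            super().__init__()
            # Shared trunk: features learned here are reused by both tasks.
            self.shared = nn.Sequential(nn.Linear(in_dim, hidden), nn.ReLU())
            # Task-specific heads.
            self.head_a = nn.Linear(hidden, n_classes_a)
            self.head_b = nn.Linear(hidden, n_classes_b)

        def forward(self, x):
            h = self.shared(x)
            return self.head_a(h), self.head_b(h)

    # One joint training step: sum the per-task losses so both tasks update the shared trunk.
    model = SharedMTLNet()
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    loss_fn = nn.CrossEntropyLoss()

    x = torch.randn(32, 128)              # one shared batch of inputs (toy data)
    y_a = torch.randint(0, 5, (32,))      # labels for task A
    y_b = torch.randint(0, 3, (32,))      # labels for task B

    out_a, out_b = model(x)
    loss = loss_fn(out_a, y_a) + loss_fn(out_b, y_b)
    opt.zero_grad()
    loss.backward()
    opt.step()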
Multi-Task Learning
Applications

MTL is widely used in areas like natural language processing (NLP), where tasks like sentiment analysis, translation, and named entity recognition can benefit from shared learning. It's also applied in computer vision, where tasks like object detection, segmentation, and classification are often learned together.
Multi-Task Learning
Advantages of Multi-Task Learning

• Improved Performance: By learning related tasks together, MTL can lead to better overall performance, especially on smaller datasets.
• Data Efficiency: MTL allows the model to leverage data from multiple
tasks, which is especially useful when data for individual tasks is
scarce.
• Generalization: The shared learning encourages the model to find
features that generalize well across tasks, reducing overfitting.
Multi-Task Learning
Challenges in Multi-Task Learning

• Task Imbalance: If one task is much harder or has more data than others, it can dominate the learning process, leading to suboptimal performance on the other tasks.
• Negative Transfer: If the tasks are not sufficiently related, the model might struggle, as knowledge from one task could negatively impact the performance on another.
Coupled Semi-Supervised Learning (CSSL)
Coupled Semi-Supervised Learning (CSSL) is an approach in machine
learning that combines elements of both supervised and unsupervised
learning to leverage the strengths of both paradigms. The core idea is
to use a small amount of labeled data in conjunction with a large
amount of unlabeled data to improve learning performance.
Coupled Semi-Supervised Learning (CSSL)
• In traditional supervised learning, a model is trained on labeled data, where each
input comes with an associated label. In contrast, semi-supervised learning uses
both labeled and unlabeled data. The main goal is to improve model accuracy by
exploiting the structure in the unlabeled data to assist the learning process. This
is particularly useful when labeling data is expensive or time-consuming.

• In the context of CSSL, "coupling" refers to the interaction or integration between multiple learners or models, each learning from different views or aspects of the data. These coupled learners share information during training, which allows them to enhance each other’s learning process.
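
One simple way to realize this coupling is sketched below: each learner pseudo-labels the unlabeled pool from its own view, and its confident predictions are handed to the other learner as extra training data. This is an illustrative cross-teaching step under assumed names and thresholds, not a specific published CSSL algorithm.

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    def coupled_step(model_a, model_b, Xa_lab, Xb_lab, y_lab,
                     Xa_unlab, Xb_unlab, threshold=0.9):
        # Train each learner on its own view of the shared labeled set.
        model_a.fit(Xa_lab, y_lab)
        model_b.fit(Xb_lab, y_lab)

        # Each learner pseudo-labels the unlabeled pool; keep only confident predictions.
        pseudo_a = model_a.predict(Xa_unlab)
        conf_a = model_a.predict_proba(Xa_unlab).max(axis=1) >= threshold
        pseudo_b = model_b.predict(Xb_unlab)
        conf_b = model_b.predict_proba(Xb_unlab).max(axis=1) >= threshold

        # Coupling: A's confident pseudo-labels augment B's training data, and vice versa.
        model_b.fit(np.vstack([Xb_lab, Xb_unlab[conf_a]]),
                    np.concatenate([y_lab, pseudo_a[conf_a]]))
        model_a.fit(np.vstack([Xa_lab, Xa_unlab[conf_b]]),
                    np.concatenate([y_lab, pseudo_b[conf_b]]))
        return model_a, model_b

    # Example usage with two logistic-regression learners (placeholder data assumed):
    # model_a, model_b = coupled_step(LogisticRegression(max_iter=1000),
    #                                 LogisticRegression(max_iter=1000),
    #                                 Xa_lab, Xb_lab, y_lab, Xa_unlab, Xb_unlab)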
Coupled Semi-Supervised Learning (CSSL)
Applications

• Image and Video Analysis: CSSL can be used in image classification, where
different views might be different augmentations of the same image or features
extracted using different convolutional layers.

• Natural Language Processing (NLP): In tasks like sentiment analysis or machine translation, different views could be based on different linguistic features or models.

• Bioinformatics: CSSL is used in genomic data analysis, where one model might
focus on sequence data and another on structural data.
Coupled Semi-Supervised Learning (CSSL)
Advantages

• Efficiency: It reduces the need for large amounts of labeled data, which can
be expensive and time-consuming to obtain.

• Improved Accuracy: By leveraging unlabeled data, CSSL can achieve better generalization and accuracy compared to purely supervised approaches.
Coupled Semi-Supervised Learning (CSSL)
Challenges

• Complexity: Designing effective coupling strategies and ensuring the models complement each other rather than reinforcing incorrect assumptions can be challenging.

• Sensitivity to Noisy Labels: If the pseudo-labeling process introduces errors, they can propagate through the learning process, potentially harming performance.
Macro reading vs Micro reading
Macro Reading
Macro reading involves understanding the overall meaning or theme of a larger text or document. It
focuses on grasping the big picture, such as the main topics, key ideas, or general sentiment.
Use Cases:
• Document Summarization: AI condenses a lengthy document into a brief summary, capturing the
essential points.
• Topic Modeling: Identifying and categorizing the major themes or topics within a collection of
texts.
• Sentiment Analysis: Determining the general sentiment or emotion conveyed in a large piece of
text (e.g., a full article, book, or series of tweets).
• Content Categorization: Classifying large amounts of content into predefined categories (e.g.,
news articles into topics like politics, sports, or entertainment).
Macro reading vs Micro reading
Techniques

• Latent Dirichlet Allocation (LDA): For topic modeling.
• Recurrent Neural Networks (RNNs) or Transformers: For understanding sequences in text over longer spans.
• TextRank: For summarization tasks.
• BERT or GPT models: For capturing context across paragraphs or entire documents.
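
For instance, a few lines of scikit-learn are enough to sketch LDA-style macro reading over a toy corpus; the four documents and the choice of two topics below are invented purely for illustration.

    from sklearn.feature_extraction.text import CountVectorizer
    from sklearn.decomposition import LatentDirichletAllocation

    docs = [
        "the government passed a new budget and tax bill",
        "the team won the championship after a late goal",
        "parliament debated the election and foreign policy",
        "the striker scored twice in the final match",
    ]

    # Bag-of-words counts are the usual input for LDA.
    vec = CountVectorizer(stop_words="english")
    X = vec.fit_transform(docs)

    lda = LatentDirichletAllocation(n_components=2, random_state=0)
    doc_topics = lda.fit_transform(X)      # per-document topic proportions

    # Top words per topic give a "macro" view of the corpus themes.
    terms = vec.get_feature_names_out()
    for k, comp in enumerate(lda.components_):
        top = comp.argsort()[-5:][::-1]
        print(f"topic {k}:", [terms[i] for i in top])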
Macro reading vs Micro reading
Micro Reading
Micro reading focuses on understanding specific details, individual sentences, or even words within
a text. It emphasizes the precise interpretation of small text segments.

Use Cases:
• Named Entity Recognition (NER): Identifying and classifying proper nouns (e.g., names of people,
places, organizations) within a text.
• Part-of-Speech Tagging: Determining the grammatical categories of each word (e.g., noun, verb,
adjective).
• Dependency Parsing: Analyzing the grammatical structure of a sentence to understand how
words relate to each other.
• Question Answering: Finding precise answers to questions within a specific passage of text.
Macro reading vs Micro reading
Techniques

• Word2Vec or GloVe: For learning word embeddings that capture word meanings and similarities.
• CRFs (Conditional Random Fields): For structured prediction tasks like NER.
• BERT: For understanding the nuanced meaning of words or sentences within their context.
• Seq2Seq Models: For translating or generating text based on specific inputs.
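
As a small example of micro reading, the snippet below runs named entity recognition with spaCy; it assumes the English model en_core_web_sm has been installed (python -m spacy download en_core_web_sm), and the sentence is an invented example.

    import spacy

    nlp = spacy.load("en_core_web_sm")
    doc = nlp("Apple opened a new office in Berlin in 2023, led by Tim Cook.")

    # Micro reading: each entity span and its label is extracted at the token level.
    for ent in doc.ents:
        print(ent.text, ent.label_)
    # Expected output is along the lines of:
    #   Apple ORG, Berlin GPE, 2023 DATE, Tim Cook PERSON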
Open IE
Open Information Extraction (Open IE) in AI refers to the process of
automatically extracting structured information (such as relationships,
entities, and events) from unstructured text without relying on a
predefined schema or ontology. Unlike traditional Information
Extraction (IE) systems, which require a predefined set of relations to
look for, Open IE systems can extract a wide range of relationships from
text, making them more flexible and scalable.
Open IE
• Unsupervised Learning: Open IE systems often operate in an unsupervised or
semi-supervised manner. They do not need extensive labeled training data to
learn specific relations but instead rely on linguistic patterns and heuristics to
identify and extract potential relations.
• Relation Extraction: The primary goal of Open IE is to extract relationships
between entities in a sentence.
• Textual Triples: Open IE systems typically represent the extracted information as
triples (subject, relation, object), which can then be used for further processing,
such as building knowledge graphs or querying.
• Scalability: Open IE is designed to handle large-scale text corpora, making it
suitable for applications like web-scale information extraction, where the number
of potential relations is vast and constantly growing.
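
The toy function below illustrates the triple idea with a deliberately naive subject-verb-object heuristic over spaCy's dependency parse; real Open IE systems (e.g., ReVerb, Stanford OpenIE) use far richer extraction rules, and the example sentences are invented.

    import spacy

    nlp = spacy.load("en_core_web_sm")

    def naive_triples(text):
        # Extract (subject, relation, object) triples from simple subject-verb-object sentences.
        triples = []
        for sent in nlp(text).sents:
            for token in sent:
                if token.pos_ == "VERB":
                    subjects = [w for w in token.lefts if w.dep_ in ("nsubj", "nsubjpass")]
                    objects = [w for w in token.rights if w.dep_ in ("dobj", "attr")]
                    if subjects and objects:
                        triples.append((subjects[0].text, token.lemma_, objects[0].text))
        return triples

    print(naive_triples("Barack Obama visited Paris. Marie Curie discovered radium."))
    # Roughly: [('Obama', 'visit', 'Paris'), ('Curie', 'discover', 'radium')]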
Open IE
Applications of Open IE
• Knowledge Graph Construction: Open IE is used to populate knowledge graphs with
entities and relationships extracted from large text corpora, enabling more effective
search and discovery of information.
• Question Answering Systems: Open IE can be used to extract relevant facts from text to
answer user queries, especially in open-domain question answering where the range of
possible answers is broad.
• Summarization: Extracted triples can be used to generate concise summaries of large
documents or articles by highlighting the key entities and their relationships.
• Content Analysis: Open IE helps in analyzing and understanding large volumes of text
data, such as social media content, news articles, or scientific papers, by extracting and
organizing relevant information.
Open IE
Challenges in Open IE

• Ambiguity and Polysemy: Words and phrases can have multiple meanings
depending on the context, making it challenging for Open IE systems to
accurately extract the correct relations.
• Complex Sentences: Sentences with complex structures, such as nested clauses
or multiple entities and relations, can be difficult for Open IE systems to parse
correctly.
• Lack of Schema: While the lack of a predefined schema provides flexibility, it can
also lead to inconsistencies in the extracted information, as the same relation
might be represented in multiple ways.
QUESTIONS
• Name some popular Open IE systems.
• What challenges arise when applying coupled semi-supervised
learning to domains with highly imbalanced datasets, and how can
these challenges be mitigated to ensure robust model training?
• In what scenarios is coupled semi-supervised learning most effective,
and how does it compare to other learning paradigms like co-training
or multi-task learning in terms of data efficiency and generalization?
