CS215 LectureSlidesSet2 IntroductionToMachineLearning AI

This document provides an introduction and overview of machine learning and social issues related to its use. It outlines the instructor of the course, Jacob Levman, and their credentials. It then gives a brief overview of machine learning methods, including supervised learning techniques like artificial neural networks and K-nearest neighbors, as well as unsupervised learning, data visualization, and validation. Applications and implications of machine learning are also discussed at a high level.


Introduction to Machine Learning
Social Issues in the
Information Age
Jacob Levman, PhD
Associate Professor
Department of Computer Science
St. Francis Xavier University

Winter, 2024
Instructor
• Dr. Jacob Levman
Associate Professor, Department of Computer Science, St. Francis Xavier University

Visiting Faculty, Massachusetts General Hospital, Harvard Medical School

Research Affiliate, Nova Scotia Health Authority

• Office: Physical Sciences Building 1020


• 902-867-2221
[email protected]
• Term: Winter 2024
• Course resources: Moodle (to be set up)
• Office Hours:
• Tuesdays 1:30 pm to 2:20 pm
• Wednesdays 2:30 pm to 3:20 pm, and 3:30 pm to 4:20 pm
• Fridays 2:30 to 3:20 pm
Machine Learning
Why Machine Learning?
Why Now?
Machine Learning Methods
Overview
• Supervised learning
• Artificial neural networks
• K Nearest Neighbour
• Etc.
• Unsupervised learning
• K-means clustering
• Hierarchical clustering
• Etc.
• Data Visualization
• An adjunct to statistical validation
• Validation
• Evaluation criteria (Overall Accuracy, ROC analyses, sensitivity, specificity, PPV,
NPV)
• Independent datasets
• Within dataset validation
Types of Learning: Supervised
• Most common class of machine learning
• Also called Classification (assigning samples to
defined classes)
[Diagram: examples of interest and examples not of interest train a machine learning model; for a new sample, the result is a label: sample is of interest or NOT.]

• But are machine learning techniques necessarily adaptive?
• No! Not necessarily!
• Some techniques (unsupervised learning especially) don't improve as they operate at all, but we still call them machine learning
Basic Learning Paradigm

[Diagram: examples of interest and examples not of interest train a machine learning model; a new sample is classified as of interest or NOT.]
Feedback critical for future tech

[Diagram: the same learning pipeline, with the result fed back into the training examples.]
MACHINE LEARNING BEHAVIOUR CHANGES WITH TRAINING DATA!
NEEDS RE-VALIDATION
Except on USS Voyager?

[Diagram: the same learning pipeline.]
HOW TO HANDLE THIS PROBLEM?
IN CURRENT PRACTICE WE REMOVE THE ‘LIVE’ FEEDBACK
VALIDATION OCCURS ONCE
AI behaviour doesn’t change while in operation
Updates require extensive re-validation, new version release

[Diagram: the learning pipeline with the live feedback loop removed.]
Implications of removing live
feedback

• Currently the only acceptable option for


medical diagnostics
• Without live feedback we lower the risk of
AI conquering us all!
• Limits on tech can produce suboptimal
performance
• Improving performance requires extensive
re-validation (time consuming, costly, ….)
• Machines that retrain ‘on-the-fly’ are
inherently dangerous; their safety is
difficult to ensure
Discussion Break

• AI Risks to our collective safety and security


Machine Learning’s Future

• How long until we trust a holographic doctor?


• A very long time!
Adaptive Learning – example
without ethical limitations:
• A major challenge of supervised learning: many classifiers are LINEAR!
• Advanced techniques allow nonlinear solutions as well
• Future subjects
In practice things get complicated
quickly
• Not just 2 input feature measurements
• Maybe input is an image 300x600 or larger
• Maybe input is a time series of variable length data
(written words, audio clips, stock price history)
• These can be smooshed or clamped/trimmed to fit in a
spreadsheet (as for your project), but modern
methods for these challenging data types retain aspects
of the input configuration in the learning machine
• How challenging do things get? Image analysis is a
great example…….
Supervised Learning Example

[A sequence of image slides stepping through the example.]
Types of Learning: Supervised
• Most common class of machine learning
• Also called Classification (assigning samples to
defined classes)
[Diagram: examples of interest and examples not of interest train a machine learning model; a new sample is classified as of interest or NOT.]
Data-driven approach
• Collect a dataset with example measurements (could be an
image for example) and labels
• Use machine learning to train a classifier
• Evaluate classifier on withheld set of test images
• Simple example of what API code will look like:
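The slide's code image is not reproduced here, but as a rough sketch of what such API code typically looks like (scikit-learn and the synthetic dataset are my assumptions, not necessarily the course's actual tooling):

```python
# A hypothetical sketch (assumed library: scikit-learn; synthetic data)
# of the collect -> train -> evaluate workflow described above.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Collect a dataset: example measurements plus labels
X, y = make_classification(n_samples=200, n_features=4, random_state=0)

# Withhold a test set for later evaluation
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0)

# Use machine learning to train a classifier
clf = LogisticRegression(max_iter=1000)
clf.fit(X_train, y_train)

# Evaluate the classifier on the withheld test samples
accuracy = clf.score(X_test, y_test)
print(f"Test accuracy: {accuracy:.2f}")
```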
We package all that up with fair statistical
comparisons, feature selection, comparison
between many prominent ML technologies
and have a spreadsheet as the input to the
program! (no programming required)

• So how do you evaluate how well the machine worked?


Background – Validation

• TP: True Positives https://www.medcalc.org/manual/roc-curves.php

• Samples of interest correctly labelled by ML


• TN: True Negatives
• Samples not of interest correctly labelled by ML
• FP: False Positives
• Samples not of interest incorrectly labelled by ML
• FN: False Negatives
• Samples of interest incorrectly labelled by ML
Background – Validation
• Sensitivity (also called recall): https://www.medcalc.org/manual/roc-curves.php

• The proportion of samples/patients of interest correctly classified


• TP / (TP + FN)
• Specificity:
• The proportion of samples/patients not of interest correctly classified
• TN / (TN + FP)
• Positive Predictive Value (PPV also called precision):
• The proportion of samples/patients predicted to be of interest that actually are of interest
• TP / (TP + FP)
• Negative Predictive Value (NPV):
• The proportion of samples/patients predicted to be not of interest that are actually not of interest
• TN / (TN + FN)
• Overall Accuracy (OA):
• The proportion of samples/patients correctly classified
• (TP + TN) / (TP+ TN + FP + FN)
• Test Error:
• Defined in various ways, generally simple – summarizing deviation from ground truth: Proportion of Errors
relative to all cases.
• (FP+FN)/(TP+TN+FP+FN)
• Sum of Squares Error:
• Squaring each deviation forces positive values for all differences
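The definitions above translate directly into code; as an illustrative sketch (my own, not course material):

```python
def validation_metrics(tp, tn, fp, fn):
    """Compute the standard validation metrics from the four
    confusion counts (TP, TN, FP, FN) defined above."""
    total = tp + tn + fp + fn
    return {
        "sensitivity": tp / (tp + fn),   # also called recall
        "specificity": tn / (tn + fp),
        "ppv":         tp / (tp + fp),   # also called precision
        "npv":         tn / (tn + fn),
        "overall_accuracy": (tp + tn) / total,
        "test_error":  (fp + fn) / total,
    }

m = validation_metrics(tp=40, tn=45, fp=5, fn=10)
print(m)
```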
Background – Validation
• Receiver Operating Characteristic Curve
Analysis
• Vary threshold/criterion

https://www.medcalc.org/manual/roc-curves.php

› Area under the ROC curve: AUC, a robust metric for separation between
two groups, assessing Dx potential w/o knowing operating point in
advance
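As an illustrative sketch (my own code, not course material): the area under the ROC curve equals the probability that a randomly chosen sample of interest scores higher than a randomly chosen sample not of interest, which gives a simple way to compute it without explicitly sweeping thresholds:

```python
def roc_auc(scores_pos, scores_neg):
    """Area under the ROC curve, computed as the probability that a
    randomly chosen positive sample scores higher than a randomly
    chosen negative one (ties count half). This is equivalent to
    sweeping the decision threshold across all operating points."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))

# Perfectly separated groups give AUC = 1.0; full overlap gives 0.5
print(roc_auc([0.9, 0.8, 0.7], [0.3, 0.2, 0.1]))  # -> 1.0
```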

› We have some basics about how to evaluate ML models, so let’s jump in


and learn our first basic technique……
K Nearest Neighbour

KNN is a simple algorithm that stores


all available cases and classifies
new cases based on a similarity
measure
K-NN
[Scatter plot: Loan $ ($0 to $250,000) vs. Age (15 to 65), with Default and Non-Default samples marked.]
K-NN

[The same Loan $ vs. Age scatter plot of Default and Non-Default samples.]

• All distance measurements are sorted from smallest to


largest
• Analyze the first (smallest) K (user defined parameter)
distance measurements
• Voting system, who wins?
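The steps above can be sketched in Python (an illustrative implementation, not course-supplied code):

```python
import math
from collections import Counter

def knn_classify(train, new_sample, k):
    """Classify new_sample by majority vote among its k nearest
    training samples. `train` is a list of (features, label) pairs."""
    # Measure the distance from the new sample to every training sample
    distances = []
    for features, label in train:
        d = math.dist(features, new_sample)   # Euclidean distance
        distances.append((d, label))
    # Sort all distance measurements from smallest to largest
    distances.sort(key=lambda pair: pair[0])
    # Analyze only the first (smallest) k distances: a voting system
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]

train = [((1, 1), "default"), ((2, 1), "default"),
         ((8, 9), "non-default"), ((9, 8), "non-default")]
print(knn_classify(train, (1.5, 1.2), k=3))  # -> "default"
```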
K Nearest Neighbour
K Nearest Neighbour – How to
choose K?
K Nearest Neighbour

• Strengths
• Simple and intuitive
• Effective (in a basic way!)
• Flexible decision boundary
• Weaknesses
• Easily misled by noise
• Easily misled by irrelevant features
• Must choose a distance function (Euclidean is often too simplistic)
• Vulnerable to high dimensionality problems
• Computation costs can be high
• Many irrelevant distances to distant training samples are computed
though unused
• How to handle unbalanced distributions (more of one group than the
other)?

› So now that we understand the KNN basics, how do we practically
implement it in Python?
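One common practical route (scikit-learn is an assumption here; the course's own tooling may differ) is `KNeighborsClassifier`, with a bundled dataset standing in for real course data:

```python
# Illustrative KNN in Python via scikit-learn (an assumed dependency);
# the bundled breast cancer dataset stands in for real course data.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)

knn = KNeighborsClassifier(n_neighbors=5)  # K is the user-defined parameter
knn.fit(X_train, y_train)
accuracy = knn.score(X_test, y_test)
print(f"Test accuracy: {accuracy:.2f}")
```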
K Nearest Neighbour – Case
Study

• 10 measurements per sample


• Thus 10 dimensions!
• Measurements from histopathology
• i.e., analyzing cells under a microscope
K Nearest Neighbour – Case
Study
• Results:

› Older results (2000)


› Many more modern techniques available
now
› Likely to be outperformed in a more recent
repeat analysis
K Nearest Neighbour – Case
Study

• Results:

Machine learning: ECML-98, 1998 - Springer


› In 1998, SVM was new
– Will cover in detail later

› K-NN performed best of ‘traditional’


techniques
K Nearest Neighbour – Another
Case Study
My background

• Computer Engineering
• Electrical and Computer Engineering
• Medical Biophysics (Physics Stream)
• Imaging Research Postdoc
• Biomedical Engineering Postdoc
• Neuroscience Postdoc
Bringing Together Disparate Fields
Research Outline
• Physics Research
• Computational Neuroscience Research
• Machine Learning Research
• Neuroscience Research

Image from Forbes.com Image from Scientific


American
Research fMRI

Active and Passive fMRI for Presurgical Mapping of Motor and Language Cortex
By Bradley Goodyear, Einat Liebenthal and Victoria Mosher
DOI: 10.5772/58269
Research - fMRI
Research – Diffusion MRI
• Diffusion MRI based on Doppler effect
• Inherently less signal when based on phase shift
• Diffusion measurements acquired in many directions
• Reliability can be a challenge
• Particularly for making pretty tractograms

Image from Boston Children’s Hospital,


Harvard Medical School
Image from John Radcliffe Hospital,
University of Oxford
Scalar Diffusion MRI

I Fragata, et al., Early Prediction of Delayed Ischemia …, Stroke 2017, 48(8): 2091-2097.
Research - Diffusion MRI
Computational Neuroscience
Research
Machine Learning Applications:
PICUs
• Predict cardiac arrest before it happens
• Predict renal failure before it happens
• Predict any actionable circumstances in the clinic
Machine Learning Research
Modelling the Radiologist
• Reliably predict when machines can equal or outperform the radiologist
• Report triage statistics, like statistical percentiles governing where a given sample/patient falls
relative to training (i.e. machine can report, 100% of patients this extreme have autism, 94% of
patients presenting like this turn out to have multiple sclerosis, 100% of all patients with this kind of
an image profile are healthy or would be called normal by a radiologist)
• Eventually machines will be handling more and more of their workload
Machine Learning Research
Video Processing
Machine Learning Research
COVID-19 Detection from Lung CT
Machine Learning Research
Brain MRI

• Image from consultqd.clevelandclinic.org ("Making the Most of Brain MRI: Machine Learning Integrated with Image Post-Processing")
General Purpose Machine Learning
• Test bias setting alters the curvature of the decision function to match local data
distribution

Levman et al., Journal of Digital Imaging, 27:145-151, 2014


General Purpose Machine Learning
Tissue outcome prediction – is tissue at risk of death?
• Combining physiological measurements

Levman et al., ISMRM 2014, Milan, selected for oral presentation


Ensembles of Learners

• Image from a Google Images search for "ensemble learning"
Neuroscience Research

• FreeSurfer measurements
• Talk about how future work will entail looking into fMRI and
diffusion difference abnormalities in a variety of medical
conditions
Acknowledgements

• Dr. Emi Takahashi (PhD, Neuroscience)


• Funding: National Institutes of Health (NIH)

• Faculty in the Institute of Biomedical Engineering: Dr. Stephen Payne


• Funding: Wellcome Trust

• Dr. Anne Martel (medical physics)


• Funding: Canadian Breast Cancer Foundation, CIHR, CBCRA, OGS
• Technical & Clinical Researchers

Current Funding: Canada Research Chair program (NSERC), CFI


Machine Learning Methods
Overview
• Supervised learning
• Artificial neural networks
• Support vector machines
• Linear discriminant analysis
• Etc.
• Unsupervised learning
• K-means clustering
• Hierarchical clustering
• Etc.
• Dimensionality reduction
• Principal components analysis
• Independent components analysis
• Etc.
• Data Visualization
• An adjunct to statistical validation
• Validation
• Evaluation criteria (Overall Accuracy, ROC analyses, sensitivity, specificity, PPV, NPV)
• Independent datasets
• Within dataset validation
Lecture Plan

• Introduction and Background


• Technique focused approach:
• Present a major technique, including key
mathematics/algorithms
• Present examples of its use in the real world
(scientific literature and/or industry)
• Demonstrate its use to the class as much as possible
• Live demo
• Real world data
• Will generally start with easier techniques and work up
to harder ones
Machine Learning Methods
Overview
• Supervised learning
• Artificial neural networks
• Support vector machines
• Linear discriminant analysis
• Etc.
• Unsupervised learning
• K-means clustering
• Hierarchical clustering
• Etc.
• Dimensionality reduction
• Principal components analysis
• Independent components analysis
• Etc.
• Data Visualization
• An adjunct to statistical validation
• Validation
• Evaluation criteria (Overall Accuracy, ROC analyses, sensitivity, specificity, PPV, NPV)
• Independent datasets
• Within dataset validation
Intro to Unsupervised Learning

• No labels, no ground truth!


• No examples provided to the algorithm
• Algorithm typically groups samples into classes of which it knows
NOTHING! (except for the representative examples it places therein)
• Algorithm attempts to find patterns in the data w/o a priori info
• Medical Images: finding regions-of-interest
• Big medical data: find natural groupings within a condition (subtypes of
ADHD)
• Usually more challenging to evaluate performance compared with SL
Intro to Unsupervised Learning

• Generally we want to minimize the within-class


distance between samples while simultaneously
maximizing the between-class distance
• Can also be evaluated as per congruency with
known classes not provided to the algorithm
• Caveat: if you know classes, SL will probably
outperform by benefitting from this information
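As a minimal sketch of one such unsupervised method, k-means clustering (my own illustrative code; it alternates between assigning each point to its nearest centre and recomputing centres as cluster means):

```python
import random

def kmeans(points, k, iters=20, seed=0):
    """Minimal k-means on 2-D points: assign each point to its nearest
    centre, then recompute each centre as the mean of its members."""
    rng = random.Random(seed)
    centres = rng.sample(points, k)
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for x, y in points:
            # nearest-centre assignment (squared Euclidean distance)
            i = min(range(k), key=lambda j: (x - centres[j][0]) ** 2
                                          + (y - centres[j][1]) ** 2)
            clusters[i].append((x, y))
        for j, members in enumerate(clusters):
            if members:  # recompute centre as the cluster mean
                centres[j] = (sum(p[0] for p in members) / len(members),
                              sum(p[1] for p in members) / len(members))
    return centres, clusters

# Two well-separated, unlabelled groups of points
pts = [(0, 0), (0, 1), (1, 0), (9, 9), (9, 10), (10, 9)]
centres, clusters = kmeans(pts, k=2)
print(sorted(centres))
```

Note the algorithm is told nothing about the groups; it discovers them from the data alone, which is exactly the within-class/between-class distance trade-off described above.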
Example: a cholera outbreak in London

Many years ago (in 1854), during a cholera outbreak in London, the
physician John Snow plotted the location of cases on a map. Properly
visualized, the data indicated that cases clustered around certain
intersections, where there were polluted wells, not only exposing the
cause of cholera but also indicating what to do about the problem.

[Sketch: case locations marked with X's on a map, clustering in certain areas.]
• A technique demanded by many real world
tasks
– Bank/Internet Security: fraud/spam pattern discovery
– Biology: taxonomy of living things such as kingdom, phylum, class, order, family,
genus and species
– City-planning: Identifying groups of houses according to their house type, value, and
geographical location
– Climate change: understanding earth’s climate, finding atmospheric and oceanic
weather change patterns
– Finance: stock clustering analysis to uncover correlation underlying shares
– Image Compression/segmentation: coherent pixels grouped
– Information retrieval/organisation: Google search, topic-based news
– Land use: Identification of areas of similar land use in an earth observation database
– Marketing: Help marketers discover distinct groups in their customer bases, and
then use this knowledge to develop targeted marketing programs
– Social network mining: special interest group automatic discovery
• Imaging: Unsupervised learning is often called
image segmentation

https://www.mathworks.com/discovery/image-segmentation.html
Data courtesy of Boston
Children’s Hospital, Harvard
Medical School
Intro to Validation

• Ideal validation:
• Assessment on many independently acquired datasets
• Challenges: independent dataset often not available to researcher
• Alternative: Publish on a single dataset, validation by other researchers comparing
their work to your publication
• How to have confidence in self assessed ML performance on a single dataset?
• Validation!
• K-fold validation
• Randomized trials
• Leave one out
• Efron’s bootstrap
• Metrics to assess performance
• AUC
• OA
• Sensitivity
• Specificity
• PPV
• NPV
Background – Validation in Supervised
Learning: Leave-one-out validation
Background – Validation in Supervised
Learning: Leave-one-out validation

• Or average the OA or other evaluative metric
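A sketch of leave-one-out validation (illustrative only; the 1-nearest-neighbour classifier is just a stand-in for whatever model is being validated):

```python
import math

def leave_one_out_accuracy(samples):
    """Leave-one-out validation with a 1-nearest-neighbour classifier:
    each sample in turn is withheld, the model 'trains' on the rest,
    and the withheld sample's predicted label is checked."""
    correct = 0
    for i, (features, label) in enumerate(samples):
        train = samples[:i] + samples[i + 1:]   # leave sample i out
        nearest = min(train, key=lambda s: math.dist(s[0], features))
        if nearest[1] == label:
            correct += 1
    return correct / len(samples)

data = [((0, 0), "A"), ((0, 1), "A"), ((1, 0), "A"),
        ((5, 5), "B"), ((5, 6), "B"), ((6, 5), "B")]
print(leave_one_out_accuracy(data))  # -> 1.0
```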


Background – Validation in Supervised
Learning: K – fold cross validation

• Group-wise equivalent of Leave-one-out (LOO)


• Divide dataset into K random, non-overlapping
groups
• Perform LOO on a group wise basis
• Train on all but the current group
• Test on the current group
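The grouping logic can be sketched as follows (my own illustrative code):

```python
import random

def k_fold_splits(n_samples, k, seed=0):
    """Divide sample indices into k random, non-overlapping groups,
    then yield (train_indices, test_indices) pairs: test on each
    group in turn, train on all the others."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i
                 for idx in fold]
        yield train, test

for train, test in k_fold_splits(n_samples=10, k=5):
    print(sorted(test))
```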
Background – Validation in
Supervised Learning
• Randomized Trials / Bootstrapping
• Randomly select X% of samples for training
• Remaining (100-X)% of samples are for testing
• Evaluate performance
• Repeat many many times
• Summarize performance with statistics (mean, SD, CI
etc.)
• Alternative variations available
• Efron’s 0.632+ bootstrap, which allows repeat samples within
the training set only
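A sketch of the randomized-trials procedure (illustrative; a 1-nearest-neighbour classifier stands in for the model, and this is the plain random-split variant, not Efron's bootstrap):

```python
import math
import random
import statistics

def randomized_trials(samples, train_fraction=0.7, n_trials=50, seed=0):
    """Repeatedly pick a random train/test split, score a 1-nearest-
    neighbour classifier on the held-out part, and summarize the
    accuracies with a mean and standard deviation."""
    rng = random.Random(seed)
    accuracies = []
    for _ in range(n_trials):
        shuffled = samples[:]
        rng.shuffle(shuffled)
        cut = int(len(shuffled) * train_fraction)
        train, test = shuffled[:cut], shuffled[cut:]
        correct = sum(
            1 for feats, label in test
            if min(train, key=lambda s: math.dist(s[0], feats))[1] == label)
        accuracies.append(correct / len(test))
    return statistics.mean(accuracies), statistics.stdev(accuracies)

# Two cleanly separated classes, so every trial classifies perfectly
data = [((i, i % 3), "A") for i in range(10)] + \
       [((i + 20, i % 3), "B") for i in range(10)]
mean_acc, sd_acc = randomized_trials(data)
print(f"accuracy = {mean_acc:.2f} +/- {sd_acc:.2f}")
```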
Background – Validation: 3 Way
Splits
• So far we’ve discussed dividing data into training and testing
sets
• Additional validation approaches include 3 main groupings:
training, testing and validation datasets
• Training and testing proceed as before
• Once complete and a final model selected, it is
evaluated for performance on the validation set (whose data
was NEVER used during training or model selection)
• Note sometimes what is referred to as the testing and
validation sets are reversed
Machine Learning Methods
Overview
• Supervised learning
• KNN
• Artificial neural networks
• Support vector machines
• Linear discriminant analysis
• Etc.
• Unsupervised learning
• K-means clustering
• Hierarchical clustering
• Etc.
• Dimensionality reduction
• Principal components analysis
• Independent components analysis
• Etc.
• Data Visualization
• An adjunct to statistical validation
• Validation
• Evaluation criteria (Overall Accuracy, ROC analyses, sensitivity, specificity, PPV, NPV)
• Independent datasets
• Within dataset validation

• Having covered a nice intro/overview and the KNN basics,
let’s look at some more advanced KNN approaches
K Nearest Neighbour
K Nearest Neighbour –
Regression Adaptation

• Example on whiteboard
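Since the worked example lives on the whiteboard, here is an illustrative sketch (my own) of the regression adaptation: the voting step is replaced by averaging the K nearest neighbours' target values:

```python
import math

def knn_regress(train, new_sample, k):
    """Predict a continuous value for new_sample as the mean of the
    target values of its k nearest training samples (voting is
    replaced by averaging)."""
    by_distance = sorted(train, key=lambda s: math.dist(s[0], new_sample))
    nearest = by_distance[:k]
    return sum(target for _, target in nearest) / k

# Target is roughly y = 10 * x for these illustrative points
train = [((1.0,), 10.0), ((2.0,), 20.0), ((3.0,), 30.0), ((10.0,), 100.0)]
print(knn_regress(train, (2.1,), k=3))  # -> 20.0
```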
Supervised Learning Example
Revisited

[A sequence of image slides revisiting the earlier example.]

• Surely we can do better than KNN?


References for These Course
Lecture Slides
• (Cal Tech) Machine Learning & Data Mining
• http://www.yisongyue.com/courses/cs155/2017_winter/
• Lior Rokach, Ben Gurion University of the Negev (https://www.slideshare.net/liorrokach/introduction-to-machine-learning-13809045/1)
• CS4811 AI lecture notes: (http://pages.mtu.edu/~nilufer/classes/cs4811/2009-spring/)
• Ke Chen COMP24111 (https://studentnet.cs.manchester.ac.uk/ugt/COMP24111/)
• Gwen Englebienne (http://gwenn.dk/mlpr/)
• http://www.robots.ox.ac.uk/~az/lectures/ml/lect1.pdf
• http://cs231n.stanford.edu/slides/2016/
• Saed Sayad chem-eng.utoronto.ca/~datamining/Presentations/KNN.ppt
• http://classes.engr.oregonstate.edu/eecs/spring2012/cs534/notes/knn.pdf
• http://dataaspirant.com/2016/12/23/k-nearest-neighbor-classifier-intro/
• David Sontag http://cs.nyu.edu/~dsontag/courses/ml12/slides
