0% found this document useful (0 votes)

89 views20 pages

1 Lecture 2: Supervised Machine Learning

The document describes a supervised machine learning problem of predicting diabetes risk from patient data. It discusses: 1) The dataset contains measurements like BMI, blood pressure, age and sex for diabetes patients along with a target value for diabetes risk. 2) A linear regression algorithm is used to find parameters that model the relationship between BMI and diabetes risk as a linear function. 3) The trained model can then be used to predict diabetes risk for new patients based on their BMI.

Uploaded by

Jeremy Wang

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

89 views20 pages

1 Lecture 2: Supervised Machine Learning

Uploaded by

Jeremy Wang

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 20

lecture2-supervised-learning

September 15, 2020

1 Lecture 2: Supervised Machine Learning

1.0.1 Applied Machine Learning

Volodymyr KuleshovCornell Tech

2 Recall: Supervised Learning

The most common approach to machine learning is supervised learning.

1. First, we collect a dataset of labeled training examples.
2. We train a model to output accurate predictions on this dataset.
3. When the model sees new, similar data, it will also be accurate.

3 Part 1: A First Supervised Machine Learning Problem

Let’s start with a simple example of a supervised learning problem: predicting diabetes risk.
Suppose we have a dataset of diabetes patients. * For each patient we have a access to measurements
from their medical record and an estimate of diabetes risk. * We are interested in understanding
how the measurements affect an individual’s diabetes risk.

4 Three Components of A Supervised Machine Learning Problem

At a high level, a supervised machine learning problem has the following structure:

Dataset + Algorithm → Predictive Model

The predictive model is chosen to model the relationship between inputs and targets. For instance,
it can predict future targets.

1
5 A Supervised Learning Dataset

Let’s return to our example: predicting diabates risk. What would a dataset look like?
We will use the UCI Diabetes Dataset; it’s a toy dataset that’s often used to demonstrate machine
learning algorithms. * For each patient we have a access to a measurement of their body mass index
(BMI) and a quantiative diabetes risk score (from 0-400). * We are interested in understanding
how BMI affects an individual’s diabetes risk.
[2]: import numpy as np
import pandas as pd
from sklearn import datasets

# Load the diabetes dataset

diabetes_X, diabetes_y = datasets.load_diabetes(return_X_y=True, as_frame=True)

# Use only the BMI feature

diabetes_X = diabetes_X.loc[:, ['bmi']]

# The BMI is zero-centered and normalized; we recenter it for ease of␣

,→presentation

diabetes_X = diabetes_X * 30 + 25

# Collect 20 data points

diabetes_X_train = diabetes_X.iloc[-20:]
diabetes_y_train = diabetes_y.iloc[-20:]

# Display some of the data points

pd.concat([diabetes_X_train, diabetes_y_train], axis=1).head()

[2]: bmi target

422 27.335902 233.0
423 23.811456 91.0
424 25.331171 111.0
425 23.779122 152.0
426 23.973128 120.0

We can also visualize this two-dimensional dataset.

[3]: %matplotlib inline
import matplotlib.pyplot as plt
plt.rcParams['figure.figsize'] = [12, 4]

plt.scatter(diabetes_X_train, diabetes_y_train, color='black')

plt.xlabel('Body Mass Index (BMI)')
plt.ylabel('Diabetes Risk')

[3]: Text(0, 0.5, 'Diabetes Risk')

2
6 A Supervised Learning Algorithm (Part 1)

What is the relationship between BMI and diabetes risk?

We could assume that risk is a linear function of BMI. In other words, for some unknown θ0 , θ1 ∈ R,
we have
y = θ1 · x + θ0 ,
where x is the BMI (also called the dependent variable), and y is the diabetes risk score (the
independent variable).
Note that θ1 , θ0 are the slope and the intercept of the line relates x to y. We call them parameters.
We can visualize this for a few values of θ1 , θ0 .
[4]: theta_list = [(1, 2), (2,1), (1,0), (0,1)]
for theta0, theta1 in theta_list:
x = np.arange(10)
y = theta1 * x + theta0
plt.plot(x,y)

3
7 A Supervised Learning Algorithm (Part 2)

Assuming that x, y follow the above linear relationship, the goal of the supervised learning
algorithm is to find a good set of parameters consistent with the data.
We will see many algorithms for this task. For now, let’s call the sklearn.linear_model library
to find a θ1 , θ0 that fit the data well.
[6]: from sklearn import linear_model
from sklearn.metrics import mean_squared_error

# Create linear regression object

regr = linear_model.LinearRegression()

# Train the model using the training sets

regr.fit(diabetes_X_train, diabetes_y_train.values)

# Make predictions on the training set

diabetes_y_train_pred = regr.predict(diabetes_X_train)

# The coefficients
print('Slope (theta1): \t', regr.coef_[0])
print('Intercept (theta0): \t', regr.intercept_)

Slope (theta1): 37.378842160517664

Intercept (theta0): -797.0817390342369

8 A Supervised Learning Model

The supervised learning algorithm gave us a pair of parameters θ1∗ , θ0∗ . These define the predictive
model f ∗ , defined as
f (x) = θ1∗ · x + θ0∗ ,
where again x is the BMI, and y is the diabetes risk score.
We can visualize the linear model that fits our data.
[7]: plt.xlabel('Body Mass Index (BMI)')
plt.ylabel('Diabetes Risk')
plt.scatter(diabetes_X_train, diabetes_y_train)
plt.plot(diabetes_X_train, diabetes_y_train_pred, color='black', linewidth=2)

[7]: [<matplotlib.lines.Line2D at 0x1253f9240>]

4
9 Predictions Using Supervised Learning

Given a new dataset of patients with a known BMI, we can use this model to estimate their diabetes
risk.
Given a new x′ , we can output a predicted y ′ as

y ′ = f (x′ ) = θ1∗ · x′ + θ0 .

Let’s start by loading more data. We will load three new patients (shown in red below) that we
haven’t seen before.
[8]: # Collect 3 data points
diabetes_X_test = diabetes_X.iloc[:3]
diabetes_y_test = diabetes_y.iloc[:3]

plt.scatter(diabetes_X_train, diabetes_y_train)
plt.scatter(diabetes_X_test, diabetes_y_test, color='red')
plt.xlabel('Body Mass Index (BMI)')
plt.ylabel('Diabetes Risk')
plt.legend(['Initial patients', 'New patients'])

[8]: <matplotlib.legend.Legend at 0x1259cd390>

5
Our linear model provides an estimate of the diabetes risk for these patients.
[9]: # generate predictions on the new patients
diabetes_y_test_pred = regr.predict(diabetes_X_test)

# visualize the results

plt.xlabel('Body Mass Index (BMI)')
plt.ylabel('Diabetes Risk')
plt.scatter(diabetes_X_train, diabetes_y_train)
plt.scatter(diabetes_X_test, diabetes_y_test, color='red', marker='o')
plt.plot(diabetes_X_train, diabetes_y_train_pred, color='black', linewidth=1)
plt.plot(diabetes_X_test, diabetes_y_test_pred, 'x', color='red', mew=3,␣
,→markersize=8)

plt.legend(['Model', 'Prediction', 'Initial patients', 'New patients'])

[9]: <matplotlib.legend.Legend at 0x125bfb048>

6
10 Why Supervised Learning?

Supervised learning can be useful in many ways. * Making predictions on new data. * Understand-
ing the mechanisms through which input variables affect targets.

11 Applications of Supervised Learning

Many of the most important applications of machine learning are supervised: * Classifying medical
images. * Translating between pairs of languages. * Detecting objects in a self-driving car.
# Part 2: Anatomy of a Supervised Learning Problem: Datasets
We have seen a simple example of a supervised machine learning problem and an algorithm for
solving this problem.
Let’s now look at what a general supervised learning problem looks like.

12 Recall: Three Components of A Supervised Machine Learning

Problem

At a high level, a supervised machine learning problem has the following structure:

Dataset + Algorithm → Predictive Model

The predictive model is chosen to model the relationship between inputs and targets. For instance,
it can predict future targets.

13 A Supervised Learning Dataset

We are going to dive deeper into what’s a supervised learning dataset. As an example, consider
the full version of the UCI Diabetes Dataset seen earlier.
Previsouly, we only looked at the patients’ BMI, but this dataset actually records many additional
measurements.
The UCI dataset contains many additional data columns besides bmi, including age, sex, and blood
pressure. We can ask sklearn to give us more information about this dataset.
[10]: import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
plt.rcParams['figure.figsize'] = [12, 4]
from sklearn import datasets

# Load the diabetes dataset

7
diabetes = datasets.load_diabetes(as_frame=True)
print(diabetes.DESCR)

.. _diabetes_dataset:

Diabetes dataset
----------------

Ten baseline variables, age, sex, body mass index, average blood
pressure, and six blood serum measurements were obtained for each of n =
442 diabetes patients, as well as the response of interest, a
quantitative measure of disease progression one year after baseline.

Data Set Characteristics:

:Number of Instances: 442

:Number of Attributes: First 10 columns are numeric predictive values

:Target: Column 11 is a quantitative measure of disease progression one year

after baseline

:Attribute Information:
- age age in years
- sex
- bmi body mass index
- bp average blood pressure
- s1 tc, T-Cells (a type of white blood cells)
- s2 ldl, low-density lipoproteins
- s3 hdl, high-density lipoproteins
- s4 tch, thyroid stimulating hormone
- s5 ltg, lamotrigine
- s6 glu, blood sugar level

Note: Each of these 10 feature variables have been mean centered and scaled by
the standard deviation times `n_samples` (i.e. the sum of squares of each column
totals 1).

Source URL:
https://www4.stat.ncsu.edu/~boos/var.select/diabetes.html

For more information see:

Bradley Efron, Trevor Hastie, Iain Johnstone and Robert Tibshirani (2004) "Least
Angle Regression," Annals of Statistics (with discussion), 407-499.
(https://web.stanford.edu/~hastie/Papers/LARS/LeastAngle_2002.pdf)

8
14 A Supervised Learning Dataset: Notation

We say that a training dataset of size n (e.g., n patients) is a set

D = {(x(i) , y (i) ) | i = 1, 2, ..., n}

Each x(i) denotes an input (e.g., the measurements for patient i), and each y (i) ∈ Y is a target
(e.g., the diabetes risk).
Together, (x(i) , y (i) ) form a training example.
We can look at the diabetes dataset in this form.
[11]: # Load the diabetes dataset
diabetes_X, diabetes_y = diabetes.data, diabetes.target

# Print part of the dataset

diabetes_X.head()

[11]: age sex bmi bp s1 s2 s3 \

0 0.038076 0.050680 0.061696 0.021872 -0.044223 -0.034821 -0.043401
1 -0.001882 -0.044642 -0.051474 -0.026328 -0.008449 -0.019163 0.074412
2 0.085299 0.050680 0.044451 -0.005671 -0.045599 -0.034194 -0.032356
3 -0.089063 -0.044642 -0.011595 -0.036656 0.012191 0.024991 -0.036038
4 0.005383 -0.044642 -0.036385 0.021872 0.003935 0.015596 0.008142

s4 s5 s6
0 -0.002592 0.019908 -0.017646
1 -0.039493 -0.068330 -0.092204
2 -0.002592 0.002864 -0.025930
3 0.034309 0.022692 -0.009362
4 -0.002592 -0.031991 -0.046641

15 Training Dataset: Inputs

More precisely, an input x(i) ∈ X is a d-dimensional vector of the form

 (i) 
x
 1(i) 
x2 
x(i) = 
 .. 

 . 
(i)
xd

For example, it could be the measurements the values of the d features for patient i.
The set X is called the feature space. Often, we have, X = Rd .
Let’s look at data for one patient.

9
[12]: diabetes_X.iloc[0]

[12]: age 0.038076

sex 0.050680
bmi 0.061696
bp 0.021872
s1 -0.044223
s2 -0.034821
s3 -0.043401
s4 -0.002592
s5 0.019908
s6 -0.017646
Name: 0, dtype: float64

16 Training Dataset: Attributes

We refer to the numerical variables describing the patient as attributes. Examples of attributes
include: * The age of a patient. * The patient’s gender. * The patient’s BMI.
Note that thes attributes in the above example have been mean-centered at zero and re-scaled to
have a variance of one.

17 Training Dataset: Features

Often, an input object has many attributes, and we want to use these attributes to define more
complex descriptions of the input.
• Is the patient old and a man? (Useful if old men are at risk).
• Is the BMI above the obesity threshold?
We call these custom attributes features.
Let’s create an “old man” feature.
[13]: diabetes_X['old_man'] = (diabetes_X['sex'] > 0) & (diabetes_X['age'] > 0.05)
diabetes_X.head()

[13]: age sex bmi bp s1 s2 s3 \

s4 s5 s6 old_man
0 -0.002592 0.019908 -0.017646 False

10
1 -0.039493 -0.068330 -0.092204 False
2 -0.002592 0.002864 -0.025930 True
3 0.034309 0.022692 -0.009362 False
4 -0.002592 -0.031991 -0.046641 False

18 Training Dataset: Features

More formally, we can define a function ϕ : X → Rp that takes an input x(i) ∈ X and outputs a
p-dimensional vector  
ϕ(x(i) )1
ϕ(x(i) )2 
 
ϕ(x(i) ) =  .. 
 . 
ϕ(x(i) )p
We say that ϕ(x(i) ) is a featurized input, and each ϕ(x(i) )j is a feature.

19 Features vs Attributes

In practice, the terms attribute and features are often used interchangeably. Most authors refer to
x(i) as a vector of features (i.e., they’ve been precomputed).
We will follow this convention and use attribute only when there is ambiguity between features and
attributes.

20 Features: Discrete vs. Continuous

Features can be either discrete or continuous. We will see later that they may be handled differently
by ML algorthims.
The BMI feature that we have seen earlier is an example of a continuous feature.
We can visualize its distribution.
[14]: diabetes_X.loc[:, 'bmi'].hist()

[14]: <AxesSubplot:>

11
Other features take on one of a finite number of discrete values. The sex column is an example of
a categorical feature.
In this example, the dataset has been pre-processed such that the two values happen to be
0.05068012 and -0.04464164.
[15]: print(diabetes_X.loc[:, 'sex'].unique())
diabetes_X.loc[:, 'sex'].hist()

[ 0.05068012 -0.04464164]

[15]: <AxesSubplot:>

21 Training Dataset: Targets

For each patient, we are interested in predicting a quantity of interest, the target. In our example,
this is the patient’s diabetes risk.

12
Formally, when (x(i) , y (i) ) form a training example, each y (i) ∈ Y is a target. We call Y the target
space.
We plot the distirbution of risk scores below.
[16]: plt.xlabel('Diabetes risk score')
plt.ylabel('Number of patients')
diabetes_y.hist()

[16]: <AxesSubplot:xlabel='Diabetes risk score', ylabel='Number of patients'>

22 Targets: Regression vs. Classification

We distinguish between two broad types of supervised learning problems that differ in the form of
the target variable.
1. Regression: The target variable y is continuous. We are fitting a curve in a high-dimensional
feature space that approximates the shape of the dataset.
2. Classification: The target variable y is discrete. Each discrete value corresponds to a class
and we are looking for a hyperplane that separates the different classes.
We can easily turn our earlier regression example into classification by discretizing the diabetes risk
scores into high or low.
[17]: # Discretize the targets
diabetes_y_train_discr = np.digitize(diabetes_y_train, bins=[150])

# Visualize it
plt.scatter(diabetes_X_train[diabetes_y_train_discr==0],␣
,→diabetes_y_train[diabetes_y_train_discr==0], marker='o', s=80,␣

,→facecolors='none', edgecolors='g')

13
plt.scatter(diabetes_X_train[diabetes_y_train_discr==1],␣
,→diabetes_y_train[diabetes_y_train_discr==1], marker='o', s=80,␣

,→facecolors='none', edgecolors='r')

plt.legend(['Low-Risk Patients', 'High-Risk Patients'])

[17]: <matplotlib.legend.Legend at 0x125ffc240>

Let’s try to generate predictions for this dataset.

[18]: # Create logistic regression object (note: this is actually a classification␣
,→algorithm!)

clf = linear_model.LogisticRegression()

# Train the model using the training sets

clf.fit(diabetes_X_train, diabetes_y_train_discr)

# Make predictions on the training set

diabetes_y_train_pred = clf.predict( )

# Visualize it
plt.scatter(diabetes_X_train[diabetes_y_train_discr==0],␣
,→diabetes_y_train[diabetes_y_train_discr==0], marker='o', s=140,␣

,→facecolors='none', edgecolors='g')

plt.scatter(diabetes_X_train[diabetes_y_train_discr==1],␣
,→diabetes_y_train[diabetes_y_train_discr==1], marker='o', s=140,␣

,→facecolors='none', edgecolors='r')

plt.scatter(diabetes_X_train[diabetes_y_train_pred==0],␣
,→diabetes_y_train[diabetes_y_train_pred==0], color='g', s=20)

plt.scatter(diabetes_X_train[diabetes_y_train_pred==1],␣
,→diabetes_y_train[diabetes_y_train_pred==1], color='r', s=20)

plt.legend(['Low-Risk Patients', 'High-Risk Patients', 'Low-Risk Predictions',␣

,→'High-Risk Predictions'])

14
[18]: <matplotlib.legend.Legend at 0x11847d320>

# Part 3: Anatomy of a Supervised Learning Problem: Learning Algorithm

Let’s now look at what a general supervised learning algorithm looks like.

23 Recall: Three Components of A Supervised Machine Learning

Problem

At a high level, a supervised machine learning problem has the following structure:

Dataset + Algorithm → Predictive Model

The predictive model is chosen to model the relationship between inputs and targets. For instance,
it can predict future targets.

24 The Components of A Supervised Machine Learning Algorithm

We can also define the high-level structure of a supervised learning algorithm as consisting of three
components: * A model class: the set of possible models we consider. * An objective function,
which defines how good a model is. * An optimizer, which finds the best predictive model in the
model class according to the objective function
Let’s look again at our diabetes dataset for an example.
[19]: import numpy as np
import pandas as pd
from sklearn import datasets
import matplotlib.pyplot as plt
plt.rcParams['figure.figsize'] = [12, 4]

15
# Load the diabetes dataset
diabetes = datasets.load_diabetes(as_frame=True)
diabetes_X, diabetes_y = diabetes.data, diabetes.target

# Print part of the dataset

diabetes_X.head()

[19]: age sex bmi bp s1 s2 s3 \

s4 s5 s6
0 -0.002592 0.019908 -0.017646
1 -0.039493 -0.068330 -0.092204
2 -0.002592 0.002864 -0.025930
3 0.034309 0.022692 -0.009362
4 -0.002592 -0.031991 -0.046641

25 Model: Notation

We’ll say that a model is a function

f :X →Y
that maps inputs x ∈ X to targets y ∈ Y.
Often, models have parameters θ ∈ Θ living in a set Θ. We will then write the model as

fθ : X → Y

to denote that it’s parametrized by θ.

26 Model Class: Notation

Formally, the model class is a set

M ⊆ {f | f : X → Y}
of possible models that map input features to targets.
When the models fθ are paremetrized by parameters θ ∈ Θ living in some set Θ. Thus we can also
write
M = {fθ | f : X → Y; θ ∈ Θ}.

16
27 Model Class: Example

One simple approach is to assume that x and y are related by a linear model of the form

y = θ0 + θ1 · x1 + θ2 · x2 + ... + θd · xd

where x is a featurized output and y is the target.

The θj are the parameters of the model.

[54]: # Collect 20 data points for training

diabetes_X_train = diabetes_X.iloc[-20:]
diabetes_y_train = diabetes_y.iloc[-20:]

# Create linear regression object

regr = linear_model.LinearRegression()

# Train the model using the training sets

regr.fit(diabetes_X_train, diabetes_y_train.values)

# Make predictions on the training set

diabetes_y_train_pred = regr.predict(diabetes_X_train)

# Collect 3 data points for testing

diabetes_X_test = diabetes_X.iloc[:3]
diabetes_y_test = diabetes_y.iloc[:3]

# generate predictions on the new patients

diabetes_y_test_pred = regr.predict(diabetes_X_test)

[55]: # visualize the results

plt.xlabel('Body Mass Index (BMI)')
plt.ylabel('Diabetes Risk')
plt.scatter(diabetes_X_train.loc[:, ['bmi']], diabetes_y_train)
plt.scatter(diabetes_X_test.loc[:, ['bmi']], diabetes_y_test, color='red',␣
,→marker='o')

# plt.scatter(diabetes_X_train.loc[:, ['bmi']], diabetes_y_train_pred,␣

,→color='black', linewidth=1)

plt.plot(diabetes_X_test.loc[:, ['bmi']], diabetes_y_test_pred, 'x',␣

,→color='red', mew=3, markersize=8)

plt.legend(['Model', 'Prediction', 'Initial patients', 'New patients'])

[55]: <matplotlib.legend.Legend at 0x12f6a46a0>

17
28 Objectives: Notation

To capture this intuition, we define an objective function (also called a loss function)

J(f ) : M → [0, ∞),

which describes the extent to which f “fits” the data D = {(x(i) , y (i) ) | i = 1, 2, ..., n}.
When f is parametrized by θ ∈ Θ, the objective becomes a function J(θ) : Θ → [0, ∞).

29 Objective: Examples

What would are some possible objective functions? We will see many, but here are a few examples:
* Mean squared error:
1 ∑( )2
n
J(θ) = fθ (x(i) ) − y (i)
2n
i=1

* Absolute (L1) error:

1 ∑
n

J(θ) = fθ (x(i) ) − y (i)
n
i=1

These are defined for a dataset D = {(x(i) , y (i) ) | i = 1, 2, ..., n}.

[60]: from sklearn.metrics import mean_squared_error, mean_absolute_error

y1 = np.array([1, 2, 3, 4])
y2 = np.array([-1, 1, 3, 5])

print('Mean squared error: %.2f' % mean_squared_error(y1, y2))

print('Mean absolute error: %.2f' % mean_absolute_error(y1, y2))

18
Mean squared error: 1.50
Mean absolute error: 1.00

30 Optimizer: Notation

At a high-level an optimizer takes an objective J and a model class M and finds a model f ∈ M
with the smallest value of the objective J.

min J(f )
f ∈M

Intuitively, this is the function that bests “fits” the data on the training dataset.
When f is parametrized by θ ∈ Θ, the optimizer minimizes a function J(θ) over all θ ∈ Θ.

31 Optimizer: Example

We will see that behind the scenes, the sklearn.linear_models.LinearRegression algorithm

optimizes the MSE loss.

1 ∑( )2
n
min fθ (x(i) ) − y (i)
θ∈R 2n
i=1

We can easily measure the quality of the fit on the training set and the test set.
[59]: from sklearn.metrics import mean_squared_error

print('Training set mean squared error: %.2f'

% mean_squared_error(diabetes_y_train, diabetes_y_train_pred))
print('Test set mean squared error: %.2f'
% mean_squared_error(diabetes_y_test, diabetes_y_test_pred))
print('Test set mean squared error on random inputs: %.2f'
% mean_squared_error(diabetes_y_test, np.random.
,→randn(*diabetes_y_test_pred.shape)))

Training set mean squared error: 1118.22

Test set mean squared error: 667.81
Test set mean squared error on random inputs: 15887.97

32 Summary: Components of A Supervised Machine Learning

Problem

At a high level, a supervised machine learning problem has the following structure:

19
Dataset + Algorithm → Predictive Model
| {z }
Model Class + Objective + Optimizer

The predictive model is chosen to model the relationship between inputs and targets. For instance,
it can predict future targets.

33 Notation: Feature Matrix

Suppose that we have a dataset of size n (e.g., n patients), indexed by i = 1, 2, ..., n. Each x(i) is a
vector of d features.

Feature Matrix Machine learning algorithms are most easily defined in the language of linear
algebra. Therefore, it will be useful to represent the entire dataset as one matrix X ∈ Rn×d , of the
form:  (1) 
(2) (n)
x1 x1 . . . x1
 (1) (2) (n) 
x2 x2 . . . x2 
X= .  .
. 
 . 
(1) (2) (n)
xd xd . . . xd

Similarly, we can vectorize the target variables into a vector y ∈ Rn of the form
 (1) 
x
 x(2) 
 
y =  . .
.
 . 
x(n)

[ ]:

Enterprise Artificial Intelligence and Machine Learning For Managers
100% (2)
Enterprise Artificial Intelligence and Machine Learning For Managers
97 pages
Machine Learning For Humans
100% (4)
Machine Learning For Humans
97 pages
Lecture2-Supervised-Learning Slides
No ratings yet
Lecture2-Supervised-Learning Slides
56 pages
IPL Winning Prediction Intern Report
No ratings yet
IPL Winning Prediction Intern Report
52 pages
23UCC554
No ratings yet
23UCC554
9 pages
Regression
No ratings yet
Regression
1 page
G26 Report
No ratings yet
G26 Report
4 pages
Estimating Diabetic Risk Accurately
No ratings yet
Estimating Diabetic Risk Accurately
26 pages
Diabetes - Test Report
No ratings yet
Diabetes - Test Report
62 pages
20BCE7620 AP2021228000397 Experiment-6 Removed
No ratings yet
20BCE7620 AP2021228000397 Experiment-6 Removed
19 pages
Diabetes Prediction - ML
No ratings yet
Diabetes Prediction - ML
29 pages
Sse 25 21 114-2
No ratings yet
Sse 25 21 114-2
13 pages
Diabetic Prediction Using LogicalRegression
No ratings yet
Diabetic Prediction Using LogicalRegression
9 pages
Dundee June 18
No ratings yet
Dundee June 18
56 pages
Internshippppp Fimnalllll
No ratings yet
Internshippppp Fimnalllll
16 pages
Machine Learning
100% (1)
Machine Learning
21 pages
Classifier Model For Diabetes Prediction
No ratings yet
Classifier Model For Diabetes Prediction
30 pages
Final
No ratings yet
Final
44 pages
Dataset
No ratings yet
Dataset
13 pages
Sse 25 21 114-1
No ratings yet
Sse 25 21 114-1
14 pages
Deep Learning
No ratings yet
Deep Learning
41 pages
Independent Project
No ratings yet
Independent Project
10 pages
MLPPT 11 45
No ratings yet
MLPPT 11 45
31 pages
Diabetes Prediction Using Logistic Regression - Untitled - Ipynb at Main Prajwal10031999 - Diabetes Prediction Using Logistic Regression GitHub
No ratings yet
Diabetes Prediction Using Logistic Regression - Untitled - Ipynb at Main Prajwal10031999 - Diabetes Prediction Using Logistic Regression GitHub
8 pages
Week 04 Logistic Regression
No ratings yet
Week 04 Logistic Regression
5 pages
A Comparative Analysis Using Machine Learning Algorithm On
No ratings yet
A Comparative Analysis Using Machine Learning Algorithm On
19 pages
241410
No ratings yet
241410
10 pages
Ext 74513
No ratings yet
Ext 74513
10 pages
Proposal
No ratings yet
Proposal
21 pages
Diabetes Classification Report
No ratings yet
Diabetes Classification Report
17 pages
Sse 25 21 114-3
No ratings yet
Sse 25 21 114-3
13 pages
Import As From Import From Import From Import From Import From Import From Import From Import From Import From Import From Import Import As
No ratings yet
Import As From Import From Import From Import From Import From Import From Import From Import From Import From Import From Import Import As
8 pages
c20 Final Final
No ratings yet
c20 Final Final
21 pages
Diabe PDF
No ratings yet
Diabe PDF
11 pages
Unit5 - Logistic Regression
No ratings yet
Unit5 - Logistic Regression
4 pages
Report - SVM
No ratings yet
Report - SVM
13 pages
Projectreport Diabetes Prediction
No ratings yet
Projectreport Diabetes Prediction
22 pages
14-Improving SVM Performance For Type II Diabetes Prediction With An Improved Non-Linear Kernel Insights From The PIMA Dataset
No ratings yet
14-Improving SVM Performance For Type II Diabetes Prediction With An Improved Non-Linear Kernel Insights From The PIMA Dataset
9 pages
Introduction To Regression: George Boorman
No ratings yet
Introduction To Regression: George Boorman
50 pages
Chapter 2
No ratings yet
Chapter 2
50 pages
Documentation Code
No ratings yet
Documentation Code
20 pages
Slides (A12 A14)
No ratings yet
Slides (A12 A14)
353 pages
Supervised Learning With Scikit-Learn
No ratings yet
Supervised Learning With Scikit-Learn
178 pages
IEEE Paper 1
No ratings yet
IEEE Paper 1
5 pages
DIABETES
No ratings yet
DIABETES
17 pages
chapter2
No ratings yet
chapter2
50 pages
Binod ML Project-052
No ratings yet
Binod ML Project-052
14 pages
Irjet V6i3277
No ratings yet
Irjet V6i3277
7 pages
Diabetes Prediction Using Machine Learning KNN - Algorithm Technique
No ratings yet
Diabetes Prediction Using Machine Learning KNN - Algorithm Technique
4 pages
GENAI-CAPSTONE_2 (1)
No ratings yet
GENAI-CAPSTONE_2 (1)
2 pages
Machine Learning in Health
No ratings yet
Machine Learning in Health
13 pages
Exp 5
No ratings yet
Exp 5
7 pages
ML Lab Record
No ratings yet
ML Lab Record
15 pages
Slide Presetatio
No ratings yet
Slide Presetatio
30 pages
5 Efficient Machine Learning Models for the Accurate Prediction of Diabetes
No ratings yet
5 Efficient Machine Learning Models for the Accurate Prediction of Diabetes
5 pages
Ek125 Final Project
No ratings yet
Ek125 Final Project
13 pages
Presentation - Yussup Tumgoyev
No ratings yet
Presentation - Yussup Tumgoyev
128 pages
Artificial Intelligence Approaches For Predicting Diabetes in Egypt
No ratings yet
Artificial Intelligence Approaches For Predicting Diabetes in Egypt
19 pages
BI Miniproject Report (Diabetes)
No ratings yet
BI Miniproject Report (Diabetes)
18 pages
Predicting Diabetes Mellitus in Healthcare: A Comparative Analysis of Machine Learning Algorithms On Big Dataset
No ratings yet
Predicting Diabetes Mellitus in Healthcare: A Comparative Analysis of Machine Learning Algorithms On Big Dataset
12 pages
Introduction To Business Statistics Through R Software: Software
From Everand
Introduction To Business Statistics Through R Software: Software
Editor IJSMI
No ratings yet
Chi Squared for Beginners
From Everand
Chi Squared for Beginners
Stephanie Glen
No ratings yet
Lecture4 Foundations Supervised Learning
No ratings yet
Lecture4 Foundations Supervised Learning
22 pages
1 Lecture 1: Introduction To Machine Learning
No ratings yet
1 Lecture 1: Introduction To Machine Learning
12 pages
1 Lecture 5b: Probabilistic Perspectives On ML Algorithms
No ratings yet
1 Lecture 5b: Probabilistic Perspectives On ML Algorithms
6 pages
1 Lecture 3: Optimization and Linear Regression
No ratings yet
1 Lecture 3: Optimization and Linear Regression
27 pages
Data Science With Python Class Room Notes Qulaity Thought
100% (2)
Data Science With Python Class Room Notes Qulaity Thought
489 pages
ML 5units
No ratings yet
ML 5units
284 pages
Ai&ml Question Bank Answers
No ratings yet
Ai&ml Question Bank Answers
26 pages
AI - Module-III (Introduction To ML)
No ratings yet
AI - Module-III (Introduction To ML)
20 pages
CH 1 Notes FOML
No ratings yet
CH 1 Notes FOML
10 pages
Week 3: Introduction To Sensors' Science and Technology
100% (1)
Week 3: Introduction To Sensors' Science and Technology
14 pages
Machine Learning Algorithms Applications and Practices in Data Science PDF
No ratings yet
Machine Learning Algorithms Applications and Practices in Data Science PDF
113 pages
MLF Lec01
No ratings yet
MLF Lec01
23 pages
Python - Stdin, Stdout, and Stderr
No ratings yet
Python - Stdin, Stdout, and Stderr
20 pages
AI Documents
No ratings yet
AI Documents
25 pages
Ijsrp p8252
No ratings yet
Ijsrp p8252
6 pages
Network Intrusion Detection Using Supervised Machine Learnin (3) )
No ratings yet
Network Intrusion Detection Using Supervised Machine Learnin (3) )
24 pages
Research Trends in Machine Learning: Muhammad Kashif Hanif
No ratings yet
Research Trends in Machine Learning: Muhammad Kashif Hanif
80 pages
Data Science 100 MCQs
No ratings yet
Data Science 100 MCQs
16 pages
W1M1-Intro To ML
No ratings yet
W1M1-Intro To ML
19 pages
AI in Human-Computer Gaming: Techniques, Challenges and Opportunities
No ratings yet
AI in Human-Computer Gaming: Techniques, Challenges and Opportunities
14 pages
5-Supervised and Unsupervised
No ratings yet
5-Supervised and Unsupervised
7 pages
Labook DA
No ratings yet
Labook DA
59 pages
4.introduction To Learning - Unit 2
No ratings yet
4.introduction To Learning - Unit 2
8 pages
WINSEM2023-24 BITE410L TH VL2023240503970 2024-03-06 Reference-Material-I
No ratings yet
WINSEM2023-24 BITE410L TH VL2023240503970 2024-03-06 Reference-Material-I
21 pages
1machine Learning Based Intelligent Career Counselling Chatbot ICCC
No ratings yet
1machine Learning Based Intelligent Career Counselling Chatbot ICCC
8 pages
Machine Learning and Artificial Intelligence Techniques in-2025
No ratings yet
Machine Learning and Artificial Intelligence Techniques in-2025
33 pages
ML 1
No ratings yet
ML 1
35 pages
Old Meets New: Integrating Artificial Intelligence in Museums' Management Practices
No ratings yet
Old Meets New: Integrating Artificial Intelligence in Museums' Management Practices
15 pages
Artificial Intelligence Interview Questions
100% (1)
Artificial Intelligence Interview Questions
28 pages
Machine Learning - Its Types
No ratings yet
Machine Learning - Its Types
8 pages
Machine Learning Use Case in Indian Agriculture Predictive Analysis of Bihar Agriculture Data To Forecast Crop Yield
No ratings yet
Machine Learning Use Case in Indian Agriculture Predictive Analysis of Bihar Agriculture Data To Forecast Crop Yield
8 pages
Expert Systems - Merged
No ratings yet
Expert Systems - Merged
39 pages