
Machine learning

Contents:
 ML getting started
 ML mean median mode
 ML standard deviation
 ML percentiles
 ML data distribution
 ML normal data distribution
 ML scatter plot
 ML linear regression
 ML polynomial regression
 ML multiple regression
 ML scale
 ML train/test
 ML decision tree
 ML confusion matrix
 ML hierarchical clustering
ML getting started

 Machine Learning is making the computer learn from studying data and statistics.
 Machine Learning is a step into the direction of artificial intelligence (AI).
 Machine Learning is a program that analyses data and learns to predict the outcome.

Where To Start?
In this tutorial we will go back to mathematics and study statistics, and how to calculate important numbers based on data sets.

We will also learn how to use various Python modules to get the answers we need.

And we will learn how to make functions that are able to predict the outcome based on what we have learned.

Data Set
In the mind of a computer, a data set is any collection of data. It can be anything from an array to a complete database.

Example of an array:

[99,86,87,88,111,86,103,87,94,78,77,85,86]

Example of a database (a table of registered cars, with values like color, age, and whether the car has an AutoPass):

By looking at the array, we can guess that the average value is probably around 80 or 90, and we are also able to determine the highest value and the lowest value, but what else can we do?

And by looking at the database we can see that the most popular color is white, and the oldest car is 17 years, but what if we could predict if a car had an AutoPass, just by looking at the other values?

That is what Machine Learning is for! Analyzing data and predicting the outcome!

In Machine Learning it is common to work with very large data sets. In this tutorial we will try to make it as easy as possible to understand the different concepts of machine learning, and we will work with small easy-to-understand data sets.

Data Types
To analyze data, it is important to know what type of data we
are dealing with.

We can split the data types into three main categories:

 Numerical
 Categorical
 Ordinal

 Numerical data are numbers, and can be split into two numerical categories:
Discrete Data - numbers that are limited to integers. Example: the number of cars passing by.
Continuous Data - numbers that can take any value within a range. Example: the price of an item, or the size of an item.
 Categorical data are values that cannot be measured up against each other. Example: a color value, or any yes/no values.
 Ordinal data are like categorical data, but can be measured up against each other. Example: school grades, where A is better than B and so on.

By knowing the data type of your data source, you will know which technique to use when analyzing it.
You will learn more about statistics and analyzing data in the
next chapters.

ML Mean Median Mode

Mean, Median, and Mode

What can we learn from looking at a group of numbers?

In Machine Learning (and in mathematics) there are often three values that interest us:

 Mean - The average value
 Median - The midpoint value
 Mode - The most common value

Example: We have registered the speed of 13 cars:

speed = [99,86,87,88,111,86,103,87,94,78,77,85,86]

What is the average, the middle, or the most common speed value?

Mean
The mean value is the average value.

To calculate the mean, find the sum of all values, and divide the sum by the number of values:

(99+86+87+88+111+86+103+87+94+78+77+85+86) / 13 = 89.77

Use the NumPy mean() method to find the average speed:

import numpy

speed = [99,86,87,88,111,86,103,87,94,78,77,85,86]
x = numpy.mean(speed)

print(x)

Median
The median value is the value in the middle, after you have sorted all the values:

77, 78, 85, 86, 86, 86, 87, 87, 88, 94, 99, 103, 111

It is important that the numbers are sorted before you can find the median.

The NumPy module has a method for this.
Use the NumPy median() method to find the middle value:

import numpy

speed = [99,86,87,88,111,86,103,87,94,78,77,85,86]
x = numpy.median(speed)

print(x)

If there are two numbers in the middle, divide the sum of those numbers by two:

77, 78, 85, 86, 86, 86, 87, 87, 88, 94, 99, 103

(86 + 87) / 2 = 86.5
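
NumPy handles the even-length case automatically. A small sketch using the 12-value list above (the original array with one value removed):

import numpy

speed = [99,86,87,88,86,103,87,94,78,77,85,86]
x = numpy.median(speed)

print(x)   #86.5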

Mode
The Mode value is the value that appears the most number of
times:

99, 86, 87, 88, 111, 86, 103, 87, 94, 78, 77, 85, 86 = 86

The SciPy module has a method for this. Learn about the SciPy
module in our SciPy Tutorial.
Use the SciPy mode() method to find the number that appears
the most:

from scipy import stats

speed = [99,86,87,88,111,86,103,87,94,78,77,85,86]
x = stats.mode(speed)

print(x)

ML Standard Deviation

What is Standard Deviation?

Standard deviation is a number that describes how spread out the values are.

A low standard deviation means that most of the numbers are close to the mean (average) value.

A high standard deviation means that the values are spread out over a wider range.

Example: This time we have registered the speed of 7 cars:

speed = [86,87,88,86,87,85,86]

The standard deviation is:

0.9

Meaning that most of the values are within the range of 0.9
from the mean value, which is 86.4.

Let us do the same with a selection of numbers with a wider
range:

speed = [32,111,138,28,59,77,97]

The standard deviation is:

37.85

Meaning that most of the values are within the range of 37.85
from the mean value, which is 77.4.
As you can see, a higher standard deviation indicates that the
values are spread out over a wider range.
The NumPy module has a method to calculate the standard
deviation:
Use the NumPy std() method to find the standard deviation:

import numpy
speed = [86,87,88,86,87,85,86]
x = numpy.std(speed)
print(x)
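
To see where the number comes from, the standard deviation can also be computed by hand: take each value's squared distance from the mean, average those, and take the square root of the result. A small sketch that should match numpy.std():

import math

speed = [86,87,88,86,87,85,86]
mean = sum(speed) / len(speed)
variance = sum((v - mean) ** 2 for v in speed) / len(speed)

print(math.sqrt(variance))   #about 0.9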

ML Percentiles

What are Percentiles?

Percentiles are used in statistics to give you a number that describes the value that a given percent of the values are lower than.

Example: Let's say we have an array of the ages of all the people that live in a street:

ages = [5,31,43,48,50,41,7,11,15,39,80,82,32,2,8,6,25,36,27,61,31]

What is the 75th percentile? The answer is 43, meaning that 75% of the people are 43 or younger.
Use the NumPy percentile() method to find the percentiles:

import numpy
ages = [5,31,43,48,50,41,7,11,15,39,80,82,32,2,8,6,25,36,27,61,31]

x = numpy.percentile(ages, 75)
print(x)
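
Other percentiles work the same way. For example, the 90th percentile of the same ages array:

print(numpy.percentile(ages, 90))   #61.0, meaning 90% of the people are 61 or younger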

ML data distribution

Data Distribution
Earlier in this tutorial we have worked with very small amounts
of data in our examples, just to understand the different
concepts.

In the real world, the data sets are much bigger, but it can be
difficult to gather real world data, at least at an early stage of a
project.

How Can we Get Big Data Sets?

To create big data sets for testing, we use the Python module NumPy, which comes with a number of methods to create random data sets of any size.

Create an array containing 250 random floats between 0 and 5:

import numpy

x = numpy.random.uniform(0.0, 5.0, 250)

print(x)

Histogram
To visualize the data set we can draw a histogram with the data
we collected.
We will use the Python module Matplotlib to draw a histogram.
Learn about the Matplotlib module in our Matplotlib Tutorial.
Draw a histogram:

import numpy
import matplotlib.pyplot as plt

x = numpy.random.uniform(0.0, 5.0, 250)

plt.hist(x, 5)
plt.show()

Histogram Explained
We use the array from the example above to draw a histogram
with 5 bars.

The first bar represents how many values in the array are between 0 and 1. The second bar represents how many values are between 1 and 2. And so on.

Which gives us a result like this (the exact counts vary, since the values are random):

 52 values are between 0 and 1
 48 values are between 1 and 2
 49 values are between 2 and 3
 51 values are between 3 and 4
 50 values are between 4 and 5

ML Normal Data Distribution

Normal Data Distribution

In the previous chapter we learned how to create a completely random array of a given size, and between two given values.

In this chapter we will learn how to create an array where the values are concentrated around a given value.

In probability theory this kind of data distribution is known as the normal data distribution, or the Gaussian data distribution, after the mathematician Carl Friedrich Gauss who came up with the formula of this data distribution.
A typical normal data distribution:

import numpy
import matplotlib.pyplot as plt

x = numpy.random.normal(5.0, 1.0, 100000)

plt.hist(x, 100)
plt.show()

Note: A normal distribution graph is also known as the bell curve because of its characteristic bell shape.

Histogram Explained

 We use the array from the numpy.random.normal() method, with 100000 values, to draw a histogram with 100 bars.
 We specify that the mean value is 5.0, and the standard deviation is 1.0.
 Meaning that the values should be concentrated around 5.0, and rarely further away than 1.0 from the mean.
 And as you can see from the histogram, most values are between 4.0 and 6.0, with a top at approximately 5.0.

ML Scatter Plot

Scatter Plot
A scatter plot is a diagram where each value in the data set is
represented by a dot.

The Matplotlib module has a method for drawing scatter plots. It needs two arrays of the same length, one for the values of the x-axis, and one for the values of the y-axis:

x = [5,7,8,7,2,17,2,9,4,11,12,9,6]
y = [99,86,87,88,111,86,103,87,94,78,77,85,86]

 The x array represents the age of each car.
 The y array represents the speed of each car.

Use the scatter() method to draw a scatter plot diagram:

import matplotlib.pyplot as plt

x = [5,7,8,7,2,17,2,9,4,11,12,9,6]
y = [99,86,87,88,111,86,103,87,94,78,77,85,86]

plt.scatter(x, y)
plt.show()

Scatter Plot Explained

The x-axis represents ages, and the y-axis represents speeds.

What we can read from the diagram is that the two fastest cars were both 2 years old, and the slowest car was 12 years old.

Note: It seems that the newer the car, the faster it drives, but that could be a coincidence; after all, we only registered 13 cars.

Random Data Distributions
In Machine Learning the data sets can contain thousands, or even millions, of values.

You might not have real-world data when you are testing an algorithm; you might have to use randomly generated values.

As we have learned in the previous chapter, the NumPy module can help us with that!
Let us create two arrays that are both filled with 1000 random
numbers from a normal data distribution.

 The first array will have the mean set to 5.0 with a
standard deviation of 1.0.
 The second array will have the mean set to 10.0 with a
standard deviation of 2.0:

A scatter plot with 1000 dots:

import numpy
import matplotlib.pyplot as plt

x = numpy.random.normal(5.0, 1.0, 1000)
y = numpy.random.normal(10.0, 2.0, 1000)

plt.scatter(x, y)
plt.show()

ML linear regression

Regression

The term regression is used when you try to find the relationship between variables.

In Machine Learning, and in statistical modeling, that relationship is used to predict the outcome of future events.

Linear Regression

Linear regression uses the relationship between the data points to draw a straight line through all of them.

This line can be used to predict future values.

How Does it Work?
Python has methods for finding a relationship between data points and to draw a line of linear regression. We will show you how to use these methods instead of going through the mathematical formula.

In the example below, the x-axis represents age, and the y-axis
represents speed. We have registered the age and speed of 13
cars as they were passing a tollbooth. Let us see if the data we
collected could be used in a linear regression:
Import scipy and draw the line of Linear Regression:

import matplotlib.pyplot as plt
from scipy import stats

x = [5,7,8,7,2,17,2,9,4,11,12,9,6]
y = [99,86,87,88,111,86,103,87,94,78,77,85,86]

slope, intercept, r, p, std_err = stats.linregress(x, y)

def myfunc(x):
  return slope * x + intercept

mymodel = list(map(myfunc, x))

plt.scatter(x, y)
plt.plot(x, mymodel)
plt.show()

Example Explained
Import the modules you need.

import matplotlib.pyplot as plt
from scipy import stats

Create the arrays that represent the values of the x and y axis:

x = [5,7,8,7,2,17,2,9,4,11,12,9,6]
y = [99,86,87,88,111,86,103,87,94,78,77,85,86]

Execute a method that returns some important key values of Linear Regression:

slope, intercept, r, p, std_err = stats.linregress(x, y)

Create a function that uses the slope and intercept values to return a new value. This new value represents where on the y-axis the corresponding x value will be placed:

def myfunc(x):
  return slope * x + intercept

Run each value of the x array through the function. This will
result in a new array with new values for the y-axis:

mymodel = list(map(myfunc, x))

Draw the original scatter plot:

plt.scatter(x, y)

Draw the line of linear regression:

plt.plot(x, mymodel)

Display the diagram:

plt.show()

R for Relationship
It is important to know what the relationship between the values of the x-axis and the values of the y-axis is; if there is no relationship, linear regression can not be used to predict anything.

 This relationship - the coefficient of correlation - is called r.
 The r value ranges from -1 to 1, where 0 means no relationship, and 1 (and -1) means 100% related.
 Python and the SciPy module will compute this value for you; all you have to do is feed it with the x and y values.

How well does my data fit in a linear regression?

from scipy import stats

x = [5,7,8,7,2,17,2,9,4,11,12,9,6]
y = [99,86,87,88,111,86,103,87,94,78,77,85,86]

slope, intercept, r, p, std_err = stats.linregress(x, y)
print(r)

Note: The result -0.76 shows that there is a relationship, not perfect, but it indicates that we could use linear regression in future predictions.

Predict Future Values

Now we can use the information we have gathered to predict future values.

Example: Let us try to predict the speed of a 10 year old car.

To do so, we need the same myfunc() function from the example above:

def myfunc(x):
  return slope * x + intercept

Predict the speed of a 10 year old car:

from scipy import stats

x = [5,7,8,7,2,17,2,9,4,11,12,9,6]
y = [99,86,87,88,111,86,103,87,94,78,77,85,86]

slope, intercept, r, p, std_err = stats.linregress(x, y)

def myfunc(x):
  return slope * x + intercept

speed = myfunc(10)

print(speed)

The example predicted a speed of 85.6, which we also could read from the diagram.

Bad Fit?
Let us create an example where linear regression would not be
the best method to predict future values.
These values for the x- and y-axis should result in a very bad fit
for linear regression:

import matplotlib.pyplot as plt
from scipy import stats

x = [89,43,36,36,95,10,66,34,38,20,26,29,48,64,6,5,36,66,72,40]
y = [21,46,3,35,67,95,53,72,58,10,26,34,90,33,38,20,56,2,47,15]

slope, intercept, r, p, std_err = stats.linregress(x, y)

def myfunc(x):
  return slope * x + intercept

mymodel = list(map(myfunc, x))

plt.scatter(x, y)
plt.plot(x, mymodel)
plt.show()

And the r for relationship?

You should get a very low r value.

from scipy import stats

x = [89,43,36,36,95,10,66,34,38,20,26,29,48,64,6,5,36,66,72,40]
y = [21,46,3,35,67,95,53,72,58,10,26,34,90,33,38,20,56,2,47,15]

slope, intercept, r, p, std_err = stats.linregress(x, y)

print(r)

ML Polynomial Regression

Polynomial Regression
If your data points clearly will not fit a linear regression (a straight line through all data points), it might be ideal for polynomial regression.

Polynomial regression, like linear regression, uses the relationship between the variables x and y to find the best way to draw a line through the data points.

How Does it Work?

Python has methods for finding a relationship between data points and to draw a line of polynomial regression. We will show you how to use these methods instead of going through the mathematical formula.

In the example below, we have registered 18 cars as they were passing a certain tollbooth.

We have registered the car's speed, and the time of day (hour) the passing occurred.

The x-axis represents the hours of the day and the y-axis represents the speed:
Start by drawing a scatter plot:

import matplotlib.pyplot as plt

x = [1,2,3,5,6,7,8,9,10,12,13,14,15,16,18,19,21,22]
y = [100,90,80,60,60,55,60,65,70,70,75,76,78,79,90,99,99,100]

plt.scatter(x, y)
plt.show()

Import numpy and matplotlib then draw the line of Polynomial Regression:

import numpy
import matplotlib.pyplot as plt

x = [1,2,3,5,6,7,8,9,10,12,13,14,15,16,18,19,21,22]
y = [100,90,80,60,60,55,60,65,70,70,75,76,78,79,90,99,99,100]

mymodel = numpy.poly1d(numpy.polyfit(x, y, 3))

myline = numpy.linspace(1, 22, 100)

plt.scatter(x, y)
plt.plot(myline, mymodel(myline))
plt.show()

Example Explained
Import the modules you need.

import numpy
import matplotlib.pyplot as plt

Create the arrays that represent the values of the x and y axis:

x = [1,2,3,5,6,7,8,9,10,12,13,14,15,16,18,19,21,22]
y = [100,90,80,60,60,55,60,65,70,70,75,76,78,79,90,99,99,100]

NumPy has a method that lets us make a polynomial model:

mymodel = numpy.poly1d(numpy.polyfit(x, y, 3))

Then specify how the line will display; we start at position 1, and end at position 22:

myline = numpy.linspace(1, 22, 100)

Draw the original scatter plot:

plt.scatter(x, y)

Draw the line of polynomial regression:

plt.plot(myline, mymodel(myline))

Display the diagram:

plt.show()

R-Squared
It is important to know how good the relationship between the values of the x- and y-axis is; if there is no relationship, polynomial regression can not be used to predict anything.

 The relationship is measured with a value called the r-squared.
 The r-squared value ranges from 0 to 1, where 0 means no relationship, and 1 means 100% related.
 Python and the Sklearn module will compute this value for you; all you have to do is feed it with the x and y arrays:

How well does my data fit in a polynomial regression?

import numpy
from sklearn.metrics import r2_score

x = [1,2,3,5,6,7,8,9,10,12,13,14,15,16,18,19,21,22]
y = [100,90,80,60,60,55,60,65,70,70,75,76,78,79,90,99,99,100]

mymodel = numpy.poly1d(numpy.polyfit(x, y, 3))

print(r2_score(y, mymodel(x)))
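
Note: You should get an r-squared value close to 1, which indicates that this data set fits a polynomial regression very well.
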
Predict Future Values
Now we can use the information we have gathered to predict
future values.

Example: Let us try to predict the speed of a car that passes the tollbooth at around the time 17:00.

To do so, we need the same mymodel array from the example above:

mymodel = numpy.poly1d(numpy.polyfit(x, y, 3))

Predict the speed of a car passing at 17:00:

import numpy
from sklearn.metrics import r2_score

x = [1,2,3,5,6,7,8,9,10,12,13,14,15,16,18,19,21,22]
y = [100,90,80,60,60,55,60,65,70,70,75,76,78,79,90,99,99,100]

mymodel = numpy.poly1d(numpy.polyfit(x, y, 3))
speed = mymodel(17)
print(speed)

Bad Fit?
Let us create an example where polynomial regression would
not be the best method to predict future values.
These values for the x- and y-axis should result in a very bad fit
for polynomial regression:

import numpy
import matplotlib.pyplot as plt

x = [89,43,36,36,95,10,66,34,38,20,26,29,48,64,6,5,36,66,72,40]
y = [21,46,3,35,67,95,53,72,58,10,26,34,90,33,38,20,56,2,47,15]

mymodel = numpy.poly1d(numpy.polyfit(x, y, 3))

myline = numpy.linspace(2, 95, 100)

plt.scatter(x, y)
plt.plot(myline, mymodel(myline))
plt.show()

And the r-squared value?

You should get a very low r-squared value.

import numpy
from sklearn.metrics import r2_score

x = [89,43,36,36,95,10,66,34,38,20,26,29,48,64,6,5,36,66,72,40]
y = [21,46,3,35,67,95,53,72,58,10,26,34,90,33,38,20,56,2,47,15]

mymodel = numpy.poly1d(numpy.polyfit(x, y, 3))

print(r2_score(y, mymodel(x)))

ML multiple regression

Multiple Regression
Multiple regression is like linear regression, but with more than one independent value, meaning that we try to predict a value based on two or more variables.

We can predict the CO2 emission of a car based on the size of the engine, but with multiple regression we can throw in more variables, like the weight of the car, to make the prediction more accurate.

How Does it Work?

In Python we have modules that will do the work for us. Start by importing the Pandas module.

import pandas

Learn about the Pandas module in our Pandas Tutorial.

The Pandas module allows us to read csv files and return a DataFrame object:

df = pandas.read_csv("data.csv")

Then make a list of the independent values and call this variable X.

Put the dependent values in a variable called y.

X = df[['Weight', 'Volume']]
y = df['CO2']

Tip: It is common to name the list of independent values with an uppercase X, and the list of dependent values with a lowercase y.

We will use some methods from the sklearn module, so we will have to import that module as well:

from sklearn import linear_model

From the sklearn module we will use the LinearRegression() method to create a linear regression object.

This object has a method called fit() that takes the independent and dependent values as parameters and fills the regression object with data that describes the relationship:

regr = linear_model.LinearRegression()
regr.fit(X, y)

Now we have a regression object that is ready to predict CO2 values based on a car's weight and volume:

#predict the CO2 emission of a car where the weight is 2300kg, and the volume is 1300cm3:
predictedCO2 = regr.predict([[2300, 1300]])

See the whole example in action:

import pandas
from sklearn import linear_model

df = pandas.read_csv("data.csv")

X = df[['Weight', 'Volume']]
y = df['CO2']

regr = linear_model.LinearRegression()
regr.fit(X, y)

#predict the CO2 emission of a car where the weight is 2300kg, and the volume is 1300cm3:
predictedCO2 = regr.predict([[2300, 1300]])

print(predictedCO2)
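
With the data set used in this chapter, this prints a value of approximately:

[107.2087328]

We will come back to this number in the coefficient check below.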

Coefficient
The coefficient is a factor that describes the relationship with an unknown variable.

Example: if x is a variable, then 2x is x two times. x is the unknown variable, and the number 2 is the coefficient.

In this case, we can ask for the coefficient value of weight against CO2, and for volume against CO2. The answer(s) we get tell us what would happen if we increase, or decrease, one of the independent values.
Print the coefficient values of the regression object:

import pandas
from sklearn import linear_model

df = pandas.read_csv("data.csv")

X = df[['Weight', 'Volume']]
y = df['CO2']

regr = linear_model.LinearRegression()
regr.fit(X, y)

print(regr.coef_)

Result:
[0.00755095 0.00780526]

Result Explained
The result array represents the coefficient values of weight and
volume.

Weight: 0.00755095
Volume: 0.00780526

These values tell us that if the weight increases by 1kg, the CO2 emission increases by 0.00755095g.

And if the engine size (Volume) increases by 1cm3, the CO2 emission increases by 0.00780526g.

I think that is a fair guess, but let us test it!

We have already predicted that if a car with a 1300cm3 engine weighs 2300kg, the CO2 emission will be approximately 107g.

What if we increase the weight with 1000kg?

Copy the example from before, but change the weight from 2300 to 3300:

import pandas
from sklearn import linear_model

df = pandas.read_csv("data.csv")

X = df[['Weight', 'Volume']]
y = df['CO2']

regr = linear_model.LinearRegression()
regr.fit(X, y)

predictedCO2 = regr.predict([[3300, 1300]])

print(predictedCO2)

Result:
[114.75968007]

We have predicted that a car with a 1.3 liter engine, and a weight of 3300kg, will release approximately 115 grams of CO2 for every kilometer it drives.

Which shows that the coefficient of 0.00755095 is correct:

107.2087328 + (1000 * 0.00755095) = 114.75968

ML scale

Scale Features
When your data has different values, and even different measurement units, it can be difficult to compare them. What is kilograms compared to meters? Or altitude compared to time?

The answer to this problem is scaling. We can scale data into new values that are easier to compare.

Take a look at the table below; it is the same data set that we used in the multiple regression chapter, but this time the volume column contains values in liters instead of cm3 (1.0 instead of 1000).

It can be difficult to compare the volume 1.0 with the weight 790, but if we scale them both into comparable values, we can easily see how much one value is compared to the other.

There are different methods for scaling data; in this tutorial we will use a method called standardization.

The standardization method uses this formula:

z = (x - u) / s

Where z is the new value, x is the original value, u is the mean and s is the standard deviation.

If you take the weight column from the data set above, the first value is 790, and the scaled value will be:

(790 - 1292.23) / 238.74 = -2.1

If you take the volume column from the data set above, the first value is 1.0, and the scaled value will be:

(1.0 - 1.61) / 0.38 = -1.59

Now you can compare -2.1 with -1.59 instead of comparing 790 with 1.0.

You do not have to do this manually; the Python sklearn module has a method called StandardScaler() which returns a Scaler object with methods for transforming data sets.

Scale all values in the Weight and Volume columns:

import pandas
from sklearn import linear_model
from sklearn.preprocessing import StandardScaler
scale = StandardScaler()

df = pandas.read_csv("data.csv")

X = df[['Weight', 'Volume']]

scaledX = scale.fit_transform(X)

print(scaledX)
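
The first row of the printed scaledX array should be close to the two values we computed by hand above, approximately [-2.1, -1.59].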

Predict CO2 Values

The task in the Multiple Regression chapter was to predict the CO2 emission from a car when you only knew its weight and volume.

When the data set is scaled, you will have to use the scale when you predict values:

Predict the CO2 emission from a 1.3 liter car that weighs 2300 kilograms:

import pandas
from sklearn import linear_model
from sklearn.preprocessing import StandardScaler
scale = StandardScaler()

df = pandas.read_csv("data.csv")

X = df[['Weight', 'Volume']]

y = df['CO2']

scaledX = scale.fit_transform(X)
regr = linear_model.LinearRegression()
regr.fit(scaledX, y)

scaled = scale.transform([[2300, 1.3]])

predictedCO2 = regr.predict([scaled[0]])
print(predictedCO2)
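
Since standardization is just a linear rescaling of the inputs, this model should give the same prediction as the unscaled one from the Multiple Regression chapter, approximately 107.2.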

ML train/test

Evaluate Your Model

In Machine Learning we create models to predict the outcome of certain events, like in the previous chapter where we predicted the CO2 emission of a car when we knew the weight and engine size.

To measure if the model is good enough, we can use a method called Train/Test.

What is Train/Test

Train/Test is a method to measure the accuracy of your model.

It is called Train/Test because you split the data set into two sets: a training set and a testing set.

80% for training, and 20% for testing.

 You train the model using the training set.
 You test the model using the testing set.

 Train the model means create the model.
 Test the model means test the accuracy of the model.

Start With a Data Set

Start with a data set you want to test.

Our data set illustrates 100 customers in a shop, and their shopping habits: the x-axis shows the number of minutes a customer stays in the shop, and the y-axis shows the amount of money they spend.

import numpy
import matplotlib.pyplot as plt
numpy.random.seed(2)

x = numpy.random.normal(3, 1, 100)
y = numpy.random.normal(150, 40, 100) / x

plt.scatter(x, y)
plt.show()

Split Into Train/Test

The training set should be a random selection of 80% of the original data.

The testing set should be the remaining 20%.

train_x = x[:80]
train_y = y[:80]

test_x = x[80:]
test_y = y[80:]
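
Because x and y were generated randomly, slicing off the first 80 values already gives a fair random selection. For real data you would typically shuffle before splitting; a minimal sketch of the same 80/20 split using a shuffled NumPy index:

indices = numpy.random.permutation(len(x))
train_x, train_y = x[indices[:80]], y[indices[:80]]
test_x, test_y = x[indices[80:]], y[indices[80:]]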

Display the Training Set

Display the same scatter plot with the training set:

plt.scatter(train_x, train_y)
plt.show()

It looks like the original data set, so it seems to be a fair selection.

Display the Testing Set

To make sure the testing set is not completely different, we will take a look at the testing set as well.

plt.scatter(test_x, test_y)
plt.show()

The testing set also looks like the original data set.

Fit the Data Set
What does the data set look like? In my opinion the best fit would be a polynomial regression, so let us draw a line of polynomial regression.

To draw a line through the data points, we use the plot() method of the matplotlib module.

Draw a polynomial regression line through the data points:

import numpy
import matplotlib.pyplot as plt
numpy.random.seed(2)

x = numpy.random.normal(3, 1, 100)
y = numpy.random.normal(150, 40, 100) / x

train_x = x[:80]
train_y = y[:80]

test_x = x[80:]
test_y = y[80:]

mymodel = numpy.poly1d(numpy.polyfit(train_x, train_y, 4))

myline = numpy.linspace(0, 6, 100)

plt.scatter(train_x, train_y)
plt.plot(myline, mymodel(myline))
plt.show()

The result backs up my suggestion of the data set fitting a polynomial regression, even though it would give us some weird results if we try to predict values outside of the data set. Example: the line indicates that a customer spending 6 minutes in the shop would make a purchase worth 200. That is probably a sign of overfitting.

But what about the R-squared score? The R-squared score is a good indicator of how well my data set is fitting the model.

R2

Remember R2, also known as R-squared?

It measures the relationship between the x axis and the y axis, and the value ranges from 0 to 1, where 0 means no relationship, and 1 means totally related.

The sklearn module has a method called r2_score() that will help us find this relationship.

In this case we would like to measure the relationship between the minutes a customer stays in the shop and how much money they spend.

How well does my training data fit in a polynomial regression?

import numpy
from sklearn.metrics import r2_score
numpy.random.seed(2)

x = numpy.random.normal(3, 1, 100)
y = numpy.random.normal(150, 40, 100) / x

train_x = x[:80]
train_y = y[:80]

test_x = x[80:]
test_y = y[80:]

mymodel = numpy.poly1d(numpy.polyfit(train_x, train_y, 4))

r2 = r2_score(train_y, mymodel(train_x))
print(r2)

Bring in the Testing Set

Now we have made a model that is OK, at least when it comes to training data.

Now we want to test the model with the testing data as well, to see if it gives us the same result.

Let us find the R2 score when using testing data:

import numpy
from sklearn.metrics import r2_score
numpy.random.seed(2)

x = numpy.random.normal(3, 1, 100)
y = numpy.random.normal(150, 40, 100) / x

train_x = x[:80]
train_y = y[:80]

test_x = x[80:]
test_y = y[80:]

mymodel = numpy.poly1d(numpy.polyfit(train_x, train_y, 4))

r2 = r2_score(test_y, mymodel(test_x))

print(r2)

Note: The result 0.809 shows that the model fits the testing set
as well, and we are confident that we can use the model to
predict future values.

Predict Values
Now that we have established that our model is OK, we can
start predicting new values.
How much money will a buying customer spend, if she or he
stays in the shop for 5 minutes?

print(mymodel(5))

The example predicted the customer to spend 22.88 dollars, which seems to correspond to the diagram.

ML decision tree

Decision Tree
In this chapter we will show you how to make a "Decision Tree". A Decision Tree is a flow chart, and can help you make decisions based on previous experience.

In the example, a person will try to decide if he/she should go to a comedy show or not.

Luckily our example person has registered every time there was a comedy show in town, and registered some information about the comedian, and also registered if he/she went or not.

Age  Experience  Rank  Nationality  Go
36   10          9     UK           NO
42   12          4     USA          NO
23   4           6     N            NO
52   4           4     USA          NO
43   21          8     USA          YES
44   14          5     UK           NO
66   3           7     N            YES
35   14          9     UK           YES
52   13          7     N            YES
35   5           9     N            YES
24   3           5     USA          NO
18   3           7     UK           YES
45   9           9     UK           YES

Now, based on this data set, Python can create a decision tree that can be used to decide if any new shows are worth attending.

How Does it Work?

First, read the data set with pandas:

Read and print the data set:

import pandas
df = pandas.read_csv("data.csv")
print(df)

To make a decision tree, all data has to be numerical.

We have to convert the non-numerical columns 'Nationality' and 'Go' into numerical values.

Pandas has a map() method that takes a dictionary with information on how to convert the values.

{'UK': 0, 'USA': 1, 'N': 2}

Means convert the values 'UK' to 0, 'USA' to 1, and 'N' to 2.

Change string values into numerical values:

d = {'UK': 0, 'USA': 1, 'N': 2}
df['Nationality'] = df['Nationality'].map(d)

d = {'YES': 1, 'NO': 0}
df['Go'] = df['Go'].map(d)

print(df)

Then we have to separate the feature columns from the target
column.

The feature columns are the columns that we try to predict from, and the target column is the column with the values we try to predict.

X is the feature columns, y is the target column:

features = ['Age', 'Experience', 'Rank', 'Nationality']

X = df[features]
y = df['Go']

print(X)
print(y)

Now we can create the actual decision tree and fit it with our details. Start by importing the modules we need:
Create and display a Decision Tree:

import pandas
from sklearn import tree
from sklearn.tree import DecisionTreeClassifier
import matplotlib.pyplot as plt

df = pandas.read_csv("data.csv")

d = {'UK': 0, 'USA': 1, 'N': 2}
df['Nationality'] = df['Nationality'].map(d)

d = {'YES': 1, 'NO': 0}
df['Go'] = df['Go'].map(d)

features = ['Age', 'Experience', 'Rank', 'Nationality']

X = df[features]
y = df['Go']

dtree = DecisionTreeClassifier()
dtree = dtree.fit(X, y)

tree.plot_tree(dtree, feature_names=features)
plt.show()

Result Explained
The decision tree uses your earlier decisions to calculate the odds of you wanting to go see a comedian or not.

Let us read the different aspects of the decision tree:

Rank

 Rank <= 6.5 means that every comedian with a rank of 6.5 or lower will follow the True arrow (to the left), and the rest will follow the False arrow (to the right).
 gini = 0.497 refers to the quality of the split, and is always a number between 0.0 and 0.5, where 0.0 would mean all of the samples got the same result, and 0.5 would mean that the split is done exactly in the middle.
 samples = 13 means that there are 13 comedians left at this point in the decision, which is all of them since this is the first step.
 value = [6, 7] means that of these 13 comedians, 6 will get a "NO", and 7 will get a "GO".

Gini

There are many ways to split the samples; we use the GINI method in this tutorial.

The Gini method uses this formula:

Gini = 1 - ((x/n)² + (y/n)²)

Where x is the number of positive answers ("GO"), n is the number of samples, and y is the number of negative answers ("NO"), which gives us this calculation:

1 - ((7 / 13)² + (6 / 13)²) = 0.497

The next step contains two boxes, one box for the comedians with a 'Rank' of 6.5 or lower, and one box with the rest.

True - 5 Comedians End Here:

 gini = 0.0 means all of the samples got the same result.
 samples = 5 means that there are 5 comedians left in this branch (5 comedians with a Rank of 6.5 or lower).
 value = [5, 0] means that 5 will get a "NO" and 0 will get a "GO".

False - 8 Comedians Continue:

Nationality

 Nationality <= 0.5 means that the comedians with a nationality value of less than 0.5 will follow the arrow to the left (which means everyone from the UK), and the rest will follow the arrow to the right.
 gini = 0.219 means that about 22% of the samples would go in one direction.
 samples = 8 means that there are 8 comedians left in this branch (8 comedians with a Rank higher than 6.5).
 value = [1, 7] means that of these 8 comedians, 1 will get a "NO" and 7 will get a "GO".

True - 4 Comedians Continue:

Age

 Age <= 35.5 means that comedians at the age of 35.5 or younger will follow the arrow to the left, and the rest will follow the arrow to the right.
 gini = 0.375 means that about 37.5% of the samples would go in one direction.
 samples = 4 means that there are 4 comedians left in this branch (4 comedians from the UK).
 value = [1, 3] means that of these 4 comedians, 1 will get a "NO" and 3 will get a "GO".

False - 4 Comedians End Here:

 gini = 0.0 means all of the samples got the same result.
 samples = 4 means that there are 4 comedians left in this
branch (4 comedians not from the UK).
 value = [0, 4] means that of these 4 comedians, 0 will get a
"NO" and 4 will get a "GO".

True - 2 Comedians End Here:

 gini = 0.0 means all of the samples got the same result.
 samples = 2 means that there are 2 comedians left in this
branch (2 comedians at the age 35.5 or younger).
 value = [0, 2] means that of these 2 comedians, 0 will get a
"NO" and 2 will get a "GO".

False - 2 Comedians Continue:

Experience
 Experience <= 9.5 means that comedians with 9.5 years of
experience, or less, will follow the arrow to the left, and
the rest will follow the arrow to the right.
 gini = 0.5 means that 50% of the samples would go in one
direction.
 samples = 2 means that there are 2 comedians left in this
branch (2 comedians older than 35.5).
 value = [1, 1] means that of these 2 comedians, 1 will get a
"NO" and 1 will get a "GO".

True - 1 Comedian Ends Here:

 gini = 0.0 means all of the samples got the same result.
 samples = 1 means that there is 1 comedian left in this
branch (1 comedian with 9.5 years of experience or less).
 value = [0, 1] means that 0 will get a "NO" and 1 will get a
"GO".

False - 1 Comedian Ends Here:

 gini = 0.0 means all of the samples got the same result.
 samples = 1 means that there is 1 comedian left in this branch (1 comedian with more than 9.5 years of experience).
 value = [1, 0] means that 1 will get a "NO" and 0 will get a "GO".

Predict Values
We can use the Decision Tree to predict new values.

Example: Should I go see a show starring a 40 year old American comedian, with 10 years of experience, and a comedy ranking of 7?

Use the predict() method to predict new values:

print(dtree.predict([[40, 10, 7, 1]]))

What would the answer be if the comedy rank was 6?

print(dtree.predict([[40, 10, 6, 1]]))
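
With the mapping defined earlier (1 means "GO", 0 means "NO"), the first call should print [1] and the second [0]: following the tree above, a rank of 7 together with a non-UK nationality ends in an all-"GO" leaf, while a rank of 6.5 or lower ends in an all-"NO" leaf.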

ML confusion matrix

What is a confusion matrix?

It is a table that is used in classification problems to assess where errors in the model were made.

The rows represent the actual classes the outcomes should have been, while the columns represent the predictions we have made. Using this table it is easy to see which predictions are wrong.

Creating a Confusion Matrix

Confusion matrices can be created by predictions made from a logistic regression.

For now we will generate actual and predicted values by utilizing NumPy:

import numpy

Next we will need to generate the numbers for "actual" and "predicted" values.

actual = numpy.random.binomial(1, 0.9, size = 1000)
predicted = numpy.random.binomial(1, 0.9, size = 1000)

In order to create the confusion matrix we need to import
metrics from the sklearn module.

from sklearn import metrics

Once metrics is imported we can use the confusion matrix function on our actual and predicted values.

confusion_matrix = metrics.confusion_matrix(actual, predicted)

To create a more interpretable visual display we need to convert the table into a confusion matrix display.

cm_display = metrics.ConfusionMatrixDisplay(confusion_matrix = confusion_matrix, display_labels = [False, True])

Visualizing the display requires that we import pyplot from matplotlib.

import matplotlib.pyplot as plt

Finally to display the plot we can use the functions plot() and
show() from pyplot.

cm_display.plot()
plt.show()

See the whole example in action:

import matplotlib.pyplot as plt
import numpy
from sklearn import metrics

actual = numpy.random.binomial(1, .9, size = 1000)
predicted = numpy.random.binomial(1, .9, size = 1000)

confusion_matrix = metrics.confusion_matrix(actual, predicted)

cm_display = metrics.ConfusionMatrixDisplay(confusion_matrix = confusion_matrix, display_labels = [False, True])

cm_display.plot()
plt.show()

Results Explained
The Confusion Matrix created has four different quadrants:

 True Negative (Top-Left Quadrant)
 False Positive (Top-Right Quadrant)
 False Negative (Bottom-Left Quadrant)
 True Positive (Bottom-Right Quadrant)

True means that the values were accurately predicted, False means that there was an error or wrong prediction.

Now that we have made a Confusion Matrix, we can calculate different measures to quantify the quality of the model. First, let's look at Accuracy.

Created Metrics

The matrix provides us with many useful metrics that help us to evaluate our classification model.

The different measures include: Accuracy, Precision, Sensitivity (Recall), Specificity, and the F-score, explained below.

Accuracy
Accuracy measures how often the model is correct.

How to Calculate
(True Positive + True Negative) / Total Predictions

Accuracy = metrics.accuracy_score(actual, predicted)

Precision

Of the positives predicted, what percentage is truly positive?

How to Calculate
True Positive / (True Positive + False Positive)

Precision does not evaluate the correctly predicted negative cases:

Precision = metrics.precision_score(actual, predicted)

Sensitivity (Recall)
Of all the positive cases, what percentage are predicted
positive?

Sensitivity (sometimes called Recall) measures how good the model is at predicting positives.

This means it looks at true positives and false negatives (which are positives that have been incorrectly predicted as negative).

How to Calculate
True Positive / (True Positive + False Negative)

Sensitivity is good at understanding how well the model predicts something is positive:

Sensitivity_recall = metrics.recall_score(actual, predicted)

F-score
F-score is the "harmonic mean" of precision and sensitivity.

It considers both false positive and false negative cases and is good for imbalanced datasets.

How to Calculate
2 * ((Precision * Sensitivity) / (Precision + Sensitivity))

This score does not take into consideration the True Negative
values:

F1_score = metrics.f1_score(actual, predicted)

Specificity
How good is the model at predicting negative results?

Specificity is similar to sensitivity, but looks at it from the perspective of negative results.

How to Calculate
True Negative / (True Negative + False Positive)

Since it is just the opposite of Recall, we use the recall_score function, taking the opposite position label:

Specificity = metrics.recall_score(actual, predicted, pos_label=0)

All calculations in one:

Example

print({"Accuracy": Accuracy, "Precision": Precision, "Sensitivity_recall": Sensitivity_recall, "Specificity": Specificity, "F1_score": F1_score})
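
For reference, here is a sketch that collects every step in this chapter into one runnable script (the exact numbers vary from run to run, since the values are random):

import numpy
from sklearn import metrics

actual = numpy.random.binomial(1, .9, size = 1000)
predicted = numpy.random.binomial(1, .9, size = 1000)

#each metric summarizes the confusion matrix in a different way
Accuracy = metrics.accuracy_score(actual, predicted)
Precision = metrics.precision_score(actual, predicted)
Sensitivity_recall = metrics.recall_score(actual, predicted)
Specificity = metrics.recall_score(actual, predicted, pos_label=0)
F1_score = metrics.f1_score(actual, predicted)

print({"Accuracy": Accuracy, "Precision": Precision, "Sensitivity_recall": Sensitivity_recall, "Specificity": Specificity, "F1_score": F1_score})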

ML hierarchical clustering
