Introduction To ML Linear Regression
Introduction To ML Linear Regression
[email protected]
Machine Learning
[email protected]
“[Machine Learning is the] field of study that gives computers the ability to
[email protected] learn without being explicitly programmed.” Arthur Samuel 1959:
[email protected]
2. The ability to do the tasks come from the underlying model which is the result of
the learning process. Sometimes the ability comes from an mathematical
algorithm
[email protected]
[email protected]
3. The model generated represents behaviour of the processes that were earlier
performed before machine learning
4. The model is generated from huge volume of data, huge both in breadth and
depth reflecting the real world in which the processes are performed
5. The more representative data is of the real world, the better the model would be.
The challenge is how to make it a true representative
1. Cannot express our knowledge about patterns as a program. For e.g. Character
recognition or natural language processing
2. Do not have an algorithm to identify a pattern of interest. For e.g. In spam mail detection
1. Fraud detection
2. Sentiment analysis
[email protected]
[email protected]
4. Prediction of equipment failures
[email protected]
Introduction to Supervised
[email protected]
Machine Learning
[email protected]
c. The model thus generated is used to make predictions about future instances
[email protected]
where the predictor feature values are known but the target / class value is
unknown
a. E.g.-1 building model to predict the re-sale value of a car based on its current mileage,
age, color etc.
b. E.g.-2 Predicting the final year scores based on student performance in previous
years.
Pre-process Address data quality issues such as missing values, outliers, data
Data pollution etc. Establish veracity of the data. Select attributes for model,
Need domain expertise
Create
Split the data into training set and test set. Generally
training & 70:30 ratio is used
[email protected] test set
[email protected]
Select
Select appropriate algorithm/s to model. For e.g. Random
appropriate Forest, K Nearest Neighbors etc. Depends on data
algorithm/s
Deploy at scale
OK?
No Yes
Productionize
This file is meant for personal use by [email protected] only. & calibrate
20 Sharing or publishing the contents in part or full is liable for legal action.