0% found this document useful (0 votes)
12 views

Introduction to Statistical Learning _ Why Do We Need Statistical Learning

The document provides an overview of Statistical Learning, emphasizing its role in data analysis, prediction, and classification. It explains the methodologies of supervised and unsupervised learning, including regression and classification techniques. The importance of Statistical Learning in today's data-driven world is highlighted, showcasing its applications in various fields and the advantages it offers in decision-making and efficiency.

Uploaded by

Tanishq Saini
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
12 views

Introduction to Statistical Learning _ Why Do We Need Statistical Learning

The document provides an overview of Statistical Learning, emphasizing its role in data analysis, prediction, and classification. It explains the methodologies of supervised and unsupervised learning, including regression and classification techniques. The importance of Statistical Learning in today's data-driven world is highlighted, showcasing its applications in various fields and the advantages it offers in decision-making and efficiency.

Uploaded by

Tanishq Saini
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 15

11/8/23, 2:47 PM Introduction to Statistical Learning | Why do we Need Statistical Learning?

🚀 OFFER - ENTIRE WEBSITE 3700+ Courses | 1900+ Test Series | 12000+ Hours | 
@ 90% OFF Ends in 0 0 D 0 9 H 1 2 M 27 S

Introduction to Statistical Learning


(https://www.educba.
Updated com/)
June 16, 2023

*Please p

Overview of Statistical Learning


People can comprehend statistics as a study of collecting and analyzing data. Statistica
Learning servers as a means to extract facts and summarize the available data. Since th
18th century, people have predominantly utilized statistics for taxation and military
purposes. Later towards the end of the 20th century, with the advent of computers, the

applications of statistical concepts broadened with its contributions towards technolog


such as Machine Learning and Neural Nets. In this topic, we will learn about Introductio
(https://vue.comm100.com/chatwindow.aspx?siteId=90004140&planId=88106100-2271-458b-b0c7-
1d3108bb141c)
Statistical Learning.

https://www.educba.com/introduction-to-statistical-learning/ 1/15
11/8/23, 2:47 PM Introduction to Statistical Learning | Why do we Need Statistical Learning?

g
🚀 OFFER - ENTIRE WEBSITE 3700+ Courses | 1900+ Test Series | 12000+ Hours | 
@ 90% OFF Ends in 0 0 D 0 9 H 1 2 M 27 S
ADVERTI
 Popular Course in this category
STATISTICS Course Bundle - 24 Courses in 1 | 11 Mock Tests
(https://www.educba.
com/)

(https://www.educba.com/data-science/courses/statistical-analysis-course/?btnz=edu-right-pos
banner)

Start Your Free Data Science Course


*Please p
Hadoop, Data Science, Statistics & others

Statistical Learning enables data prediction and classification by effectively handling la


volumes of data. It involves performing numerous iterations to analyze and select the m
valuable and relevant data, ultimately leading to an optimized result.

What is Statistical Learning?


Data is the fuel that drives Statistical Learning, and statistics are all about making sense
the data in hand. The results obtained from statistical learning help us determine trend
and predict a possible outcome for the future.

Statistical Learning is a tool to accomplish the goals of supervised and unsupervised


Machine Learning techniques. With supervised statistical learning, we get to predict or
estimate an outcome based on previously present output, whereas, with unsupervised
statistical learning, we find various patterns present within the data by clustering them
similar groups.

This article shows Supervised Statistical Learning methodologies, namely Regression an


Classification.
(https://vue.comm100.com/chatwindow.aspx?siteId=90004140&planId=88106100-2271-458b-b0c7-
1d3108bb141c)

https://www.educba.com/introduction-to-statistical-learning/ 2/15
11/8/23, 2:47 PM Introduction to Statistical Learning | Why do we Need Statistical Learning?

🚀All-in-One
ENTIRE WEBSITE
OFFER - 3700+ Courses | 1900+ Test Series | 12000+ HoursADVERTIS
Data Science Bundle - 400+ Courses | 550+
|
@ 90% OFF Ends in 0 0 D 0 9 H 1 2 M 2 7 S
Mock Tests | 2000+ Hours | Lifetime |
(https://www.educba.
2000+ Hour of HD Videos | 80 Learning Paths | 400+ Courses | Verifiable Certificate of Completion
com/)
| Lifetime Access
     4.7
(https://www.educba.com/data-science/courses/data-science-course/?btnz=edu-in
between-para-banner)

1. Regression *Please p

Ever wondered how stock market predictions work? Or how a realtor estimates a house
price? Or want to know if a new car in the market is worth the buy? If yes, you can find
answers to these in Regression’s statistical methodology. We utilize regression equatio
and analysis to make unbiased and accurate predictions of quantitative data. In additio
regression Analysis helps us to identify the relationship between two or more variables

(https://vue.comm100.com/chatwindow.aspx?siteId=90004140&planId=88106100-2271-458b-b0c7-
1d3108bb141c)

https://www.educba.com/introduction-to-statistical-learning/ 3/15
11/8/23, 2:47 PM Introduction to Statistical Learning | Why do we Need Statistical Learning?

🚀 OFFER - ENTIRE WEBSITE 3700+ Courses | 1900+ Test Series | 12000+ Hours | 
@ 90% OFF Ends in 0 0 D 0 9 H 1 2 M 27 S

(https://www.educba.
com/)

In Simple Linear Regression (SLR), the relationship between a dependent variable (Y) an
independent variable (X) is determined. The equation illustrated below estimates how
*Please p
change in X will affect Y.

(https://vue.comm100.com/chatwindow.aspx?siteId=90004140&planId=88106100-2271-458b-b0c7-
1d3108bb141c)

https://www.educba.com/introduction-to-statistical-learning/ 4/15
11/8/23, 2:47 PM Introduction to Statistical Learning | Why do we Need Statistical Learning?

🚀 OFFER - ENTIRE WEBSITE 3700+ Courses | 1900+ Test Series | 12000+ Hours | 
@ 90% OFF Ends in 0 0 D 0 9 H 1 2 M 27 S

(https://www.educba.
com/)

*Please p

Bias-Variance Trade-off:
Linear Regression is all about finding the best fit straight line. Errors in regression mod
are mainly due to bias and variance. Minimizing these two prediction errors is essentia
obtain a generalized model that works well on training and testing data sets.

Bias:
The linear Regression Model assumes the target variable has a linear relationship with
features. In reality, though, this might not be the case, and the inability of the Linear
Regression model to capture the true relationship is termed bias. The error due to bias
determined by calculating the difference between predicted and actual values.

ADVERTIS
MS Excel & VBA for Data Science Course Bundle - 24
Courses in 1 | 10 Mock Tests
87+ Hours of HD Videos | 24 Courses | 10 Mock Tests & Quizzes | Verifiable Certificate of

Completion | Lifetime Access


     4.5
(https://vue.comm100.com/chatwindow.aspx?siteId=90004140&planId=88106100-2271-458b-b0c7-
(https://www.educba.com/excel/courses/excel-data-analysis-course/?btnz=edu-in-
1d3108bb141c)
between-para-banner)

https://www.educba.com/introduction-to-statistical-learning/ 5/15
11/8/23, 2:47 PM Introduction to Statistical Learning | Why do we Need Statistical Learning?

🚀 OFFER - ENTIRE WEBSITE 3700+ Courses | 1900+ Test Series | 12000+ Hours | 
@ 90% OFF Ends in 0 0 D 0 9 H 1 2 M 27 S
Variance:
The variance gives us a picture of how far the data points under consideration are spre
(https://www.educba.
com/)
The Variance error refers to the fluctuations in the predictions when data sets are chan
and are calculated as the variability of a model prediction from a given data point.

*Please p

Consider the scenarios where a model has high bias and low variance; then, it is likely t
less complex and probably will tend to underfit the data. If the model has low bias and
variance, it will likely overfit the data, making it more complex and inconsistent when t
for unseen inputs. Hence to avoid such scenarios, there is a need to come to a commo
ground w.r.t the bias and variance to have an acceptable model.

An ideal model is selected to have a low bias that can capture the proper relationship
between its variables and low variance that produces consistent predictions across diff
(https://vue.comm100.com/chatwindow.aspx?siteId=90004140&planId=88106100-2271-458b-b0c7-
1d3108bb141c)
datasets. This can be achieved by obtaining a sweet spot between a simple and compl

https://www.educba.com/introduction-to-statistical-learning/ 6/15
11/8/23, 2:47 PM Introduction to Statistical Learning | Why do we Need Statistical Learning?

y g p p p
🚀 OFFER - ENTIRE
Regression WEBSITE 3700+
Model. Regularization, Courses
bagging, and |boosting
1900+ Test Series
help | 12000+
achieve the Hours
sweat |spot.

@ 90% OFF Ends in 0 0 D 0 9 H 1 2 M 2 7 S

2. Classification
(https://www.educba.
Classification
com/) is applied to qualitative (non-numeric) data wherein the target variable c

classified or grouped into two (Binary Classification) or more classes (Multi-Class


Classification). Examples of Classification Statistical Learning include Tagging an e-mail
“spam” or “ham,” predicting customer churn, classifying animals based on their breeds

*Please p
In classification, the output is often obtained using probabilistic approaches so that the
results from the statistical inference give out a probability of an instance belonging to
class rather than just assigning the best class.

ADVERTIS
MACHINE LEARNING Course Bundle - 57 Courses in 1 | 32
Mock Tests
220+ Hours of HD Videos | 58 Courses | 32 Mock Tests & Quizzes | Verifiable Certificate of
Completion | Lifetime Access
     4.7
(https://www.educba.com/data-science/courses/machine-learning-course/?btnz=ed
in-between-para-banner)

Logistic Regression:
People widely use Logistic Regression as one of the classification algorithms for binary
classification. This model uses a logistic function to determine the target value betwee
range of 0 to 1 and can be represented as the Sigmoid function shown below.

(https://vue.comm100.com/chatwindow.aspx?siteId=90004140&planId=88106100-2271-458b-b0c7-
1d3108bb141c)

https://www.educba.com/introduction-to-statistical-learning/ 7/15
11/8/23, 2:47 PM Introduction to Statistical Learning | Why do we Need Statistical Learning?

🚀 OFFER - ENTIRE WEBSITE 3700+ Courses | 1900+ Test Series | 12000+ Hours | 
@ 90% OFF Ends in 0 0 D 0 9 H 1 2 M 27 S

(https://www.educba.
com/)

Why do we Need Statistical Learning? *Please p

In today’s age, if one thing is becoming more abundant than natural resources, then th
ought to be Data. A million bytes of data we generate daily need a source for analyzing
summering them. If not used wisely, people can easily misinterpret or manipulate thes
data to showcase only a particular point of view. Therefore, to avoid dangerous mishap
with data, Statistical Learning becomes a tool to ensure data integrity and proper and
efferent usage.

Statistical Learning helps us understand why a system behaves the way it does. It reduc
ambiguity and produces results that matter in the real world. Statistical Learning provid
accurate results that can find medical, business, banking, and government applications

(https://vue.comm100.com/chatwindow.aspx?siteId=90004140&planId=88106100-2271-458b-b0c7-
1d3108bb141c)

https://www.educba.com/introduction-to-statistical-learning/ 8/15
11/8/23, 2:47 PM Introduction to Statistical Learning | Why do we Need Statistical Learning?

🚀 OFFER - ENTIRE WEBSITE 3700+ Courses | 1900+ Test Series | 12000+ Hours | 
@ 90% OFF Ends in 0 0 D 0 9 H 1 2 M 27 S

(https://www.educba.
com/)

Advantages
*Please p

Easily identifies patterns and trends. With the identified trends, targeting
(https://vue.comm100.com/chatwindow.aspx?siteId=90004140&planId=88106100-2271-458b-b0c7-
1d3108bb141c) specific customers for specific products becomes more accessible.

https://www.educba.com/introduction-to-statistical-learning/ 9/15
11/8/23, 2:47 PM Introduction to Statistical Learning | Why do we Need Statistical Learning?

🚀 OFFER - ENTIRE WEBSITE 3700+ Courses | 1900+ Test Series | 12000+ Hours | 
@ 90% OFF Ends in 0 0 D 0 9 H 1 2 M 27 S

Saves time. Hundreds and thousands of epochs for achieving the optimiz
(https://www.educba.
com/) result are possible within a few minutes.

Can work with large numbers and a wide variety of parameters.


*Please p

Improves Decision Making and Prediction techniques by logically analyzin


the data rather than calling shots based on “gut feeling.”

Once the system is functional, no human intervention is required except f


occasional updates to maintain its functionality.

Conclusion – Introduction to Statistical


Learning

With our advancing technologies, we now deal with more statistics in our daily lives th
ever. The correct interpretation of the stories told by every billion bytes of data we
(https://vue.comm100.com/chatwindow.aspx?siteId=90004140&planId=88106100-2271-458b-b0c7-
1d3108bb141c)
accumulate is impossible without intersecting statistics with other branches such as Da

https://www.educba.com/introduction-to-statistical-learning/ 10/15
11/8/23, 2:47 PM Introduction to Statistical Learning | Why do we Need Statistical Learning?

p g
🚀 - ENTIRE
OFFERMachine
Mining, WEBSITE
Learning, 3700+ Courses
and Artificial | 1900+ Test Series | 12000+ Hours | 
Intelligence.
@ 90% OFF Ends in 0 0 D 0 9 H 1 2 M 2 7 S

Recommended Articles
(https://www.educba.
com/)
ADVERTIS
MICROSOFT POWER BI Course Bundle - 8 Courses in 1
34+ Hours of HD Videos | 8 Courses | Verifiable Certificate of Completion | Lifetime Access
     4.5
(https://www.educba.com/data-science/courses/microsoft-power-bi-training/?
btnz=edu-in-between-para-banner) *Please p

This is a guide to Introduction to Statistical Learning. Here we discuss the introduction,


do we need statistical learning and advantages. You may also have a look at the follow
articles to learn more –

1. Statistical Analysis Tools (https://www.educba.com/statistical-analysis-tools/


2. Statistical Analysis Regression (https://www.educba.com/statistical-analysis-
regression/)
3. Statistical Analysis Methods (https://www.educba.com/statistical-analysis-
methods/)
4. Statistical Analysis Types (https://www.educba.com/statistical-analysis-types

ADVERTISEMENT

SPSS Course Bundle - 14


Courses in 1 | 5 Mock
Tests
 34+ Hours of HD Videos
 14 Courses
 5 Mock Tests & Quizzes
(https://vue.comm100.com/chatwindow.aspx?siteId=90004140&planId=88106100-2271-458b-b0c7-
 Verifiable Certificate of Completion
1d3108bb141c)
 Lifetime Access

https://www.educba.com/introduction-to-statistical-learning/ 11/15
11/8/23, 2:47 PM Introduction to Statistical Learning | Why do we Need Statistical Learning?

🚀
 Lifetime Access
ENTIRE
  - 
OFFER  4.5 WEBSITE 3700+ Courses | 1900+ Test Series | 12000+ Hours | 
@ 90% OFF Ends in 0 0 D 0 9
(https://www.educba.com/data- H 1 2 M 27 S

science/courses/spss-certification-
course/?btnz=edu-after-post-banner)
(https://www.educba.
com/)
ADVERTISEMENT

MICROSOFT AZURE
Course Bundle - 15
Courses in 1 | 12 Mock
Tests *Please p

 63+ Hour of HD Videos


 15 Courses
 12 Mock Tests & Quizzes
 Verifiable Certificate of Completion
 Lifetime Access

     4.5
(https://www.educba.com/data-
science/courses/azure-training-
course/?btnz=edu-after-post-banner)

ADVERTISEMENT

HADOOP Course Bundle -


32 Courses in 1 | 4 Mock
Tests
 125+ Hour of HD Videos
 32 Courses
 4 Mock Tests & Quizzes
 Verifiable Certificate of Completion
 Lifetime Access
     4.5
(https://www.educba.com/data-
science/courses/hadoop-certification-
training/?btnz=edu-after-post-banner)
(https://vue.comm100.com/chatwindow.aspx?siteId=90004140&planId=88106100-2271-458b-b0c7-
1d3108bb141c)
ADVERTISEMENT

https://www.educba.com/introduction-to-statistical-learning/ 12/15
11/8/23, 2:47 PM Introduction to Statistical Learning | Why do we Need Statistical Learning?

🚀 OFFER - ENTIRE
ADVERTISEMENT
WEBSITE 3700+ Courses | 1900+ Test Series | 12000+ Hours | 
INFORMATICA Course
@ 90% OFF Ends in 0 0 D 0 9 H 1 2 M 2 7 S
Bundle - 7 Courses in 1
(https://www.educba.
 47+ Hours of HD Videos
com/)
 7 Courses
 Verifiable Certificate of Completion
 Lifetime Access
     4.5
(https://www.educba.com/data-
science/courses/informatica-training/?
btnz=edu-after-post-banner) *Please p

About Us

Blog (https://www.educba.com/blog/?source=footer)
Who is EDUCBA? (https://www.educba.com/about-us/?source=footer)
Sign Up (https://www.educba.com/data-science/signup/?source=footer)
Live Classes (https://www.educba.com/daily-live-classes/?source=footer)
Certificate from Top Institutions (https://www.educba.com/educbalive/?
source=footer)
Contact Us (https://www.educba.com/contact-us/?source=footer)
(https://vue.comm100.com/chatwindow.aspx?siteId=90004140&planId=88106100-2271-458b-b0c7-
1d3108bb141c)

https://www.educba.com/introduction-to-statistical-learning/ 13/15
11/8/23, 2:47 PM Introduction to Statistical Learning | Why do we Need Statistical Learning?

🚀
Verifiable Certificate (https://www.educba.com/data-science/verifiable-
OFFER - ENTIRE WEBSITE 3700+ Courses | 1900+ Test Series | 12000+ Hours | 
certificate/?source=footer)
@ 90% OFF Ends in 0 0 D 0 9 H 1 2 M 27 S
Reviews (https://www.educba.com/data-science/reviews/?source=footer)
Terms and Conditions (https://www.educba.com/terms-and-conditions/?
(https://www.educba.
source=footer)
com/)
Privacy Policy (https://www.educba.com/privacy-policy/?source=footer)

Apps

iPhone & iPad (https://apps.apple.com/app/educba-learning-app/id1341654580)*Please p


Android (https://play.google.com/store/apps/details?id=com.educba.www)

Resources

Free Courses (https://www.educba.com/data-science/free-courses/?


source=footer)
Database Management (https://www.educba.com/data-science/data-science-
tutorials/database-management-tutorial/?source=footer)
Machine Learning (https://www.educba.com/data-science/data-science-
tutorials/machine-learning-tutorial/?source=footer)
All Tutorials (https://www.educba.com/data-science/data-science-tutorials/?
source=footer)

Certification Courses

All Courses (https://www.educba.com/data-science/courses/?source=footer)


Data Science Course - All in One Bundle (https://www.educba.com/data-
science/courses/data-science-course/?source=footer)
Machine Learning Course (https://www.educba.com/data-
science/courses/machine-learning-course/?source=footer)
Hadoop Certification Training (https://www.educba.com/data-
science/courses/hadoop-certification-training/?source=footer)
Cloud Computing Training Course (https://www.educba.com/data-
science/courses/cloud-computing-training-course/?source=footer)
(https://vue.comm100.com/chatwindow.aspx?siteId=90004140&planId=88106100-2271-458b-b0c7-
1d3108bb141c)

https://www.educba.com/introduction-to-statistical-learning/ 14/15
11/8/23, 2:47 PM Introduction to Statistical Learning | Why do we Need Statistical Learning?

🚀
R Programming Course (https://www.educba.com/data-science/courses/r-
OFFER - ENTIRE WEBSITE 3700+ Courses | 1900+ Test Series | 12000+ Hours | 
programming-course/?source=footer)
@ 90% OFF Ends in 0 0 D 0 9 H 1 2 M 27 S
AWS Training Course (https://www.educba.com/data-science/courses/aws-
training-course/?source=footer)
(https://www.educba.
SAS Training Course (https://www.educba.com/data-science/courses/sas-
com/)
training-course/?source=footer)

ISO 10004:2018 & ISO 9001:2015 Certified


*Please
© 2023 - EDUCBA. ALL RIGHTS RESERVED. THE CERTIFICATION NAMES ARE THE TRADEMARKS OF p
THEIR RESPECTIVE OWNERS.

(https://vue.comm100.com/chatwindow.aspx?siteId=90004140&planId=88106100-2271-458b-b0c7-
1d3108bb141c)

https://www.educba.com/introduction-to-statistical-learning/ 15/15

You might also like