
CP4252 MACHINE LEARNING

Machine Learning

Machine Learning is defined as the study of computer algorithms that
automatically build predictive models from past experience and training
data.

It is a branch of Artificial Intelligence and computer science that helps
build a model from training data and make predictions and decisions
without being explicitly programmed. Machine Learning is used in various
applications such as email filtering, speech recognition, computer vision,
self-driving cars, Amazon product recommendations, etc.

Commonly used Algorithms in Machine Learning


Machine Learning studies algorithms that learn from past experience in
order to make future decisions. Although Machine Learning offers a wide
variety of models, the following algorithms are among the most commonly
used by data scientists and practitioners today.

o Linear Regression
o Logistic Regression
o Decision Tree
o Bayes Theorem and Naïve Bayes Classification
o Support Vector Machine (SVM) Algorithm
o K-Nearest Neighbor (KNN) Algorithm
o K-Means
o Gradient Boosting algorithms
o Dimensionality Reduction Algorithms
o Random Forest
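To make one entry in this list concrete, here is a minimal k-nearest neighbor (KNN) classifier sketched in plain Python. The `knn_predict` helper and the toy data are invented for illustration; real projects would use a library implementation.

```python
from collections import Counter
import math

def knn_predict(train_X, train_y, x, k=3):
    """Classify point x by majority vote among its k nearest training points."""
    # Sort all training points by Euclidean distance from x.
    dists = sorted((math.dist(p, x), label) for p, label in zip(train_X, train_y))
    # Majority vote among the k closest labels.
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]

# Toy data: two well-separated clusters.
X = [(0, 0), (0, 1), (1, 0), (5, 5), (5, 6), (6, 5)]
y = ["a", "a", "a", "b", "b", "b"]
print(knn_predict(X, y, (0.5, 0.5)))  # near the first cluster -> "a"
print(knn_predict(X, y, (5.5, 5.5)))  # near the second cluster -> "b"
```

Note that KNN has no training phase at all: the "model" is simply the stored training set, which is why data quality issues (discussed below) hit it especially hard.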

Common issues in Machine Learning


Although machine learning is used in every industry and helps
organizations make more informed, data-driven choices than classical
methodologies allow, it still has many problems that cannot be ignored.
Here are some common issues that professionals face while building ML
skills and creating an application from scratch.

1. Inadequate Training Data


The major issue that arises when using machine learning algorithms is a
lack of both quality and quantity of data. Data plays a vital role in
training machine learning algorithms, yet many data scientists report that
inadequate, noisy, and unclean data severely degrade them. For example, a
simple task may require thousands of data samples, while an advanced task
such as speech or image recognition needs millions of examples. Data
quality is equally important for the algorithms to work well, yet
poor-quality data is commonly found in Machine Learning applications. Data
quality can be affected by the following factors:

o Noisy data - It causes inaccurate predictions and reduces accuracy in
classification tasks.
o Incorrect data - It leads to faulty training and faulty results in
machine learning models; hence, incorrect data can also reduce the
accuracy of the results.
o Difficulty generalizing output - Sometimes generalizing from the output
data becomes complex, which results in comparatively poor future
predictions.
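A simple, common guard against noisy numeric data is a robust outlier filter. The sketch below, in plain Python, drops points far from the median as measured in median absolute deviations; the helper name, threshold, and readings are invented for illustration.

```python
import statistics

def drop_noisy_points(values, k=5.0):
    """Drop points whose distance from the median exceeds k times the
    median absolute deviation (more robust to spikes than mean/stdev)."""
    med = statistics.median(values)
    mad = statistics.median(abs(v - med) for v in values)
    return [v for v in values if abs(v - med) <= k * mad]

readings = [9.8, 10.1, 10.0, 9.9, 10.2, 55.0]  # 55.0 is a noise spike
print(drop_noisy_points(readings))  # [9.8, 10.1, 10.0, 9.9, 10.2]
```

The median-based statistics are used instead of mean and standard deviation because a single large spike would inflate the standard deviation enough to hide itself.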

2. Poor quality of data


As discussed above, data plays a significant role in machine learning and
must be of good quality. Noisy, incomplete, inaccurate, and unclean data
lead to lower classification accuracy and low-quality results. Hence, poor
data quality is a major common problem when training machine learning
algorithms.

3. Non-representative training data


To ensure the trained model generalizes well, the sample training data
must be representative of the new cases to which we want to generalize.
The training data must cover the range of cases the model will encounter,
both past and current.

Further, using non-representative training data results in less accurate
predictions. A machine learning model is considered good if it predicts
well on unseen cases and provides accurate decisions. If there is too
little training data, the sample will contain sampling noise; such a
non-representative training set biases the model toward one class or
group, and its predictions will not be accurate.

Hence, we should train on representative data to guard against bias and
make accurate predictions that do not drift.
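A standard guard against a non-representative sample is a stratified split, which preserves each class's proportion in both the training and test sets. This is a plain-Python sketch; the helper name and toy labels are invented for illustration.

```python
import random
from collections import defaultdict

def stratified_split(X, y, test_frac=0.25, seed=0):
    """Split the data so each class keeps roughly the same proportion
    in train and test, guarding against a non-representative sample."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for xi, yi in zip(X, y):
        by_class[yi].append((xi, yi))
    train, test = [], []
    for items in by_class.values():
        rng.shuffle(items)                 # shuffle within each class
        cut = int(len(items) * test_frac)  # take the same fraction per class
        test.extend(items[:cut])
        train.extend(items[cut:])
    return train, test

X = list(range(40))
y = ["spam"] * 10 + ["ham"] * 30           # imbalanced classes
train, test = stratified_split(X, y, test_frac=0.2)
print(sum(1 for _, label in test if label == "spam"))  # 2 of 10 spam
print(sum(1 for _, label in test if label == "ham"))   # 6 of 30 ham
```

A naive random split of the same data could easily leave zero spam examples in the test set; stratification makes the 1:3 class ratio survive the split.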

4. Overfitting and Underfitting


Overfitting:

Overfitting is one of the most common issues faced by Machine Learning
engineers and data scientists. When a model is trained, it can start
capturing the noise and inaccuracies in the training data along with the
underlying pattern, which negatively affects its performance on new data.
Consider a simple example where the training set contains 1,000 mangoes,
1,000 apples, 1,000 bananas, and 5,000 papayas. There is then a
considerable probability of an apple being identified as a papaya, because
the training set is heavily biased toward papayas, so predictions suffer.
A common cause of overfitting is the use of highly flexible non-linear
methods, which can build unrealistic data models; switching to simpler
linear and parametric algorithms can reduce it.

Methods to reduce overfitting:

o Increase the amount of training data in the dataset.
o Reduce model complexity by selecting a simpler model with fewer
parameters.
o Apply Ridge or Lasso regularization.
o Stop training early (early stopping).
o Reduce the noise in the data.
o Reduce the number of attributes in the training data.
o Constrain the model.
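One of the items above, early stopping, can be sketched in a few lines of plain Python with a toy one-parameter linear model. The data, helper name, and hyperparameters are invented for illustration; real frameworks provide this as a built-in callback.

```python
def early_stopping_fit(train, val, lr=0.01, patience=3, max_epochs=500):
    """Fit y = w*x by gradient descent on `train`, stopping once the
    validation loss has not improved for `patience` epochs in a row."""
    w, best_w, best_loss, bad_epochs = 0.0, 0.0, float("inf"), 0
    for _ in range(max_epochs):
        # One gradient step on the training MSE.
        grad = sum(2 * (w * x - y) * x for x, y in train) / len(train)
        w -= lr * grad
        # Measure loss on held-out validation data, not on the training set.
        val_loss = sum((w * x - y) ** 2 for x, y in val) / len(val)
        if val_loss < best_loss:
            best_w, best_loss, bad_epochs = w, val_loss, 0
        else:
            bad_epochs += 1
            if bad_epochs >= patience:
                break  # early stop: keep the best weights seen so far
    return best_w

train = [(1, 2.1), (2, 3.9), (3, 6.2)]  # roughly y = 2x
val = [(4, 8.0), (5, 10.1)]
w = early_stopping_fit(train, val)
print(round(w, 1))  # close to 2.0
```

The key design point is that the stopping decision uses the validation loss: training loss keeps falling as the model adapts to noise, but validation loss turns upward once overfitting begins.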

Underfitting:

Underfitting is the opposite of overfitting. When a machine learning
model is trained on too little data, or with too simple a model, it
produces incomplete and inaccurate predictions, and the accuracy of the
model suffers.

Underfitting occurs when our model is too simple to capture the
underlying structure of the data, like an undersized pair of trousers.
This generally happens when we have limited data in the dataset and try
to fit a linear model to non-linear data. In such scenarios the model
lacks the necessary complexity, its rules are too simple for the dataset,
and it starts making wrong predictions.

Methods to reduce underfitting:

o Increase model complexity.
o Remove noise from the data.
o Train on more, and better-engineered, features.
o Reduce the constraints on the model.
o Increase the number of training epochs.
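As a sketch of "increase model complexity", the plain-Python least-squares fit below shows a straight line underfitting quadratic data, while adding an x-squared feature fits it exactly. The helper names and toy data are invented for illustration; the 2x2 normal equations are solved by hand with Cramer's rule.

```python
def fit_least_squares(features, targets):
    """Fit a two-feature linear model by solving the 2x2 normal
    equations (X^T X) w = X^T y with Cramer's rule."""
    a = sum(f[0] * f[0] for f in features)
    b = sum(f[0] * f[1] for f in features)
    d = sum(f[1] * f[1] for f in features)
    p = sum(f[0] * t for f, t in zip(features, targets))
    q = sum(f[1] * t for f, t in zip(features, targets))
    det = a * d - b * b
    return ((p * d - b * q) / det, (a * q - b * p) / det)

def mse(features, targets, w):
    """Mean squared error of predictions w[0]*f0 + w[1]*f1."""
    return sum((w[0] * f[0] + w[1] * f[1] - t) ** 2
               for f, t in zip(features, targets)) / len(targets)

xs = [-2, -1, 0, 1, 2]
ys = [x * x for x in xs]            # clearly non-linear data
linear = [(1, x) for x in xs]       # bias + x: too simple for this data
quad = [(1, x * x) for x in xs]     # bias + x^2: matches the structure

print(mse(linear, ys, fit_least_squares(linear, ys)))  # 2.8 (underfits)
print(mse(quad, ys, fit_least_squares(quad, ys)))      # 0.0 (fits exactly)
```

The straight line cannot do better than predicting the mean of this symmetric data, so its error stays large no matter how long it trains; only a richer feature set fixes the underfit.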

5. Monitoring and maintenance


Generalized, reliable output is mandatory for any machine learning model;
hence, regular monitoring and maintenance are compulsory. When the data or
the desired behaviour changes, the code and the resources for monitoring
it must be updated as well.
6. Getting bad recommendations

A machine learning model operates in a specific context; when that
context changes, the model can produce bad recommendations because of
concept drift. For example, a customer was looking for gadgets at one
point in time; the customer's requirements have since changed, but the
model keeps showing the same recommendations even though expectations
have moved on. This phenomenon is called data drift, and it generally
occurs when new data is introduced or the interpretation of the data
changes. We can mitigate it by regularly monitoring the data and updating
the model to match current expectations.
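A minimal drift monitor can simply compare a recent window of data against a reference window. The sketch below, in plain Python, flags drift when the recent mean moves several reference standard deviations away; the helper name, threshold, and toy data are invented for illustration, and production systems would use proper statistical tests.

```python
import statistics

def drifted(reference, current, threshold=2.0):
    """Flag drift when the current window's mean moves more than
    `threshold` reference standard deviations from the reference mean."""
    ref_mean = statistics.mean(reference)
    ref_std = statistics.stdev(reference)
    shift = abs(statistics.mean(current) - ref_mean)
    return shift > threshold * ref_std

last_year = [20, 21, 19, 20, 22, 20, 21]  # e.g. items viewed per session
this_month = [31, 30, 33, 29, 32]         # customer behaviour has shifted
print(drifted(last_year, this_month))     # True
```

When such a check fires, the usual responses are the ones the text describes: refresh the training data and retrain or recalibrate the model.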

7. Lack of skilled resources


Although Machine Learning and Artificial Intelligence are continuously
growing in the market, these industries are still young compared to
others, and the shortage of skilled manpower is an issue. We need people
with in-depth knowledge of mathematics, science, and technology to develop
and manage machine learning systems.

8. Customer Segmentation

Customer segmentation is also an important issue when developing a
machine learning application: we need to identify which customers act on
the recommendations the model shows and which do not even look at them.
Hence, an algorithm is needed that recognizes customer behaviour and
triggers relevant recommendations for each user based on past experience.

9. Process Complexity of Machine Learning


The machine learning process is very complex, which is another major
issue faced by machine learning engineers and data scientists. Machine
Learning and Artificial Intelligence are still relatively new, largely
experimental, and continuously changing; much of the work proceeds by
trial and error, so the probability of mistakes is higher than expected.
The process also includes analyzing the data, removing data bias, training
the model, and applying complex mathematical calculations, all of which
make the procedure complicated and quite tedious.

10. Data Bias


Data bias is another big challenge in Machine Learning. These errors
occur when certain elements of the dataset are weighted more heavily, or
given more importance, than others. Biased data leads to inaccurate
results, skewed outcomes, and other analytical errors. We can address this
by determining where the dataset is actually biased and then taking the
necessary steps to reduce it.

Methods to remove data bias:

o Research your customer segmentation more thoroughly.
o Be aware of your general use cases and potential outliers.
o Combine inputs from multiple sources to ensure data diversity.
o Include bias testing in the development process.
o Analyze data regularly and keep tracking errors so they can be resolved
easily.
o Review the collected and annotated data.
o Use multi-pass annotation for tasks such as sentiment analysis, content
moderation, and intent recognition.
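The first step in "determining where the dataset is biased" is often just measuring class balance. The plain-Python sketch below (helper name, ratio threshold, and labels invented for illustration) flags the papaya-heavy dataset from the overfitting example above.

```python
from collections import Counter

def class_balance(labels, max_ratio=3.0):
    """Report class frequencies and flag the dataset as potentially biased
    when the largest class outnumbers the smallest by more than max_ratio."""
    counts = Counter(labels)
    largest = max(counts.values())
    smallest = min(counts.values())
    return counts, largest / smallest > max_ratio

labels = ["papaya"] * 5000 + ["apple"] * 1000 + ["mango"] * 1000
counts, biased = class_balance(labels)
print(biased)  # True: papayas outnumber each other class 5 to 1
```

A check like this is cheap enough to run as part of the bias-testing step listed above, every time the training data is refreshed.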
