Chapter 1 Overview
Chapter 1 Overview
Overview
In the real world, we are surrounded by humans who can learn everything from their
experiences with their learning capability, and we have computers or machines which
work on our instructions. But can a machine also learn from experiences or past data like
a human does? So here comes the role of Machine Learning.
A) It is very hard to write programs that solve problems like recognizing a face.
– We don’t know what program to write because we don’t know how our
brain does it.
– Even if we had a good idea about how to do it, the program might be
horrendously complicated.
B) Instead of writing a program by hand, we collect lots of examples that specify the
correct output for a given input.
C) A machine learning algorithm then takes these examples and produces a program
that does the job.
– The program produced by the learning algorithm may look very different
from a typical hand-written program. It may contain millions of numbers.
– If we do it right, the program works for new cases as well as the ones we
trained it on.
Some more examples of tasks that are best solved by using a learning algorithm.
A) Recognizing Patterns:
– Facial identities or facial expressions
– Handwritten or Spoken words
– Medical images
B) Generating Patterns:
– Generating images or motion sequences
C) Recognizing Anomalies:
– Unusual sequences of credit card transactions
– Unusual patterns of sensor readings in a nuclear power plant or unusual
sound in your car engine.
D) Prediction:
– Future stock prices or currency exchange rates
A classic example of a task that requires machine learning: It is very hard to say what
makes a 2.
However, machine learning access a huge amount of data using data pre-processing. This data
can be either structured, semi-structured, or unstructured. Further, this data is fed through some
techniques and algorithms to machines, and then based on previous trends; it predicts the outputs
automatically.
The importance of machine learning can be easily understood by its uses cases.
Currently, machine learning is used in self-driving cars, cyber fraud detection, face
recognition, and friend suggestion by Facebook, etc. Various top companies such as
Netflix and Amazon have built machine learning models that are using a vast amount of
data to analyze the user interest and recommend product accordingly.
Following are some key points which show the importance of Machine Learning:
Rapid increment in the production of data
Solving complex problems, which are difficult for a human
Decision making in various sector including finance
Finding hidden patterns and extracting useful information from data.
Before some years (about 40-50 years), machine learning was science fiction, but today
it is the part of our daily life. Machine learning is making our day to day life easy
from self-driving cars to Amazon virtual assistant "Alexa". However, the idea behind
machine learning is so old and has a long history. The following are some milestones
which have occurred in the history of machine learning:
1834: In 1834, Charles Babbage, the father of the computer, conceived a device
that could be programmed with punch cards. However, the machine was never
built, but all modern computers rely on its logical structure.
1936: In 1936, Alan Turing gave a theory that how a machine can determine and
execute a set of instructions.
The era of stored program computers:
1940: In 1940, the first manually operated computer, "ENIAC" was invented, which
was the first electronic general-purpose computer. After that stored program
computer such as EDSAC in 1949 and EDVAC in 1951 were invented.
1943: In 1943, a human neural network was modeled with an electrical circuit. In
1950, the scientists started applying their idea to work and analyzed how human
neurons might work.
Computer Machinery and Intelligence:
1950: In 1950, Alan Turing published a seminal paper, "Computer Machinery and
Intelligence," on the topic of artificial intelligence. In his paper, he asked, "Can
machines think?"
Machine intelligence in Games:
1952: Arthur Samuel, who was the pioneer of machine learning, created a program
that helped an IBM computer to play a checkers game. It performed better more it
played.
1959: In 1959, the term "Machine Learning" was first coined by Arthur Samuel.
The first "AI" winter:
The duration of 1974 to 1980 was the tough time for AI and ML researchers, and
this duration was called as AI winter.
In this duration, failure of machine translation occurred, and people had reduced
their interest from AI, which led to reduced funding by the government to the
researches.
Machine Learning from theory to reality
1959: In 1959, the first neural network was applied to a real-world problem to
remove echoes over phone lines using an adaptive filter.
1985: In 1985, Terry Sejnowski and Charles Rosenberg invented a neural
network NETtalk, which was able to teach itself how to correctly pronounce
20,000 words in one week.
1997: The IBM's Deep blue intelligent computer won the chess game against the
chess expert Garry Kasparov, and it became the first computer which had beaten
a human chess expert.
1) Supervised Learning
Supervised learning is a type of machine learning method in which we provide sample
labeled data to the machine learning system in order to train it, and on that basis, it
predicts the output.
2) Unsupervised Learning
Unsupervised learning is a learning method in which a machine learns without any
supervision.
The training is provided to the machine with the set of data that has not been labeled,
classified, or categorized, and the algorithm needs to act on that data without any
supervision. The goal of unsupervised learning is to restructure the input data into new
features or a group of objects with similar patterns.
In unsupervised learning, we don't have a predetermined result. The machine tries to
find useful insights from the huge amount of data. It can be further classifieds into two
categories of algorithms:
Clustering
Association
3) Reinforcement Learning
Reinforcement learning is a feedback-based learning method, in which a learning agent
gets a reward for each right action and gets a penalty for each wrong action. The agent
learns automatically with these feedbacks and improves its performance. In
reinforcement learning, the agent interacts with the environment and explores it. The
goal of an agent is to get the most reward points, and hence, it improves its
performance. The robotic dog, which automatically learns the movement of his arms, is
an example of Reinforcement learning.
Applications of Machine learning
Machine learning is a buzzword for today's technology, and it is growing very rapidly day by
day. We are using machine learning in our daily life even without knowing it such as Google
Maps, Google assistant, Alexa, etc. The following are some of the most trending real-world
applications of Machine Learning.
Everyone who is using Google Map is helping this app to make it better. It takes information
from the user and sends back to its database to improve the performance.
5. Self-driving cars:
One of the most exciting applications of machine learning is self-driving cars. Machine learning
plays a significant role in self-driving cars. Tesla, the most popular car manufacturing company
is working on self-driving car. It is using unsupervised learning method to train the car models to
detect people and objects while driving.
Some machine learning algorithms such as Multi-Layer Perceptron, Decision tree, and Naïve
Bayes classifier are used for email spam filtering and malware detection.
The most important thing in the complete process is to understand the problem and to know the
purpose of the problem. Therefore, before starting the life cycle, we need to understand the
problem because the good result depends on the better understanding of the problem.
In the complete life cycle process to solve a problem, we create a machine learning system called
"model", and this model is created by providing "training". But to train a model, we need data.
Hence, life cycle starts by collecting data.
1. Gathering Data:
Data gathering is the first step of the machine learning life cycle. The goal of this step is to
identify and obtain all data-related to the problem.
In this step, we need to identify the different data sources, as data can be collected from various
sources such as files, database, Internet, or mobile devices. It is one of the most important steps
of the life cycle. The quantity and quality of the collected data will determine the efficiency of
the output. The more will be the data, the more accurate will be the prediction.
This step includes the following tasks:
Identify various data sources
Collect data
Integrate the data obtained from different sources
By performing the above task, we get a coherent set of data, also called as a dataset. It will be
used in further steps.
3. Data Wrangling
Data wrangling is the process of cleaning and converting raw data into a useable format. It is the
process of cleaning the data, selecting the variable to use, and transforming the data in a proper
format to make it more suitable for analysis in the next step. It is one of the most important steps
of the complete process. Cleaning of data is required to address the quality issues.
It is not necessary that data we have collected is always of our use as some of the data may not
be useful. In real-world applications, collected data may have various issues, including:
Missing Values
Duplicate data
Invalid data
Noise
So, we use various filtering techniques to clean the data. It is mandatory to detect and remove the
above issues because it can negatively affect the quality of the outcome.
4. Data Analysis
Now the cleaned and prepared data is passed on to the analysis step. This step involves:
Selection of analytical techniques
Building models
Review the result
The aim of this step is to build a machine learning model to analyze the data using various
analytical techniques and review the outcome. It starts with the determination of the type of the
problems, where we select the machine learning techniques such
as Classification, Regression, Cluster Analysis, Association, etc. then build the model using
prepared data, and evaluate the model.
Hence, in this step, we take the data and use machine learning algorithms to build the model.
5. Train Model
Now the next step is to train the model. In this step we train our model to improve its
performance for better outcome of the problem. We use datasets to train the model using various
machine learning algorithms. Training a model is required so that it can understand the various
patterns, rules, and, features of the data.
7. Deployment
The last step of machine learning life cycle is deployment, where we deploy the model in the
real-world system. If the above-prepared model is producing an accurate result as per our
requirement with acceptable speed, then we deploy the model in the real system. But, before
deploying the project, we will check whether it is improving its performance using available data
or not. The deployment phase is similar to making the final report for a project.
Difference between Data Science and Machine Learning
Data Science is the study of data cleansing, preparation, and analysis, while machine learning is
a branch of AI and subfield of data science. Data Science and Machine Learning are the two
popular modern technologies, and they are growing with an immoderate rate. But these two
buzzwords, along with artificial intelligence and deep learning are very confusing term, so it is
important to understand how they are different from each other.
Data Science and Machine Learning are closely related to each other but have different
functionalities and different goals. At a glance, Data Science is a field to study the approaches to
find insights from the raw data. Whereas, Machine Learning is a technique used by the group of
data scientists to enable the machines to learn automatically from the past data. To understand
the difference in-depth, let's first have a brief introduction to these two technologies.
Note: Data Science and Machine Learning are closely related to each other but cannot be treated as
synonyms.
The popular applications of ML are Email spam filtering, product recommendations, online
fraud detection, etc.
Skills Needed for the Machine Learning Engineer:
Understanding and implementation of Machine Learning Algorithms.
Natural Language Processing.
Good Programming knowledge of Python or R.
Knowledge of Statistics and probability concepts.
Knowledge of data modeling and data evaluation.
It deals with understanding and finding hidden It is a subfield of data science that enables the machine to
patterns or useful insights from the data, which learn from the past data and experiences automatically.
helps to take smarter business decisions.
It is used for discovering insights from the data. It is used for making predictions and classifying the result
for new data points.
It is a broad term that includes various steps to create It is used in the data modeling step of the data science as a
a model for a given problem and deploy the model. complete process.
A data scientist needs to have skills to use big data Machine Learning Engineer needs to have skills such as
tools like Hadoop, Hive and Pig, statistics, computer science fundamentals, programming skills in
programming in Python, R, or Scala. Python or R, statistics and probability concepts, etc.
It can work with raw, structured, and unstructured data. It mostly requires structured data to work on.
Data scientists spent lots of time in handling the ML engineers spend a lot of time for managing the
data, cleansing the data, and understanding its complexities that occur during the implementation of
patterns. algorithms and mathematical concepts behind that.
With the above image, you can understand Artificial Intelligence is a branch of computer science
that helps us to create smart, intelligent machines. Further, ML is a subfield of AI that helps to
teach machines and build AI-driven applications. On the other hand, Deep learning is the sub-
branch of ML that helps to train ML models with a huge amount of input data and complex
algorithms and mainly works with neural networks.
We have discussed machine learning and artificial intelligence basics, and it's time to move
towards the basics of deep learning.
Conclusion
Artificial intelligence is one of the most popular 5th generation technologies that is changing the
world using its subdomains, machine learning, and deep learning. AI helps us to create an
intelligent system and provide cognitive abilities to the machine. Further, machine learning
enables machines to learn based on experience without human intervention and makes them
capable of learning and predicting results with given data. At the same time, deep learning is the
breakthrough in the field of AI that uses various layers of artificial neural networks to achieve
impressive outputs for various problems such as image recognition and text recognition.