FAM Unit4
FAM Unit4
In the real world, we are surrounded by humans who can learn everything from their experiences
with their learning capability, and we have computers or machines which work on our instructions.
But can a machine also learn from experiences or past data like a human does? So here comes the
role of Machine Learning.
o The duration of 1974 to 1980 was the tough time for AI and ML
researchers, and this duration was called as AI winter.
o In this duration, failure of machine translation occurred, and people
had reduced their interest from AI, which led to reduced funding by
the government to the researches.
2007:
o Participants were tasked with increasing the accuracy of Netflix's
recommendation algorithm when the Netflix Prize competition
began.
o Support learning made critical progress when a group of specialists
utilized it to prepare a PC to play backgammon at a top-notch level.
2008:
2009:
2010:
2011:
2013:
2014:
2015:
2016:
2017:
Machine learning life cycle involves seven major steps, which are given
below:
o Gathering Data
o Data preparation
o Data Wrangling
o Analyse Data
o Train the model
o Test the model
o Deployment
1. Gathering Data:
Data Gathering is the first step of the machine learning life cycle. The goal
of this step is to identify and obtain all data-related problems.
In this step, we need to identify the different data sources, as data can be
collected from various sources such as files, database, internet,
or mobile devices. It is one of the most important steps of the life cycle.
The quantity and quality of the collected data will determine the efficiency
of the output. The more will be the data, the more accurate will be the
prediction.
By performing the above task, we get a coherent set of data, also called
as a dataset. It will be used in further steps.
2. Data preparation
After collecting the data, we need to prepare it for further steps. Data
preparation is a step where we put our data into a suitable place and
prepare it to use in our machine learning training.
In this step, first, we put all data together, and then randomize the
ordering of data.
o Data exploration:
It is used to understand the nature of data that we have to work with. We
need to understand the characteristics, format, and quality of data.
A better understanding of data leads to an effective outcome. In this, we
find Correlations, general trends, and outliers.
o Data pre-processing:
Now the next step is preprocessing of data for its analysis.
3. Data Wrangling
Data wrangling is the process of cleaning and converting raw data into a
useable format. It is the process of cleaning the data, selecting the
variable to use, and transforming the data in a proper format to make it
more suitable for analysis in the next step. It is one of the most important
steps of the complete process. Cleaning of data is required to address the
quality issues.
o Missing Values
o Duplicate data
o Invalid data
o Noise
4. Data Analysis
Now the cleaned and prepared data is passed on to the analysis step. This
step involves:
Hence, in this step, we take the data and use machine learning algorithms
to build the model.
5. Train Model
Now the next step is to train the model, in this step we train our model to
improve its performance for better outcome of the problem.
6. Test Model
Once our machine learning model has been trained on a given dataset,
then we test the model. In this step, we check for the accuracy of our
model by providing a test dataset to it.
7. Deployment
The last step of machine learning life cycle is deployment, where we
deploy the model in the real-world system.