Class Xii Model Life Cycle
Class Xii Model Life Cycle
Ans. There are 5 steps in building an AI powered solution . Have a look at the picture below , we
discuss each stage one by one.
Data Acquisition
For performing analysis on data first you need to gather data , from reliable data sources. Real life data
can be weird and misleading. Human entries are always prone to errors, for example someone
mistyping 30.0 as 3.00 . Somebody making spelling mistakes or maybe labelling the data wrong.
1. Databases
2. web pages
3. devices like cameras and sensors ( eg. in Autopilots , weather predictions)
4. Public surveys /records of purchases, transactions, registrations and more.
Data exploration
Once you gather data you need to perform operations like data cleaning to find missing values , to
remove useless data and perform basic statistical analysis like drawing plots, comparing different
features of the data set and more. The entire process is known as EXPLORATORY DATA
ANALYSIS.
It helps to see which features are more important and what is the overall trend of the data that you have.
For example suppose you have a data set which has the features of a house ( like number of bedrooms ,
bathrooms , floor area etc.) and the final market price of the house.
You would expect the house price to vary linearly with data in smaller cities and it may vary
quadratically in metro-politan areas depending on location . A sea facing duplex near marine drive in
Mumbai will be way more expensive than a duplex located in a rural area.
Modelling
Now having cleaned the data and understanding the basic trends , a data scientist tries to formulate an
approximate mathematical relation between features and the final market price. We use feature
importances in deciding the final price.
The ability to mathematically describe the relationship between parameters is the heart of every AI
model.
Evaluation
The final task is to test the trained AI model on new real life data and see how it is performing.
According to problem statement different “loss functions” are used to see how much error our model is
making. The purpose of these functions is to provide a mathematical estimate as to how far we are from
making correct predictions
Finally if the model performs well on unseen new data , the deployment( using it as service on internet
applications) stage is started.
Who?: Refers that who is facing a problem, who the stakeholders of the problem are and who are
What?: Refers to what the problem is and what you know about the problem. What is the nature
of the problem? Can it be explained simply? How do you know it’s a problem? What is the
evidence to support that the problem exists? What solutions are possible in this situation? etc. At
this stage, you need to determine the exact nature of the problem.
Where?: It is related to the context or situation or location of the problem, focus on the
context/situation/location of the problem.
Why?: Refers to the reason we need to solve the problem, the benefits which the stakeholders
would get from the solution and how would it benefit them as well as the society, what are the
benefits to the stakeholders after solving the problem.
a. Open languages — Python is the most popular, with R and Scala also in the mix.
b. Open frameworks — Scikit-learn, XGBoost, TensorFlow, etc.
c. Approaches and techniques — Classic ML techniques from regression all the way to state-of-theart
GANs and RL
d. Productivity-enhancing capabilities — Visual modelling, AutoAI to help with feature engineering,
algorithm selection and hyperparameter optimization
e. Development tools — DataRobot, H2O, Watson Studio, Azure ML Studio, Sagemaker, Anaconda,
etc.
To aid the development teams, various AI development platforms include substantial documentation.
You need to go to the relevant webpages for this documentation, which are as follows, depending on
the AI platform you choose: