Podar Pearl School: Chapter 1: Capstone Project Question and Answers
(Under the supervision of the Ministry of Education and Higher Education, Qatar)
Ans:
1) Which category? (Classification)
2) How much or how many? (Regression)
3) Which group? (Clustering)
4) Is this unusual? (Anomaly Detection)
5) Which option should be taken? (Recommendation)
It is important to determine which of these questions we are asking, and how
answering it helps us solve our problem.
6. Define Design Thinking
Ans:
Design Thinking is a design methodology that provides a solution-based approach
to solving problems. It’s extremely useful in tackling complex problems that are
ill-defined or unknown.
7. Mention the five stages in Design Thinking
Ans:
1) Empathize
2) Define
3) Ideate
4) Prototype
5) Test
8. Briefly explain the term ‘Business Understanding’
Ans:
Every project, regardless of its size, starts with business understanding, which lays
the foundation for successful resolution of the business problem. The business
sponsors who need the analytic solution play a critical role in this stage by
defining the problem, project objectives and solution requirements from a business
perspective. It is the first stage in foundational methodology for data science.
If the question is to show relationships, a descriptive approach may be
required.
11. How will you identify the data requirements as a part of solving problem?
Ans:
We can identify the data requirements by answering the following questions:
Who?
What?
Where?
When?
Why?
How?
12. What are the points that a data scientist needs to identify at the data
requirements stage in data science methodology?
Ans:
1. Which data ingredients are required?
2. How to source or collect them?
3. How to understand or work with them?
4. How to prepare the data to meet the desired outcome?
13. How does the data requirements stage play a vital role in data science
methodology?
Ans:
It is vital to define the data requirements for decision-tree classification prior to
undertaking the data collection and data preparation stages of the methodology.
This includes identifying the necessary data content, formats and sources for
initial data collection.
In this phase the data requirements are revised and decisions are made as to
whether more or less data is needed.
Once the data ingredients are collected, the data scientist will have a good
understanding of what they will be working with.
14. Why can techniques such as descriptive statistics and visualization be
applied to the data set?
Ans:
Techniques such as descriptive statistics and visualization can be applied to the
data set to assess its content, quality, and initial insights. Gaps in the data will be
identified, and plans to either fill them or make substitutions will have to be made.
15. What are the two types of AI models?
Ans:
The two types of AI models are,
a) Descriptive
For example, Netflix uses this type of analytics to see what genres and TV shows
interest its subscribers most.
b) Predictive
A predictive model tries to yield yes/no, or stop/go type outcomes. These models are
based on the analytic approach that was taken, either statistically driven or machine
learning driven.
Data Modelling focuses on developing models that are either descriptive or predictive.
The end goal is to move the data scientist to a point where a data model can be built to
answer the question.
18. What is train-test split Evaluation?
Ans:
The train-test split is a technique for evaluating the performance of a machine
learning algorithm.
It can be used for classification or regression problems and can be used for any
supervised learning algorithm.
The procedure involves taking a dataset and dividing it into two subsets, training
dataset and testing dataset.
Train Dataset: Used to fit the machine learning model.
Test Dataset: Used to evaluate the fit machine learning model.
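As a hedged illustration, the split described above might look like this with scikit-learn’s train_test_split (the toy data, the 30% test size, and the random_state are assumptions for demonstration, not from the chapter):

```python
# Sketch of a train-test split, assuming scikit-learn is installed.
from sklearn.model_selection import train_test_split

X = list(range(10))          # 10 input samples (made-up data)
y = [v * 2 for v in X]       # matching target values

# Hold out 30% of the rows for testing; fix random_state for repeatability.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42)

print(len(X_train), len(X_test))   # 7 training rows, 3 test rows
```

The model would then be fit only on X_train/y_train and evaluated on X_test/y_test.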
27. What are the standard mathematical measures to evaluate model quality?
Ans:
RMSE – Root Mean Square Error Method
MSE - Mean Square Error Method
MAPE - Mean Absolute Percentage Error
MAE – Mean Absolute Error Method
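A minimal sketch computing all four measures by hand in plain Python (the sample actual and predicted values are invented for illustration):

```python
import math

y_true = [100.0, 200.0, 300.0, 400.0]   # made-up actual values
y_pred = [110.0, 190.0, 310.0, 390.0]   # made-up predicted values
n = len(y_true)

# Mean Absolute Error: average of absolute differences.
mae = sum(abs(a - p) for a, p in zip(y_true, y_pred)) / n
# Mean Squared Error: average of squared differences.
mse = sum((a - p) ** 2 for a, p in zip(y_true, y_pred)) / n
# Root Mean Squared Error: square root of MSE.
rmse = math.sqrt(mse)
# Mean Absolute Percentage Error: average of absolute errors relative to actuals.
mape = 100 * sum(abs(a - p) / a for a, p in zip(y_true, y_pred)) / n

print(mae, mse, rmse, mape)   # 10.0  100.0  10.0  ~5.21%
```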
29. What are the types of Classification Loss and Regression Loss?
Ans:
31. What is MSE (Mean Squared Error)?
Ans:
MSE is the most commonly used regression loss function. It is the average of the
squared differences between the target values and the predicted values.
Formula: MSE = (1/n) * Σ (actual_i − predicted_i)^2
Extra Questions
33. What is MAE and MAPE?
Ans:
MAE
The Mean Absolute Error is the mean of the absolute differences between the actual
values and the predicted values.
MAPE
Mean Absolute Percentage Error (MAPE) is a statistical measure to define the
accuracy of a machine learning algorithm on a particular dataset.
It represents the average of the absolute percentage errors of each entry in a dataset
to calculate how accurate the forecasted quantities were in comparison with the actual
quantities.
1. What are the steps to decompose a problem before writing code?
Ans:
1. Understand the problem and then restate the problem in your own words
Know what the desired inputs and outputs are
Ask questions for clarification
2. Break the problem down into a few large pieces. Write these down, either on paper
or as comments in a file.
3. Break complicated pieces down into smaller pieces. Keep doing this until all of the
pieces are small.
4. Code one small piece at a time.
Think about how to implement it
Write the code/query
Test it.
2. Imagine that you want to create your first app. How would you
decompose the task of creating an app?
Ans:
To decompose this task, we would need to know the answers to a series of smaller
problems:
What kind of app do you want to create?
What will your app look like?
Who is the target audience for your app?
What will the graphics look like?
What audio will you include?
What software will you use to build your app?
How will the user navigate your app?
How will you test your app?
This list has broken down the complex problem of creating an app into
much simpler problems that can now be worked out.
3. What is time series decomposition?
Ans:
Time series decomposition involves thinking of a series as a combination of level,
trend, seasonality, and noise components. Decomposition provides a useful abstract
model for thinking about time series generally and for better understanding problems
during time series analysis and forecasting.
These components are defined as follows:
Level: The average value in the series.
Trend: The increasing or decreasing value in the series.
Seasonality: The repeating short-term cycle in the series.
Noise: The random variation in the series.
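The four components can be sketched by building a toy additive series in plain Python (all numbers here are made up for demonstration; a real analysis would typically use a library such as statsmodels):

```python
import math
import random

random.seed(0)
n = 24  # two "years" of monthly observations (illustrative)

level = [50.0] * n                                   # the average value in the series
trend = [0.5 * t for t in range(n)]                  # steadily increasing value
seasonality = [5 * math.sin(2 * math.pi * t / 12)    # repeating 12-step cycle
               for t in range(n)]
noise = [random.gauss(0, 1) for _ in range(n)]       # random variation

# An additive model combines the four components by simple addition.
series = [level[t] + trend[t] + seasonality[t] + noise[t] for t in range(n)]
```

Decomposition works in the opposite direction: it starts from the observed series and estimates each of these components.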
4. Depict the Foundational Methodology of Data Science using a
diagram
Ans:
5. Explain the train-test split procedure
Ans:
The train-test split is a technique for evaluating the performance of a machine
learning algorithm.
It can be used for classification or regression problems and can be used for any
supervised learning algorithm.
The procedure involves taking a dataset and dividing it into two subsets. The
first subset is used to fit the model and is referred to as the training dataset. The
second dataset is referred to as the test dataset.
The test dataset is not used to train the model; instead, the input element of the
dataset is provided to the model, then predictions are made and compared to
the expected values.
The objective is to estimate the performance of the machine learning model on
new data – the data which is not used to train the model.
The idea is to fit the model on available data with known inputs and outputs,
then make predictions on new examples in the future where we do not have the
expected output or target values.
The train-test procedure is appropriate when there is a sufficiently large dataset
available.
6. Explain the procedure of K-fold cross validation
Ans:
Shuffle the dataset randomly.
Split the dataset into k groups
For example, if k=5, we divide the data into 5 pieces, each being 20% of the full
dataset.
We run an experiment called experiment 1 which uses the first fold as a holdout
set, and everything else as training data. This gives us a measure of model
quality based on a 20% holdout set.
We then run a second experiment, where we hold out data from the second fold
(using everything except the 2nd fold for training the model.) This gives us a
second estimate of model quality. We repeat this process, using every fold once
as the holdout. Putting this together, 100% of the data is used as a holdout at
some point.
Finally, summarize the skill of the model using the sample of model evaluation
scores.
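The steps above can be sketched in plain Python (the dataset and the placeholder "score" are assumptions for illustration; a real run would fit and evaluate a model in the marked spot):

```python
import random

random.seed(1)
data = list(range(20))
random.shuffle(data)             # 1) shuffle the dataset randomly

k = 5
fold_size = len(data) // k
# 2) split the dataset into k groups of equal size
folds = [data[i * fold_size:(i + 1) * fold_size] for i in range(k)]

scores = []
for i in range(k):
    holdout = folds[i]           # 3) fold i serves as the holdout set
    training = [x for j, f in enumerate(folds) if j != i for x in f]
    # ... fit a model on `training`, evaluate it on `holdout` ...
    scores.append(len(holdout) / len(data))   # placeholder "score"

# Every sample appears in a holdout set exactly once,
# so 100% of the data is used as a holdout at some point.
covered = sorted(x for f in folds for x in f)
```

The final model-quality estimate is the summary (typically the mean) of the k scores.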