ML QB Answers

Machine learning question bank for gtu chapter 1 to 4

Uploaded by

ankittiwari4841

[Note:

1. This document does not contain all 25 answers.

2. All answers are written as short points.]

3. Write short note on Reinforcement learning.

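- A type of machine learning in which an agent learns to make decisions by
performing actions in an environment, with the aim of maximizing
cumulative reward.
- Trial-and-error approach: the agent learns from its experience and adjusts
its behaviour accordingly.
- No labelled (supervised) data is needed. It is usually treated as a third
paradigm alongside supervised and unsupervised learning, because learning
is driven by a reward signal rather than by labels.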
Key Components:
- Agent: The decision-maker
- Environment: world where the agent operates
- State: current situation of the agent
- Action: choices the agent can make
- Reward: signal from the environment that indicates the success or failure
of an action
How it Works:
- Initialization: The agent starts in an initial state.
- Action: The agent takes an action.
- Reward: The environment returns a reward and the next state, based on the
current state and the action taken.
- Repeat: Continue the action-reward cycle until a termination condition is
reached.
Goal: Maximize cumulative reward
Examples: Playing games, robotics, recommendation systems.
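
The agent-environment loop above can be sketched in a few lines. This is a minimal illustration only: the corridor environment, the reward of 1 at the goal, and the choice of tabular Q-learning with epsilon-greedy exploration are all assumptions made for the example (the notes do not name a specific algorithm).

```python
import random

N_STATES = 5          # hypothetical corridor: cells 0..4, cell 4 is the goal
ACTIONS = (-1, +1)    # action index 0 = move left, 1 = move right

def step(state, action):
    """Environment: apply the action, return (next_state, reward, done)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done

def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]   # q[state][action_index]
    for _ in range(episodes):
        state, done = 0, False                  # Initialization
        while not done:                         # Repeat until termination
            # Explore with probability epsilon (or on a tie), else exploit.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, ACTIONS[a])   # Action + Reward
            # Q-learning update: move the estimate toward reward + discounted future value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
    return q

q_table = train()
# Greedy policy for the non-terminal states 0..3.
policy = ["left" if q[0] > q[1] else "right" for q in q_table[:-1]]
```

After training, `policy` holds the action the agent prefers in each non-terminal cell; in this toy corridor the rewarded behaviour is to keep moving right toward the goal.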

4. Explain Key elements of Machine Learning. Explain various function approximation methods.

Key Elements:
- Data: The raw material that ML algorithms learn from. (e.g., CSV, images,
text).
- Features: The relevant attributes or characteristics extracted from the
data that are used for learning.
- Algorithm: The mathematical method used to learn patterns from the
data.
- Model: The output of the learning process, which represents the learned
patterns.
- Evaluation: The process of assessing the model's performance on a
separate dataset.
Function Approximation
- A core task in ML: the goal is to learn a function that maps inputs to outputs.
Linear Models
- Linear Regression: Fits a linear equation to the data.
- Logistic Regression: Used for classification problems, predicts the
probability of belonging to a class.
Non-Linear Models
- Decision Trees: Tree structures, each node represents a test on a feature,
each leaf represents a class or a predicted value.
- Random Forests: An ensemble of decision trees whose predictions are
combined for improved accuracy.
- Support Vector Machines (SVMs): Find the hyperplane that separates the
data into classes with the largest margin; they become non-linear when
combined with a kernel.
- Neural Networks: Complex models inspired by the human brain,
consisting of interconnected nodes (neurons).
- Deep Learning: Neural networks with many layers, capable
of learning complex patterns.
- Kernel Methods: Implicitly map the data into a higher-dimensional space
in which it is more likely to be linearly separable.
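
As a concrete instance of function approximation, the simplest linear model can be fitted with ordinary least squares. The sketch below assumes a single input feature and uses made-up data generated from y = 2x + 1; the function names are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y ~ slope * x + intercept (one feature)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance of x and y, and variance of x (both unnormalized).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]      # illustrative data from y = 2x + 1
slope, intercept = fit_line(xs, ys)

def predict(x):
    """The learned function: maps a new input to a predicted output."""
    return slope * x + intercept
```

Here the learned function recovers the line that generated the data, so `predict` can be applied to inputs that were not in the training set.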

7. Explain any two important machine learning libraries in python.

1. Scikit-learn:
- Overview: Scikit-learn is a popular open-source Python library for
machine learning. It provides a simple interface for building and training
various machine learning models.
- Key Features:
- Classification: Logistic regression, SVMs, decision trees, random forests,
etc.
- Regression: Linear regression, ridge regression, lasso regression, etc.
- Clustering: K-means, hierarchical clustering, DBSCAN, etc.
- Dimensionality reduction: PCA, t-SNE, etc.
- Model selection: Cross-validation, grid search, hyperparameter
tuning.
- Preprocessing: Data cleaning, normalization, feature scaling, etc.
2. TensorFlow:
- Overview: TensorFlow is a flexible, high-performance open-source
platform for machine learning. It is particularly well-suited for deep
learning tasks.
- Key Features:
- Deep neural networks: Convolutional neural networks (CNNs),
recurrent neural networks (RNNs), etc.
- Tensor manipulation: Efficient operations on multi-dimensional
arrays.
- Automatic differentiation: Automatic calculation of gradients for
optimization.
- Deployment: Deploy models to various platforms, including mobile
devices and servers.
8. Define the following machine learning concepts.
a. Regression: Regression is a machine learning task that involves predicting a
continuous numerical value. For example, predicting house prices based on
features like size, location, and number of bedrooms.
b. Classification: Classification is a machine learning task that involves
predicting a categorical value. It is used to categorize data into discrete classes.
For example, classifying emails as spam or not spam, or images as cats or dogs.
c. Clustering: Clustering is a machine learning task that involves grouping
similar data points together without using labels. It is used to discover
patterns or structures within the data. For example, clustering
customers based on their purchasing behavior.
d. Training Data: Training data is a dataset used to train a machine learning
model. It consists of input features and corresponding target values.
e. Test Data: Test data is a dataset used to evaluate the performance of a
trained machine learning model.
f. Function Approximation: Function approximation is the task of learning a
function that maps inputs to outputs. It involves constructing a model that can
accurately predict outputs for new inputs based on the patterns learned from
the training data.

g. Overfitting, Underfitting, and Perfect Fit

- Overfitting: A model is said to be overfitting when it performs well on
the training data but poorly on the testing data.
- Underfitting: A model is said to be underfitting when it performs poorly
on both the training and testing data.
- Perfect Fit: A model is said to have a perfect fit when it perfectly predicts
the training data. However, a perfect fit does not guarantee good
performance on new data, as it may indicate overfitting.
h. Cost Function: A cost function is a measure of how well a machine learning
model is performing. It quantifies the error between the model's predictions
and the true values. The goal of training a machine learning model is to
minimize the cost function.
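
The ideas in (g) and (h) can be seen together in a small sketch. It assumes mean squared error as the cost function and a 1-nearest-neighbour model that simply memorizes the training set; the data values are made up for illustration.

```python
def mse(y_true, y_pred):
    """Mean squared error: average of squared differences (one common cost function)."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def one_nn_predict(train_x, train_y, x):
    """Memorizing model: return the target of the closest training input."""
    nearest = min(range(len(train_x)), key=lambda i: abs(train_x[i] - x))
    return train_y[nearest]

# Hypothetical noisy samples of roughly y = x.
train_x = [1.0, 2.0, 3.0, 4.0]
train_y = [1.5, 1.6, 3.4, 3.7]
# Hypothetical unseen points that lie exactly on y = x.
test_x = [1.4, 2.6, 3.6]
test_y = [1.4, 2.6, 3.6]

train_cost = mse(train_y, [one_nn_predict(train_x, train_y, x) for x in train_x])
test_cost = mse(test_y, [one_nn_predict(train_x, train_y, x) for x in test_x])
```

The model reproduces every training target, so `train_cost` is zero (a perfect fit), while `test_cost` is larger because the memorized noise does not carry over to new points.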
9. Explain the flow diagram of machine learning procedure
- Data Collection: Gather relevant data
- Data Preprocessing: Clean and preprocess the data
- Feature Selection: Select the most relevant features
- Split Data: Split the data into training and testing sets
- Model Selection: Select an appropriate model for the data
- Model Training: Train the model on the training split
- Model Evaluation: Evaluate the performance of the model on the testing split
- Model Deployment: Deploy the model to the production environment
- Iteration and Improvement: ………
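
The main steps of this flow can be sketched end to end. Everything concrete here is an assumption for illustration: the synthetic two-blob data stands in for data collection, min-max scaling for preprocessing, an 80/20 split for splitting, and a nearest-centroid classifier for the selected model.

```python
import random

def make_data(rng, n_per_class=30):
    """Data collection (synthetic stand-in): two noisy 2-D blobs."""
    data = []
    for label, center in ((0, 1.0), (1, 8.0)):
        for _ in range(n_per_class):
            data.append(([rng.gauss(center, 0.5), rng.gauss(center, 0.5)], label))
    return data

def fit_scaler(rows):
    """Preprocessing: learn per-feature (min, max) from the training rows only."""
    return [(min(col), max(col)) for col in zip(*rows)]

def scale(row, scaler):
    return [(v - lo) / (hi - lo) for v, (lo, hi) in zip(row, scaler)]

def fit_centroids(rows, labels):
    """Model training: one mean vector per class."""
    centroids = {}
    for label in set(labels):
        members = [r for r, l in zip(rows, labels) if l == label]
        centroids[label] = [sum(col) / len(members) for col in zip(*members)]
    return centroids

def predict(row, centroids):
    """Assign the class whose centroid is closest (squared Euclidean distance)."""
    def dist(c):
        return sum((a - b) ** 2 for a, b in zip(row, c))
    return min(centroids, key=lambda label: dist(centroids[label]))

rng = random.Random(0)
data = make_data(rng)
rng.shuffle(data)
split = int(0.8 * len(data))                    # Split data: 80% train, 20% test
train, test = data[:split], data[split:]

scaler = fit_scaler([x for x, _ in train])
train_x = [scale(x, scaler) for x, _ in train]
train_y = [y for _, y in train]
model = fit_centroids(train_x, train_y)

test_x = [scale(x, scaler) for x, _ in test]    # reuse the training scaler
test_y = [y for _, y in test]
# Model evaluation: fraction of test points classified correctly.
accuracy = sum(predict(x, model) == y for x, y in zip(test_x, test_y)) / len(test_y)
```

Note that the scaler is fitted on the training split and only applied to the test split, so no information from the test data leaks into training.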

10. Issues in Machine Learning

1. Data Quality: Insufficient, noisy, or biased data can reduce performance.
2. Overfitting and Underfitting: refer to Q8(g).
3. Interpretability: Complex models such as deep neural networks are difficult
to interpret, so it is challenging to understand how they make decisions.
4. Scalability: Handling large datasets and complex models can be
computationally expensive.
5. Bias: Biased data or models can lead to unfair decisions and outcomes.
6. Privacy, Ethics: Collecting and using personal data raises privacy
concerns.

11. Types of Data in Machine Learning

1. Numerical Data: Continuous, Discrete
2. Categorical Data: Nominal, Ordinal
3. Text Data: Unstructured, structured
4. Image Data: Unstructured data that represents visual information.
5. Audio Data: Unstructured data that represents sound information.
6. Time Series Data: Sequential data where observations are recorded at
specific time intervals.
Examples:
- Numerical Data: Age, income, temperature
- Categorical Data: Gender, country, color
- Text Data: Product reviews, news articles, social media posts
- Image Data: Photographs, medical scans, satellite images
- Audio Data: Speech recordings, music files, sound effects
- Time Series Data: Stock prices, temperature readings, sensor data

14. Explain the interpretation and comparison of Box Plot.

Box Plots: A graphical representation of the distribution of a dataset. They
display the five-number summary: minimum, 1st quartile (Q1),
median (Q2), 3rd quartile (Q3), and maximum.
Components:
- Median (Q2): middle value of the dataset
- Q1 & Q3: together with the median, values that divide the dataset into
four equal parts
- IQR: The difference between Q3 and Q1
- Whiskers: Lines extending from the box to the minimum and maximum
values that are not outliers
- Outlier: Data points that lie outside of the whiskers
Interpretation:
- Median: indicates the central tendency of the data.
- IQR: represents the spread of the middle 50% of the data. A longer box
means a larger spread, a shorter box a smaller spread.
- Whiskers: indicate the range of the data, excluding outliers.
- Outliers: …….
Comparison: idk
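
The quantities a box plot draws can be computed directly, which also gives one possible way to compare two groups: put their medians and IQRs side by side. This sketch is not from the notes; it assumes the common 1.5 x IQR rule for whiskers and outliers, and the two groups are made-up data.

```python
import statistics

def box_stats(values):
    """Five-number-style summary plus outliers under the 1.5 * IQR rule."""
    data = sorted(values)
    q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence, high_fence = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    inside = [v for v in data if low_fence <= v <= high_fence]
    return {
        "q1": q1, "median": median, "q3": q3, "iqr": iqr,
        "whisker_low": inside[0],      # smallest value that is not an outlier
        "whisker_high": inside[-1],    # largest value that is not an outlier
        "outliers": [v for v in data if v < low_fence or v > high_fence],
    }

group_a = [1, 2, 3, 4, 5, 6, 7, 8, 9, 100]   # hypothetical data with one extreme value
group_b = [4, 5, 5, 6, 6, 6, 7, 7, 8, 9]     # hypothetical, tighter data
a, b = box_stats(group_a), box_stats(group_b)

# Comparison: which group has the higher centre and which has the wider box.
higher_median = "a" if a["median"] > b["median"] else "b"
wider_box = "a" if a["iqr"] > b["iqr"] else "b"
```

Comparing the two dictionaries mirrors reading two box plots next to each other: the median line shows which group sits higher, the box length shows which group is more spread out, and the outlier lists show which group has extreme values.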

15. Write difference: a. Predictive and Descriptive Model. b. Lazy vs Eager Learner

Predictive Models:
- Purpose: Predict future outcomes or values based on historical data.
- Focus: Predicting new, unseen data points.
- Examples: Regression models, classification models, time series models.
Descriptive Models:
- Purpose: Understand and summarize existing data.
- Focus: Describing patterns, relationships, and trends in the data.
- Examples: Clustering models, dimensionality reduction techniques.
Key Differences:

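| Feature  | Predictive Model                        | Descriptive Model                    |
|----------|-----------------------------------------|--------------------------------------|
| Purpose  | Predict future outcomes                 | Describe existing data               |
| Focus    | New, unseen data                        | Patterns in the data                 |
| Examples | Regression, classification, time series | Clustering, dimensionality reduction |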

Lazy vs. Eager Learners

Lazy Learners:
- Learn on the fly: Do not build a model until a prediction is requested for a new data point.
- Store data: Store all training data.
- Prediction: Make predictions based on the similarity between the new
data point and the stored training data.
- Examples: k-nearest neighbors (k-NN), instance-based learning.
Eager Learners:
- Build a model beforehand: Construct a model from the training data
before making predictions.
- Generalize: Learn general patterns from the training data.
- Prediction: Use the learned model to make predictions on new data.
- Examples: Decision trees, neural networks, support vector machines.
Key Differences:

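| Feature      | Lazy Learner                  | Eager Learner                         |
|--------------|-------------------------------|---------------------------------------|
| Learning     | On-the-fly                    | Pre-built model                       |
| Data Storage | All training data             | Model parameters                      |
| Prediction   | Similarity-based              | Model-based                           |
| Examples     | k-NN, instance-based learning | Decision trees, neural networks, SVMs |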
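
The difference shows up in where the work happens. In the sketch below, a k-NN classifier stands in for the lazy learner and a nearest-centroid classifier for the eager learner; both class names and the 1-D data are hypothetical, chosen only to keep the example short.

```python
from collections import Counter

class LazyKNN:
    """Lazy learner: fit() only stores the data; the work happens in predict()."""
    def __init__(self, k=3):
        self.k = k

    def fit(self, xs, ys):
        self.xs, self.ys = list(xs), list(ys)   # keep all training data
        return self

    def predict(self, x):
        # Rank stored points by distance to the query and vote among the k nearest.
        nearest = sorted(range(len(self.xs)), key=lambda i: abs(self.xs[i] - x))[:self.k]
        votes = Counter(self.ys[i] for i in nearest)
        return votes.most_common(1)[0][0]

class EagerCentroid:
    """Eager learner: fit() reduces the data to one parameter (a mean) per class."""
    def fit(self, xs, ys):
        self.means = {}
        for label in set(ys):
            members = [x for x, y in zip(xs, ys) if y == label]
            self.means[label] = sum(members) / len(members)
        return self

    def predict(self, x):
        # Only the stored means are consulted; the training data is not needed.
        return min(self.means, key=lambda label: abs(self.means[label] - x))

xs = [1.0, 1.5, 2.0, 8.0, 8.5, 9.0]
ys = ["low", "low", "low", "high", "high", "high"]
lazy = LazyKNN(k=3).fit(xs, ys)
eager = EagerCentroid().fit(xs, ys)
```

After fitting, `lazy` still holds all six training points, whereas `eager` holds only two numbers, which matches the "Data Storage" row of the comparison.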

19. Explain K-fold and Leave-one-out cross-validation.

K-Fold and Leave-One-Out Cross-Validation
Cross-validation is a technique used to evaluate the performance of a machine
learning model by splitting the dataset into multiple folds and training the
model on different subsets.
K-Fold Cross-Validation:
Process:
1. Divide the dataset into k equal-sized folds.
2. For each fold:
- Use the remaining k-1 folds for training.
- Use the current fold for testing.
- Evaluate the model's performance on the testing set.
3. Calculate the average performance across all k folds.
Advantages:
- Provides a more accurate estimate of the model's performance
compared to a single train-test split.
- Can be used for various evaluation metrics.
Disadvantages:
- Can be computationally expensive for large datasets and large values of
k.
Leave-One-Out Cross-Validation (LOOCV):
Process:
- Use all but one data point for training and the remaining data
point for testing.
- Repeat this process for each data point in the dataset.
- Calculate the average performance across all iterations.
- This is equivalent to k-fold cross-validation with k equal to the number
of data points.
Advantages:
- Uses almost all of the data for training in every iteration, so the
estimate has low bias, which helps especially for small datasets.
- No separate held-out test set is needed; every data point is used for
testing exactly once.
Disadvantages:
- Can be computationally expensive for large datasets, since the model is
trained once per data point.
- The estimate can have high variance, so k-fold cross-validation is
usually preferred for larger datasets.
[19 to 23 questions are over dosed_ narcotics are danger for health]

24. Explain Silhouette width and its meaning in cluster.

- Silhouette width is a metric used to evaluate the quality of clustering results.
- It measures how similar a data point is to its own cluster compared to
other clusters.
- A higher silhouette width means the data points are well clustered; a lower
value suggests the clustering may not be optimal.
Calculation:
- Calculate a: the average distance to the other points in the same cluster
- Calculate b: the average distance to points in the nearest different cluster
- Calculate the silhouette coefficient: s = (b - a) / max(a, b)
Interpretation:
- Silhouette coefficient: A value between -1 and 1.
- Close to 1: The data point is far from the nearest different cluster, indicating
good clustering.
- Close to -1: The data point is closer to the nearest different cluster than to its
own cluster, indicating poor clustering.
- Close to 0: The data point is on the boundary between two clusters.
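
The calculation can be sketched for a tiny example. The code assumes 1-D points with absolute difference as the distance, at least two clusters, and the common convention that a point alone in its cluster gets a silhouette of 0; the point values are made up.

```python
def silhouette_values(points, labels):
    """Per-point silhouette coefficient s = (b - a) / max(a, b)."""
    clusters = {}
    for p, l in zip(points, labels):
        clusters.setdefault(l, []).append(p)
    values = []
    for p, l in zip(points, labels):
        own = clusters[l]
        if len(own) == 1:
            values.append(0.0)        # convention for singleton clusters
            continue
        # a: mean distance to the other points in the same cluster
        # (the point itself contributes 0 to the sum, hence len(own) - 1).
        a = sum(abs(p - q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the points of any other cluster.
        b = min(sum(abs(p - q) for q in members) / len(members)
                for other, members in clusters.items() if other != l)
        values.append((b - a) / max(a, b))
    return values

points = [0.0, 1.0, 10.0, 11.0]
good = silhouette_values(points, [0, 0, 1, 1])   # natural grouping
bad = silhouette_values(points, [0, 1, 1, 1])    # point 1.0 put in the far cluster
good_width = sum(good) / len(good)               # average silhouette width
```

With the natural grouping every coefficient is close to 1, while in the second labelling the point 1.0 gets a negative coefficient because it lies closer to the other cluster than to its own.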

25. Write short note on Ensemble Methods.

Ensemble methods combine multiple machine learning models to improve
overall performance.
Common Ensemble Methods:
1. Bagging (Bootstrap Aggregating):
- Creates multiple models by training them on different bootstrap
samples of the training data.
- Combines the predictions of these models using averaging (for
regression) or voting (for classification).
- Example: Random Forest.
2. Boosting:
- Iteratively trains models, focusing on data points that were
misclassified by previous models.
- Weights the predictions of each model based on its performance.
- Examples: AdaBoost, Gradient Boosting Machine (GBM).
3. Stacking:
- Trains multiple base models on the same data.
- Combines the predictions of these models using a meta-model,
which learns to weigh the predictions of the base models.
Advantages:
- Improved accuracy
- Reduced overfitting
- Increased robustness
Disadvantages:
- Computationally expensive
- Less interpretable
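
Bagging, the first method above, can be sketched with very simple base models. The example assumes 1-D data with binary labels, decision stumps (a single threshold) as base models, and majority voting; the data and function names are hypothetical.

```python
import random
from collections import Counter

def fit_stump(xs, ys):
    """Pick the threshold t (predict 1 when x > t) with the fewest training errors."""
    values = sorted(set(xs))
    # Candidate thresholds: midpoints between adjacent distinct values.
    candidates = [(a + b) / 2 for a, b in zip(values, values[1:])] or [values[0]]
    def errors(t):
        return sum((x > t) != bool(y) for x, y in zip(xs, ys))
    return min(candidates, key=errors)

def fit_bagging(xs, ys, n_models=11, seed=0):
    """Train each stump on its own bootstrap sample (drawn with replacement)."""
    rng = random.Random(seed)
    n = len(xs)
    thresholds = []
    for _ in range(n_models):
        idx = [rng.randrange(n) for _ in range(n)]      # bootstrap sample
        thresholds.append(fit_stump([xs[i] for i in idx], [ys[i] for i in idx]))
    return thresholds

def predict(thresholds, x):
    """Combine the stumps by majority vote (classification)."""
    votes = Counter(int(x > t) for t in thresholds)
    return votes.most_common(1)[0][0]

xs = list(range(20))
ys = [0] * 10 + [1] * 10          # illustrative labels: class 1 when x >= 10
ensemble = fit_bagging(xs, ys)
```

Each stump sees a slightly different sample and may pick a different threshold; the vote smooths out those differences, which is the variance reduction that bagging relies on. An odd number of models is used so that a binary vote cannot tie.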
