Group B 2
Ans.- Traditional Programming
• Approach: In traditional programming, you write explicit instructions for the computer to follow. It is rule-based, where every possible scenario is coded by the programmer.
• Steps: Define the problem, create an algorithm, and implement the solution in a programming language.
• Output: Deterministic, meaning the same input will always produce the same output.
• Example: Writing a sorting algorithm where you specify each step to sort a list.
Machine Learning
• Approach: Machine learning focuses on teaching a model to learn patterns from data. Instead of explicitly coding rules, you train a model using data and let it infer the rules.
• Steps: Collect data, choose a model, train the model with data, and use it to make predictions.
• Data: Crucial for training the model and improving its accuracy.
• Output: Probabilistic, meaning the output can vary based on the model and data.
• Example: Training a model to recognize images of cats and dogs by feeding it thousands of labeled images.
Key Differences
• Flexibility: Machine learning models can adapt to new data and improve over time, whereas traditional
programming requires manual updates for new scenarios.
Example Scenario
Spam Detection:
• Traditional Programming: Write rules to filter spam emails based on keywords like "free", "win", etc.
• Machine Learning: Train a model with a dataset of labeled spam and non-spam emails. The model learns to identify patterns and classify new emails as spam or not (see the sketch below).
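To make the contrast concrete, here is a minimal, hypothetical sketch in Python: a hand-written keyword rule on one side, and a model trained on a few labeled examples on the other. The keyword list, toy emails, and labels are invented for illustration.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB

# Traditional programming: every rule is written by hand (hypothetical keyword list).
SPAM_KEYWORDS = {"free", "win", "prize"}

def rule_based_is_spam(email: str) -> bool:
    return bool(set(email.lower().split()) & SPAM_KEYWORDS)

# Machine learning: the "rules" are inferred from labeled examples (toy data).
emails = ["win a free prize now", "meeting at noon tomorrow",
          "free entry to win cash", "project report attached"]
labels = [1, 0, 1, 0]  # 1 = spam, 0 = not spam

vectorizer = CountVectorizer()
X = vectorizer.fit_transform(emails)
model = MultinomialNB().fit(X, labels)

new_email = "claim your free prize"
print(rule_based_is_spam(new_email))                      # rule-based decision
print(model.predict(vectorizer.transform([new_email])))   # learned decision
```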
In essence, traditional programming is like following a recipe, whereas machine learning is more like teaching a chef to create dishes based on taste and experience. Each approach has its strengths and is suited to different types of problems; in practice the two often complement each other.
Que.2-Compare and contrast machine learning with statistical modeling. In what ways
are they similar, and where do they differ?
Ans.- Machine learning and statistical modeling are both techniques used to analyze data, but they have different
focuses, methodologies, and applications. Here’s a comparison highlighting their similarities and differences:
Similarities:
1. Data-Driven: Both approaches rely on data to make predictions or draw insights. They seek to identify
patterns and relationships within datasets.
2. Mathematical Foundations: Both use mathematical concepts, including probability theory, algebra, and
calculus, to derive conclusions from data.
3. Goal of Prediction: Both aim to create models that can predict outcomes based on input variables.
4. Model Evaluation: Both involve assessing model performance using metrics such as accuracy,
precision, recall, or mean squared error.
Differences:
1. Purpose and Focus:
o Statistical Modeling: Focuses on inference: understanding and quantifying the relationships between variables and testing hypotheses about them.
o Machine Learning: Focuses more on prediction and classification, often prioritizing accuracy over interpretability. It is less concerned with understanding the underlying relationships.
2. Complexity of Models:
o Statistical Models: Typically involve simpler, more interpretable models (e.g., linear
regression, logistic regression) with a clear assumption about the data.
o Machine Learning Models: Can be much more complex (e.g., neural networks, ensemble
methods) and may not provide straightforward interpretations.
3. Assumptions:
o Statistical Modeling: Often relies on strict assumptions about data distribution (e.g., normality,
independence) and model structure.
o Machine Learning: More flexible in terms of assumptions and can handle various types of
data, including unstructured data like images and text.
4. Data Requirements and Validation:
o Statistical Models: Usually require smaller datasets and might focus on a single model. Validation often includes techniques like cross-validation or bootstrapping.
o Machine Learning: Generally thrives on large datasets and may employ more extensive
validation techniques, including hyperparameter tuning and ensemble methods.
5. Interpretability:
o Statistical Models: More interpretable, allowing for insights into how input variables affect
outputs.
o Machine Learning Models: Often viewed as "black boxes," making it difficult to interpret how
input features influence predictions, especially with deep learning.
Que. 3- Identify and explain three key differences between supervised and unsupervised
learning. Provide a real-world example for each type.
Ans.- Supervised and unsupervised learning are two fundamental approaches in machine learning, each serving
different purposes and requiring different types of data. Here are three key differences between the two:
1. Labeling of Data
• Supervised Learning: Involves training a model on a labeled dataset, meaning that each training
example comes with an associated output or target variable. The model learns to map inputs to outputs.
o Example: Spam Detection – An email filtering system is trained using a dataset of emails that
are labeled as "spam" or "not spam." The model learns to identify characteristics of spam emails
to classify new incoming emails.
• Unsupervised Learning: Involves training a model on an unlabeled dataset, where no output variable is provided. The model tries to identify patterns or groupings in the data without prior knowledge of the outcomes.
o Example: Customer Segmentation – A retailer groups customers based on purchasing behaviour without any predefined labels, letting natural groupings emerge from the data.
2. Objective
• Supervised Learning: The main objective is to make predictions or classifications based on new input
data. The model aims to minimize the difference between predicted outputs and actual outputs during
training.
o Example: Credit Scoring – A financial institution uses labeled historical data to predict
whether a new loan applicant is likely to default based on features like income, credit history,
and loan amount.
• Unsupervised Learning: The goal is to explore the data and find hidden structures or relationships.
There are no predefined labels or outcomes to predict.
o Example: Anomaly Detection – A network security system analyzes system logs to identify
unusual patterns that might indicate a security breach, without any prior labeling of what
constitutes "normal" behavior.
3. Output Type
• Supervised Learning: The output is typically a specific prediction or classification (e.g., a category label
or a continuous value).
o Example: House Price Prediction – A real estate model predicts the price of a house based on
various features (size, location, number of bedrooms), providing a numerical output (the
estimated price).
• Unsupervised Learning: The output can include clusters, associations, or reduced dimensions, often
resulting in a set of groupings or patterns rather than specific predictions.
o Example: Market Basket Analysis – A grocery store analyzes transaction data to find
associations between items purchased together (e.g., customers who buy bread often also buy
butter), which helps in promotional strategies.
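To illustrate the contrast in code, the following sketch applies a supervised classifier and an unsupervised clustering algorithm to the same synthetic features; the data and model choices are illustrative only, not part of the examples above.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 1, (50, 2)), rng.normal(4, 1, (50, 2))])
y = np.array([0] * 50 + [1] * 50)   # labels exist only in the supervised setting

clf = LogisticRegression().fit(X, y)   # supervised: learns a mapping from X to y
print(clf.predict([[3.5, 3.8]]))       # predicted class label for a new point

km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)  # unsupervised: groups X by similarity
print(km.labels_[:5])                  # cluster ids, with no predefined meaning
```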
Que. 4- Differentiate between linear and non-linear regression. Provide an example where
non-linear regression is more suitable than linear regression.
Ans.- Linear and non-linear regression are two types of regression analysis used to model the relationship
between a dependent variable and one or more independent variables. Here’s a breakdown of their differences and
an example where non-linear regression is more suitable.
Differences
1. Model Form:
o Linear Regression: Assumes a linear relationship between the independent variable(s) and the dependent variable. The model can be expressed as y = mx + b (for one variable) or in multivariable form as y = b0 + b1x1 + b2x2 + … + bnxn, where m and b (and, in the multivariable case, b0 … bn) are coefficients.
o Non-Linear Regression: Does not assume a linear relationship. The relationship can be
described using various non-linear functions, such as polynomial, exponential, logarithmic, or
sinusoidal forms.
2. Complexity:
o Linear Regression: Generally simpler to interpret and requires fewer computations. The results
can be easily visualized as a straight line in a 2D space.
o Non-Linear Regression: More complex, potentially leading to multiple local minima in the
optimization process. Interpretation can be less straightforward, depending on the function used.
3. Use Cases:
o Linear Regression: Best suited for scenarios where the relationship between variables is
approximately linear. Commonly used in cases like predicting sales based on advertising spend.
o Non-Linear Regression: More appropriate when the relationship between the variables is
inherently non-linear. It’s used in scenarios like growth rates, decay processes, and certain types
of physical phenomena.
Example Where Non-Linear Regression Is More Suitable
• Scenario: Modeling the growth of a bacterial population over time.
• Justification: Bacterial growth typically follows a logistic growth model, which is non-linear. In the early stages, growth is exponential, but as resources become limited, growth slows and eventually plateaus. This pattern cannot be accurately captured by a linear model. The logistic model is:
P(t) = K / (1 + ((K − P0) / P0) e^(−rt))
Where:
• P(t) is the population at time t,
• K is the carrying capacity,
• P0 is the initial population, and
• r is the growth rate.
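As a rough illustration, the logistic model above can be fitted with non-linear least squares; the sketch below uses synthetic data and made-up parameter values.

```python
import numpy as np
from scipy.optimize import curve_fit

def logistic(t, K, P0, r):
    """Logistic growth: P(t) = K / (1 + ((K - P0) / P0) * exp(-r * t))."""
    return K / (1 + ((K - P0) / P0) * np.exp(-r * t))

# Synthetic observations generated from known parameters plus noise (placeholder data).
t = np.arange(0, 10, 0.5)
P = logistic(t, K=1000, P0=10, r=1.2) + np.random.default_rng(0).normal(0, 20, t.size)

# Non-linear least squares needs initial guesses for K, P0 and r.
params, _ = curve_fit(logistic, t, P, p0=[800, 5, 1.0])
print(params)   # estimated carrying capacity, initial population, growth rate
```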
Using non-linear regression in this case allows for a more accurate representation of the growth pattern, capturing
the initial rapid growth followed by a slowdown as the population approaches its carrying capacity. Linear
regression would fail to fit this data properly, likely resulting in poor predictions and interpretations.
Que. 5- What are the key metrics used to evaluate a regression model? Explain how R-
squared, Mean Squared Error (MSE), and Root Mean Squared Error (RMSE) are
calculated.
Ans.- Evaluating a regression model involves several key metrics that help assess its performance and accuracy.
Here are three important metrics: R-squared, Mean Squared Error (MSE), and Root Mean Squared Error (RMSE),
along with their calculations:
1. R-squared (R²)
Definition: R-squared measures the proportion of variance in the dependent variable that can be explained by the independent variables in the model. It provides an indication of how well the model fits the data.
Calculation:
• Formula:
R² = 1 − (SS_res / SS_tot)
Where:
• SS_res is the residual sum of squares, the sum of squared differences between the actual and predicted values.
• SS_tot is the total sum of squares, the sum of squared differences between the actual values and their mean.
• Interpretation: R-squared values range from 0 to 1. A value closer to 1 indicates a better fit, meaning a
higher proportion of variance is explained by the model.
2. Mean Squared Error (MSE)
Definition: MSE measures the average of the squared differences between the observed actual outcomes and the predictions made by the model. It quantifies how close the predicted values are to the actual values.
Calculation:
• Formula:
MSE = (1/n) Σ (yi − ŷi)²
Where:
• n is the number of observations,
• yi is the actual value for observation i, and
• ŷi is the predicted value for observation i.
• Interpretation: A lower MSE indicates a better fit of the model, as it signifies smaller errors in
predictions. However, MSE is sensitive to outliers due to the squaring of errors.
3. Root Mean Squared Error (RMSE)
Definition: RMSE is the square root of the Mean Squared Error. It provides a measure of the average magnitude of the errors in a set of predictions, giving the errors the same units as the original data.
Calculation:
• Formula:
RMSE = √MSE = √((1/n) Σ (yi − ŷi)²)
• Interpretation: RMSE is often preferred over MSE because it is in the same units as the dependent
variable, making it easier to interpret. Like MSE, a lower RMSE indicates a better fit.
Summary
• R-squared indicates how much of the variance in the dependent variable the model explains.
• MSE quantifies the average squared prediction error, but is sensitive to outliers and expressed in squared units.
• RMSE gives a more interpretable measure of prediction error, providing the error in the same units as the original data.
These metrics together give a comprehensive view of a regression model's performance, allowing for better
assessment and comparison between models.
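For reference, a short sketch computing the three metrics on a toy set of actual and predicted values (the numbers are arbitrary):

```python
import numpy as np
from sklearn.metrics import mean_squared_error, r2_score

y_true = np.array([3.0, 5.0, 7.5, 10.0])
y_pred = np.array([2.8, 5.4, 7.0, 10.5])

mse = mean_squared_error(y_true, y_pred)   # average squared error
rmse = np.sqrt(mse)                        # same units as the target variable
r2 = r2_score(y_true, y_pred)              # proportion of variance explained

print(f"MSE={mse:.3f}  RMSE={rmse:.3f}  R^2={r2:.3f}")
```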
Que. 6- Discuss how logistic regression is used for binary classification tasks. Explain the
sigmoid function and how it transforms linear outputs into probabilities.
Ans.- Logistic regression is a widely used statistical method for binary classification tasks, where the goal is to
predict one of two possible outcomes based on one or more predictor variables. Here’s a detailed discussion of
how logistic regression works, along with an explanation of the sigmoid function and its role in transforming
linear outputs into probabilities.
1. Basic Idea:
o Logistic regression models the probability that a given input belongs to a particular class (e.g., 1 or 0, true or false). It does this by establishing a linear relationship between the input features and the log-odds of the probability of the positive class.
2. Mathematical Formulation:
log-odds(p) = log(p / (1 − p)) = β0 + β1x1 + β2x2 + … + βnxn
o Here, p is the probability of the positive class, the xi are the input features, and the βi are the model coefficients.
3. Prediction:
o The output from the linear combination of the input features (β0 + β1x1 + β2x2 + … + βnxn) is then transformed into a probability using the sigmoid function.
The sigmoid function is crucial in logistic regression as it converts the linear output into a probability value
between 0 and 1.
1. Mathematical Definition:
σ(z) = 1 / (1 + e^(−z))
o When the linear output z is passed into the sigmoid function, it outputs a value p (the predicted probability) that lies between 0 and 1.
o If p ≥ 0.5, the model predicts the positive class (e.g., class 1). If p < 0.5, it predicts the negative class (e.g., class 0).
2. Properties of the Sigmoid Function:
o S-shaped Curve: The sigmoid function has an S-shaped curve, which smoothly transitions from 0 to 1.
o Asymptotes: As z approaches negative infinity, the output approaches 0, and as z approaches positive infinity, the output approaches 1.
o Center Point: At z = 0, the output is 0.5, making it the decision boundary.
Suppose we want to predict whether a customer will buy a product (1) or not (0) based on features like age,
income, and previous purchase history. Using logistic regression:
1. We fit the model using historical data to estimate the coefficients (βi).
2. For a new customer, we compute the linear combination of their feature values using the estimated coefficients.
3. We apply the sigmoid function to this linear output to obtain the probability of purchase.
4. Based on the probability, we classify the customer into either the buying or non-buying category.
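A minimal sketch of this purchase-prediction example, using synthetic data (the feature values and the rule that generates the labels are invented):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(1)
age = rng.integers(18, 70, 200)
income = rng.normal(50, 15, 200)            # income in thousands (synthetic)
prev_purchases = rng.integers(0, 10, 200)
X = np.column_stack([age, income, prev_purchases])
y = (prev_purchases + rng.normal(0, 2, 200) > 5).astype(int)   # synthetic "bought" label

model = LogisticRegression(max_iter=1000).fit(X, y)

new_customer = [[35, 60, 6]]                 # age, income (thousands), previous purchases
p = model.predict_proba(new_customer)[0, 1]  # sigmoid applied to the linear combination
print(f"P(buy) = {p:.2f} ->", "buy" if p >= 0.5 else "no buy")
```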
Que. 7- Explain the difference between simple regression and multiple regression. How
does the number of predictor variables influence the complexity of the model?
Ans.- Simple regression and multiple regression are both techniques used to analyze the relationship between a
dependent variable and one or more independent variables. Here’s a breakdown of their differences and the impact
of the number of predictor variables on model complexity.
1. Simple Regression
Definition: Simple regression involves a single independent variable (predictor) used to predict a dependent
variable (response). It establishes a linear relationship between the two.
• Mathematical Form: The equation for simple linear regression can be expressed as: y = β0 + β1x + ε
Where:
o y is the dependent variable, x is the independent variable, β0 is the intercept, β1 is the slope, and ε is the error term.
2. Multiple Regression
Definition: Multiple regression involves two or more independent variables used to predict a dependent variable.
It models the relationship between the dependent variable and multiple predictors simultaneously.
• Mathematical Form: The equation for multiple linear regression can be expressed as: y = β0 + β1x1 + β2x2 + … + βnxn + ε
Where:
o β1, β2, …, βn are the coefficients for each predictor, x1, x2, …, xn are the predictors, β0 is the intercept, and ε is the error term.
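A brief, hypothetical sketch fitting both forms on synthetic data (the coefficients used to generate the data are arbitrary):

```python
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(2)
x1 = rng.uniform(0, 10, 100)
x2 = rng.uniform(0, 5, 100)
y = 3.0 + 2.0 * x1 - 1.5 * x2 + rng.normal(0, 1, 100)   # synthetic data

simple = LinearRegression().fit(x1.reshape(-1, 1), y)             # one predictor
multiple = LinearRegression().fit(np.column_stack([x1, x2]), y)   # two predictors

print(simple.intercept_, simple.coef_)      # estimates of β0 and β1
print(multiple.intercept_, multiple.coef_)  # estimates of β0 and [β1, β2]
```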
Key Differences
1. Number of Predictors:
o Simple Regression: Uses exactly one independent variable.
o Multiple Regression: Uses two or more independent variables.
2. Complexity:
o Simple Regression: Generally simpler and easier to interpret. The relationship is represented
by a single line.
o Multiple Regression: More complex due to the inclusion of multiple predictors. The
relationship is represented in a multidimensional space (e.g., a plane in three dimensions or a
hyperplane in higher dimensions).
3. Interpretation:
o Simple Regression: The slope directly indicates the change in the dependent variable for a one-
unit change in the predictor.
o Multiple Regression: Each coefficient represents the change in the dependent variable for a
one-unit change in the respective predictor, holding other predictors constant (partial effect).
How the Number of Predictor Variables Influences Model Complexity
1. Dimensionality:
o Increasing the number of predictors adds dimensions to the model. While this can allow for a
more nuanced representation of the data, it can also make the model harder to visualize and
interpret.
2. Interactions:
o With more predictors, the potential for interactions (where the effect of one predictor on the dependent variable depends on another predictor) increases. This can add to the complexity and may require additional modeling strategies.
3. Overfitting:
o As the number of predictor variables increases, there is a risk of overfitting, where the model
captures noise in the training data rather than the underlying relationship. This can lead to poor
generalization to new data.
4. Multicollinearity:
o Multiple predictors can introduce multicollinearity, where predictors are highly correlated with
each other. This can make it difficult to estimate the coefficients accurately and interpret their
individual effects.
Que. 8- Explain how decision trees are used in classification tasks. How do decision trees
handle both categorical and continuous features?
Ans.- Decision trees are a popular method for classification tasks due to their intuitive structure and ease of
interpretation. Here's how they work and how they handle different types of features:
A decision tree consists of nodes that represent features, branches that represent decision rules, and leaves that
represent outcomes (class labels). The tree is built through a process called recursive partitioning, where the data
is split into subsets based on the values of the features.
1. Selecting the Best Feature: At each node, the algorithm selects the feature that best separates the data
into distinct classes. This is typically done using metrics like:
o Gini impurity: Measures the impurity of a node, where lower values indicate purer nodes.
o Information gain (entropy): Measures the reduction in entropy achieved by a split, where higher values indicate a more informative split.
2. Splitting the Data: Based on the chosen feature, the dataset is split into subsets. The process is repeated
recursively for each subset, creating new nodes until a stopping criterion is met (like a maximum tree
depth, a minimum number of samples in a node, or a node purity threshold).
Categorical Features
For categorical features, decision trees split the data based on the distinct categories. For example, if a feature
represents "color" with values {red, blue, green}, the tree might create branches for each color. The decision at
that node would classify samples according to the chosen category.
Continuous Features
Continuous features, such as age or salary, are handled by selecting a threshold value to create binary splits. For
instance, if the feature is "age," the algorithm might decide to split at age 30, creating two branches: one for ages
less than or equal to 30 and another for ages greater than 30. This allows decision trees to create flexible decision
boundaries.
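As a small illustrative sketch (the colour/age values and labels are invented), a scikit-learn decision tree can consume a one-hot-encoded categorical feature alongside a raw continuous feature:

```python
import pandas as pd
from sklearn.tree import DecisionTreeClassifier

df = pd.DataFrame({
    "color": ["red", "blue", "green", "red", "blue", "green"],
    "age":   [22, 35, 41, 19, 52, 33],
    "label": [0, 1, 1, 0, 1, 0],
})

# The categorical feature is one-hot encoded; the continuous feature is used as-is,
# and the tree learns threshold splits on it (e.g., age <= 30).
X = pd.get_dummies(df[["color", "age"]], columns=["color"])
y = df["label"]

tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, y)
print(tree.predict(X.iloc[[0]]))
```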
Advantages:
• Easy to interpret and visualize, since the decision rules can be read directly from the tree.
• Handle both categorical and continuous features with little data preprocessing.
Limitations:
• Sensitive to small changes in the data, which can lead to different tree structures.
• Prone to overfitting when grown very deep, unless pruning or depth limits are applied.
Que. 9- Explain the main difference between supervised learning and clustering. What
are the goals of clustering, and how is it different from classification?
Ans.- The main difference between supervised learning and clustering lies in the type of data they use and their
goals.
Supervised Learning
Definition: In supervised learning, the model is trained on a labeled dataset, where each input data point is paired
with a corresponding output label.
Goal: The primary goal is to learn a mapping from inputs to outputs, enabling the model to predict labels for new,
unseen data. Common tasks include classification (assigning labels to discrete classes) and regression (predicting
continuous values).
Clustering
Definition: Clustering, on the other hand, is an unsupervised learning technique. It involves grouping a set of data
points into clusters based on similarity, without any labeled outputs.
Goal: The main goal of clustering is to discover inherent structures in the data. It aims to group similar data points
together while keeping different groups distinct. Clustering helps identify patterns, trends, or natural groupings
within the dataset.
Key Differences
1. Data Type:
o Supervised Learning: Requires labeled data (inputs paired with known outputs).
o Clustering: Works on unlabeled data, using only the input features.
2. Output:
o Supervised Learning: Produces predictions of known labels or continuous values for new inputs.
o Clustering: Produces groupings of similar data points, with no predefined meaning attached to each group.
3. Use Cases:
o Supervised Learning: Used for tasks like spam detection, image recognition, and sales
forecasting.
o Clustering: Used for market segmentation, customer grouping, and anomaly detection.
Que. 10- Describe the three types of clustering approaches: Partition-based clustering,
Hierarchical clustering, and Density-based clustering. Provide one real-world application
where each approach is useful.
Ans.- Clustering approaches can be broadly categorized into three main types: partition-based clustering,
hierarchical clustering, and density-based clustering. Each approach has its own methodology and is suited for
different types of data and applications.
1. Partition-based Clustering
Description: Partition-based clustering divides the dataset into distinct clusters, where each data point belongs to
one cluster. The most common method is K-means clustering, which aims to minimize the variance within each
cluster.
• Process:
o Choose the number of clusters k and initialize k centroids.
o Assign each data point to its nearest centroid.
o Recompute each centroid as the mean of the points assigned to it.
o Repeat the assignment and update steps until the assignments no longer change.
Real-world Application: Customer Segmentation: Businesses often use K-means clustering to segment
customers into groups based on purchasing behavior, helping to tailor marketing strategies for different customer
profiles.
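A minimal K-means sketch on two synthetic behavioural features (annual spend and visit frequency); the cluster count and data are illustrative only:

```python
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(3)
spend = np.concatenate([rng.normal(200, 30, 40), rng.normal(800, 100, 40)])
visits = np.concatenate([rng.normal(2, 1, 40), rng.normal(12, 3, 40)])
X = np.column_stack([spend, visits])   # one row per customer

kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print(kmeans.cluster_centers_)   # one centroid per customer segment
print(kmeans.labels_[:5])        # segment assigned to each customer
```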
2. Hierarchical Clustering
Description: Hierarchical clustering creates a tree-like structure (dendrogram) to represent data points and their
relationships. It can be agglomerative (bottom-up) or divisive (top-down).
• Process:
o Agglomerative: Start with each data point as its own cluster and iteratively merge the closest
clusters.
o Divisive: Start with all data points in a single cluster and recursively split them.
Real-world Application: Taxonomy Creation: In biology, hierarchical clustering is often used to classify species
based on genetic similarities, helping researchers understand evolutionary relationships.
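A brief sketch of agglomerative (bottom-up) clustering with SciPy; the random feature matrix stands in for real similarity data:

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

X = np.random.default_rng(4).normal(size=(10, 3))   # 10 samples, 3 features (stand-in data)

Z = linkage(X, method="ward")                     # agglomerative (bottom-up) merge tree
labels = fcluster(Z, t=3, criterion="maxclust")   # cut the dendrogram into 3 clusters
print(labels)
```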
3. Density-based Clustering
Description: Density-based clustering identifies clusters based on the density of data points in a region. The most
notable algorithm is DBSCAN (Density-Based Spatial Clustering of Applications with Noise).
• Process:
o Points with at least a minimum number of neighbours (min_samples) within a given radius (eps) are treated as core points of dense regions.
o Core points and the points reachable from them are grouped into clusters.
o Points that lie in low-density regions are labelled as noise (outliers).
Real-world Application: Anomaly Detection: In fraud detection for financial transactions, density-based
clustering can help identify unusual patterns or outliers that deviate from typical transaction behavior, signaling
potential fraudulent activities.
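A minimal DBSCAN sketch flagging outliers among synthetic transaction features; the eps and min_samples values are illustrative and would need tuning on real data:

```python
import numpy as np
from sklearn.cluster import DBSCAN

rng = np.random.default_rng(5)
normal_txns = rng.normal(loc=[50, 1], scale=[10, 0.5], size=(200, 2))  # amount, hours since last txn
outliers = np.array([[500.0, 0.01], [450.0, 0.02]])                    # unusually large, rapid transactions
X = np.vstack([normal_txns, outliers])

db = DBSCAN(eps=5, min_samples=5).fit(X)
print(np.where(db.labels_ == -1)[0])   # indices labelled as noise, i.e. potential anomalies
```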