0% found this document useful (0 votes)

24 views5 pages

Unit 1 Machine Learning - PDF Lands

Uploaded by

sahugungun76

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

24 views5 pages

Unit 1 Machine Learning - PDF Lands

Uploaded by

sahugungun76

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

Machine Learning Machine Learning

Unit-1 Natural Language Processing (NLP):

ML is employed in NLP applications such as language translation, sentiment analysis,

What is Machine Learning? and chatbots to enable computers to understand, interpret, and generate human-like text.

Machine Learning (ML) is a subfield of artificial intelligence (AI) that focuses on the Healthcare:
development of algorithms and models that enable computers to learn from data and
make predictions or decisions without being explicitly programmed. The key idea behind ML is used for medical diagnosis, predicting patient outcomes, and personalized
machine learning is to allow systems to automatically learn and improve from treatment plans. It can analyze large datasets to identify patterns that might be
experience. challenging for humans to discern.

There are three main types of machine learning:

Finance:
Supervised Learning:
ML is utilized for fraud detection, credit scoring, and stock market predictions.
The algorithm is trained on a labeled dataset, where the input data is paired with the Algorithms analyze financial data to make predictions and optimize decision-making
corresponding correct output. The model learns to map inputs to outputs. processes.

Unsupervised Learning: Autonomous Vehicles:

The algorithm is given data without explicit instructions on what to do with it. The ML algorithms play a crucial role in enabling self-driving cars to perceive their
system tries to learn the patterns and the structure from the data without labeled outputs. surroundings, make decisions, and navigate safely.

Reinforcement Learning: Recommendation Systems:

The algorithm learns by interacting with an environment. It receives feedback in the form ML is behind recommendation engines in platforms like Netflix, Amazon, and Spotify.
of rewards or penalties as it navigates through a problem space, allowing it to learn the These systems analyze user behavior to suggest products, movies, or music tailored to
optimal behavior. individual preferences.

Applications of Machine Learning: Classification of Machine Learning:

Machine learning algorithms can be classified based on various criteria. Here are two
Machine learning has found applications in various domains. Here are some notable primary classifications:
examples:
By Learning Style:
Image and Speech Recognition:
Supervised Learning: The algorithm is trained on labeled data.
Machine learning is used in systems that can recognize and understand images and
Unsupervised Learning: The algorithm learns from unlabeled data.
speech. This is evident in applications like facial recognition, voice assistants, and image
Reinforcement Learning: The algorithm learns by interacting with an environment and
classification.
receiving feedback.

By Task:
Machine Learning Machine Learning

Train the Model:

Classification: Predicting the category or class to which a new data point belongs.
Regression: Predicting a continuous value. Use the training set to train your machine learning model. The model learns the patterns
Clustering: Grouping similar data points based on their characteristics. in the data and adjusts its parameters to make accurate predictions.
Dimensionality Reduction: Reducing the number of features in a dataset while
preserving its essential information. Evaluate the Model:

Assess the model's performance using the testing set. Common evaluation metrics vary
Developing a machine learning model based on the type of problem (e.g., accuracy, precision, recall, F1-score for classification;
mean squared error for regression). Choose metrics relevant to your specific problem.

Developing a machine learning model involves several key steps. Here's a step-by-step Hyperparameter Tuning:
procedure that we follow:
Fine-tune the hyperparameters of your model to improve its performance. This may
Define the Problem: involve using techniques like grid search or random search.

Clearly define the problem you want to solve. Understand the goal and the expected Validation and Cross-Validation:
output of the machine learning model.
Implement cross-validation techniques to ensure that the model's performance is
Gather Data: consistent across different subsets of the data. This helps in detecting overfitting or
underfitting.
Collect relevant data for your problem. Ensure that your dataset is representative,
comprehensive, and free from biases. Consider the quality and quantity of data available. Model Interpretation (Optional):

Data Exploration and Preprocessing: Depending on the type of model used, try to interpret the results. Some models, like
decision trees or linear regression, offer insights into feature importance.
Explore the dataset to understand its characteristics, identify missing values, outliers, and
potential features. Handle missing data and outliers appropriately. Deploy the Model (if applicable):
Convert categorical variables into a suitable format if needed (e.g., one-hot encoding).
Split the dataset into training and testing sets. If the model performs satisfactorily, deploy it to a production environment. Ensure that
the deployment process is seamless and monitored.
Feature Engineering:
Monitor and Maintain:
Create new features or transform existing ones to enhance the model's performance.
Feature engineering involves selecting, modifying, or creating features that can improve Regularly monitor the model's performance in a real-world setting. Retrain the model
the model's ability to make accurate predictions. periodically with new data to maintain its accuracy over time.

Select a Model: Document the Process:

Choose a machine learning algorithm that is suitable for your problem. The choice of Keep comprehensive documentation of the entire development process, including data
algorithm depends on the nature of the problem (classification, regression, clustering) and sources, preprocessing steps, model selection, and evaluation metrics. This
the characteristics of your data. documentation is valuable for reproducibility and future reference.
Machine Learning Machine Learning

# Generate example data

Iterate and Improve: np.random.seed(42)
X = 2 * np.random.rand(100, 1)
Machine learning is an iterative process. Use feedback from the model's performance in a y = 4 + 3 * X + np.random.randn(100, 1)
real-world setting to make improvements. Revisit any of the previous steps if necessary.
# Split the data into training and testing sets
By following these steps, you can systematically develop and deploy a machine learning X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
model for various applications. Keep in mind that the specific details of each step may
vary depending on the nature of your problem and the characteristics of your data. # Create and train a linear regression model
model = LinearRegression()
Linear regression model.fit(X_train, y_train)

Linear regression is a statistical method used in machine learning to model the # Make predictions on the test set
relationship between a dependent variable (target) and one or more independent variables y_pred = model.predict(X_test)
(features). The goal is to find the best-fit line that minimizes the difference between the
predicted and actual values of the dependent variable. This line is called the "regression # Plot the training data and the regression line
line" or "best-fit line." plt.scatter(X_train, y_train, color='blue', label='Training Data')
plt.scatter(X_test, y_test, color='red', label='Test Data')
The equation for a simple linear regression with one independent variable can be plt.plot(X_test, y_pred, color='green', linewidth=3, label='Regression Line')
represented as: plt.xlabel('Independent Variable (X)')
plt.ylabel('Dependent Variable (y)')
y=mx+b plt.title('Linear Regression Example')
plt.legend()
Here: plt.show()
y is the dependent variable (target),
In this example:
x is the independent variable (feature),
We generate random data points using a linear equation with some added noise.
m is the slope of the line, Split the data into training and testing sets.
Create a Linear Regression model using scikit-learn.
b is the y-intercept. Train the model on the training data.
Make predictions on the test data and plot the regression line.
In a machine learning context, the values of m and b are learned from the training data to
make accurate predictions on new, unseen data. The green line in the plot represents the learned regression line. The model aims to
minimize the difference between the predicted and actual values, optimizing the
Let's go through a simple example using Python and the popular library, scikit-learn: parameters m and b to best fit the training data. This learned line can then be used to
make predictions on new data.
# Import necessary libraries
import numpy as np Cost function
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression In linear regression, the cost function, also known as the loss function or error function, is
import matplotlib.pyplot as plt a measure of how well the model's predictions match the actual target values. The goal of
Machine Learning Machine Learning

linear regression is to find the best-fitting line (or hyperplane in higher dimensions) that Compute Gradient: At each iteration, you compute the gradient of the cost function with
minimizes this cost function. The most commonly used cost function in linear regression respect to each parameter. The gradient indicates the direction of steepest increase of the
is the Mean Squared Error (MSE) function. cost function. In the case of linear regression with MSE, the gradient with respect to each
parameter (slope and intercept) can be computed analytically using calculus.
Here's how the cost function works in linear regression:
Update Parameters: Once you have the gradient, you update the parameters by taking a
Mean Squared Error (MSE): The MSE is calculated by taking the average of the small step (determined by a parameter called the learning rate) in the opposite direction
squared differences between the predicted values and the actual target values. of the gradient. This step is performed to minimize the cost function. The update rule for
Mathematically, for a dataset with each parameter

MSE=m1∑i=1m(yi−y^i)2 θ:=θ−α⋅∇J(θ)
α is the learning rate, a hyperparameter that determines the size of the step taken during
The goal is to minimize this value, which means finding the parameters (slope and each iteration.
intercept in simple linear regression) that result in the smallest MSE.
Where:
Optimization: To find the parameters that minimize the cost function (MSE),  α is the learning rate, a hyperparameter that determines the size of the
optimization algorithms such as Gradient Descent are commonly used. Gradient Descent
step taken during each iteration.
iteratively adjusts the parameters in the direction that reduces the cost function until
 ∇()∇J(θ) is the gradient of the cost function J with respect to the parameter
convergence is reached, i.e., until further adjustments do not significantly decrease the
cost. θ.

Other Cost Functions: While MSE is the most commonly used cost function in linear Repeat: Steps 2 and 3 are repeated iteratively until convergence is reached. Convergence
regression, other cost functions such as Mean Absolute Error (MAE) or Huber loss can is typically determined when the change in the cost function between iterations is very
also be used depending on the specific requirements of the problem. small, or when a maximum number of iterations is reached.

Overall, the cost function in linear regression serves as a quantitative measure of how Gradient Descent allows the linear regression model to iteratively adjust its parameters in
well the model is performing, and the aim is to minimize this cost to obtain the best- the direction that minimizes the cost function, eventually leading to optimal parameter
fitting line or hyperplane. values that result in the best-fitting line or hyperplane for the given dataset.

Gradient Descent What is logistic regression in machine learning

Gradient Descent is an optimization algorithm commonly used in machine learning to Logistic regression is a statistical method used for binary classification tasks in machine
minimize a cost function. In the context of linear regression, Gradient Descent is used to learning. Despite its name, logistic regression is actually a classification algorithm rather
find the optimal parameters (coefficients) for the linear model that minimize the cost than a regression algorithm. It is used to predict the probability that a given input belongs
function, such as the Mean Squared Error (MSE). to a particular class, typically represented as either 0 or 1.

Here's how Gradient Descent works in the context of linear regression: Here's a brief overview of logistic regression:

Initialization: First, you start by initializing the parameters (coefficients) of the linear
regression model with some random values or zeros. 1. Binary Classification: Logistic regression is used for binary classification tasks
where the target variable has only two possible outcomes (classes), usually
Machine Learning Machine Learning

represented as 0 and 1. For example, predicting whether an email is spam (1) or Gaussian function
not spam (0), or whether a patient has a disease (1) or not (0).
2. Sigmoid Function: Logistic regression uses the logistic function, also known as In machine learning, the Gaussian function often refers to the Gaussian distribution, also
the sigmoid function, to model the probability that a given input belongs to the known as the normal distribution. It's a type of probability distribution that is commonly
positive class. The sigmoid function is an S-shaped curve that maps any real- used in various statistical models and machine learning algorithms due to its
valued number to the range [0, 1]. The logistic function is defined as: mathematical properties and prevalence in natural phenomena.
σ(z)=1+e−z1 The Gaussian function is defined by its probability density function (PDF), which takes
where z is a linear combination of the input features and model parameters. the form:

f(x∣μ,σ2)=2πσ21exp(−2σ2(x−μ)2)
3. Linear Combination: Similar to linear regression, logistic regression models the
relationship between the input features �X and the target variable �y using a
linear combination of the features: Where:

z=β0+β1x1+β2x2+…+βnxn  x is the random variable.

where 0,1,…, β0,β1,…,βn are the coefficients (parameters) to be learned, and 1,
 μ is the mean (average) of the distribution.
2,…, x1,x2,…,xn are the input features.
 σ2 is the variance, which measures the spread of the distribution.

4. Probability Prediction: After computing the linear combination z, logistic The Gaussian function describes a symmetric "bell-shaped" curve centered around the
regression applies the sigmoid function to obtain the predicted probability that the mean μ. The spread of the curve is determined by the variance σ2, where larger values of
input belongs to the positive class: σ2 result in wider curves, and smaller values result in narrower curves.
In machine learning, the Gaussian function is often used in various contexts, including:
1. Probability Density Estimation: Gaussian distributions are frequently used to
p^=σ(z)=1+e−z1
model the underlying probability distributions of data, especially when the data is
continuous and assumes a symmetric, bell-shaped form.
The predicted probability ^p^ can then be thresholded at 0.5 (or any other threshold) to
2. Gaussian Mixture Models (GMMs): GMMs are probabilistic models that represent
make binary predictions.
the distribution of data as a mixture of several Gaussian distributions. They are
commonly used for clustering and density estimation tasks.
3. Kernel Density Estimation (KDE): KDE is a non-parametric method used for
5. Model Training: Logistic regression is typically trained using optimization
estimating the probability density function of a random variable. Gaussian kernels
algorithms such as gradient descent to find the optimal values of the coefficients
are often employed in KDE to smooth the estimated density function.
0,1,…,β0,β1,…,βn that minimize a loss function, such as binary cross-entropy
4. Bayesian Inference: Gaussian distributions are often used as prior distributions in
loss, which measures the difference between the predicted probabilities and the
Bayesian inference due to their mathematical tractability and conjugate properties
actual class labels.
with certain likelihood functions.
Overall, the Gaussian function plays a fundamental role in machine learning, providing a
Overall, logistic regression is a simple yet powerful algorithm for binary classification
mathematical framework for understanding and modeling uncertainty in data, as well as
tasks, particularly when the relationship between the input features and the target variable
forming the basis for many algorithms and statistical techniques.
is linear or can be reasonably approximated as such.

Big-Data Unit-3
100% (1)
Big-Data Unit-3
54 pages
Machine Learning Notes
100% (10)
Machine Learning Notes
19 pages
Lecture2-Supervised-Learning Slides
No ratings yet
Lecture2-Supervised-Learning Slides
56 pages
UNIT 1 All Notes
No ratings yet
UNIT 1 All Notes
24 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
19 pages
Introduction To Machine Learning PPT Main
100% (1)
Introduction To Machine Learning PPT Main
15 pages
William R. Bell, Scott H. Holan, Tucker S. McElroy - Economic Time Series - Modeling and Seasonality-Chapman and Hall - CRC (2012)
100% (1)
William R. Bell, Scott H. Holan, Tucker S. McElroy - Economic Time Series - Modeling and Seasonality-Chapman and Hall - CRC (2012)
544 pages
Project
No ratings yet
Project
12 pages
Machine Learning Practical File
No ratings yet
Machine Learning Practical File
41 pages
ML 1 PPT Unit 1
No ratings yet
ML 1 PPT Unit 1
93 pages
Untitled
No ratings yet
Untitled
11 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
19 pages
14-004-1 Machine Learning
No ratings yet
14-004-1 Machine Learning
10 pages
Unit Iii Supervised Learning
No ratings yet
Unit Iii Supervised Learning
67 pages
A To Z of Machine Learning by Rashvandh
No ratings yet
A To Z of Machine Learning by Rashvandh
34 pages
ML Notes UT-1
No ratings yet
ML Notes UT-1
21 pages
Machine Learning QB
No ratings yet
Machine Learning QB
15 pages
Machine Learning
No ratings yet
Machine Learning
54 pages
Handbook of Applied Econometrics & Statistical Inference
No ratings yet
Handbook of Applied Econometrics & Statistical Inference
718 pages
Lecture 17&18 - Introduction To Machine Learning
No ratings yet
Lecture 17&18 - Introduction To Machine Learning
51 pages
Estimation and Detection: Lecture 1: Introduction and Minimum Variance Unbiased Estimators
No ratings yet
Estimation and Detection: Lecture 1: Introduction and Minimum Variance Unbiased Estimators
17 pages
ML Unit-1
No ratings yet
ML Unit-1
28 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
13 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
17 pages
Chapter 4 - Machine Learning
No ratings yet
Chapter 4 - Machine Learning
81 pages
Data Science Vijay1
No ratings yet
Data Science Vijay1
88 pages
Linear Regression For ML Ass
No ratings yet
Linear Regression For ML Ass
99 pages
BMIS360-AnswerKeys Revision Sheet
No ratings yet
BMIS360-AnswerKeys Revision Sheet
13 pages
Machine Learning
No ratings yet
Machine Learning
12 pages
Machine Learning (Chapter1)
No ratings yet
Machine Learning (Chapter1)
8 pages
Unit 1 Machine Learning
No ratings yet
Unit 1 Machine Learning
10 pages
Unit 3
No ratings yet
Unit 3
13 pages
MCA - ML Question Bank Answer
No ratings yet
MCA - ML Question Bank Answer
139 pages
INTRODUCTION
No ratings yet
INTRODUCTION
51 pages
Machine Learning Career Roadmap
No ratings yet
Machine Learning Career Roadmap
17 pages
Literature Survey in Normalised Adaptive Channel Equalizer For MIMO-OFDM
No ratings yet
Literature Survey in Normalised Adaptive Channel Equalizer For MIMO-OFDM
6 pages
Machine Learning Models
No ratings yet
Machine Learning Models
11 pages
RADAR: An In-Building RF-based User Location and Tracking System
No ratings yet
RADAR: An In-Building RF-based User Location and Tracking System
10 pages
Fundamentals of Machine Learning II
No ratings yet
Fundamentals of Machine Learning II
13 pages
Agronomy MCQs
100% (1)
Agronomy MCQs
198 pages
BIOSTATISTICS Mcqs
100% (2)
BIOSTATISTICS Mcqs
13 pages
Machine Learning Basics
No ratings yet
Machine Learning Basics
9 pages
Group Assignment Final PDF
100% (1)
Group Assignment Final PDF
13 pages
ML Unit 1
No ratings yet
ML Unit 1
21 pages
Machine Learning For Data Science Unit-4
No ratings yet
Machine Learning For Data Science Unit-4
16 pages
MAchine Learning
No ratings yet
MAchine Learning
10 pages
Prediction of Construction Project Performance Using Regression Analysis and Artificial Neural Network
No ratings yet
Prediction of Construction Project Performance Using Regression Analysis and Artificial Neural Network
8 pages
Chopra4 Tif
No ratings yet
Chopra4 Tif
18 pages
Classification and Clustering: CS109/Stat121/AC209/E-109 Data Science
No ratings yet
Classification and Clustering: CS109/Stat121/AC209/E-109 Data Science
28 pages
Machine Learning
No ratings yet
Machine Learning
51 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
20 pages
Introduction To Machine Learning Notes
No ratings yet
Introduction To Machine Learning Notes
26 pages
CS601 - Machine Learning - Unit 1 - Notes - 1672759748
No ratings yet
CS601 - Machine Learning - Unit 1 - Notes - 1672759748
13 pages
ML Unit-2
No ratings yet
ML Unit-2
17 pages
Multiple Regression
No ratings yet
Multiple Regression
127 pages
Part 2 Introduction To ML
No ratings yet
Part 2 Introduction To ML
13 pages
Barrier Option Pricing: Degree Project in Mathematics, First Level
No ratings yet
Barrier Option Pricing: Degree Project in Mathematics, First Level
38 pages
Aiml Unit 3
No ratings yet
Aiml Unit 3
17 pages
Project: Advisor Dr. Sanaa El Touny (Spring 2024) Group 3
No ratings yet
Project: Advisor Dr. Sanaa El Touny (Spring 2024) Group 3
7 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
13 pages
Module - 1
No ratings yet
Module - 1
9 pages
Machine Learning
No ratings yet
Machine Learning
24 pages
Machinelearning Unit1
No ratings yet
Machinelearning Unit1
9 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
8 pages
Basic of Machine Learning
No ratings yet
Basic of Machine Learning
7 pages
ML Unit 1
No ratings yet
ML Unit 1
9 pages
Panion PDF
No ratings yet
Panion PDF
154 pages
Machine Learning With Python
No ratings yet
Machine Learning With Python
6 pages
Chapter 01 Machine Learning
No ratings yet
Chapter 01 Machine Learning
22 pages
Elements On Estimation Theory 04nvp04de13
No ratings yet
Elements On Estimation Theory 04nvp04de13
52 pages
Week 4 Lecture Slides BUS265 2023
No ratings yet
Week 4 Lecture Slides BUS265 2023
45 pages
Analisis Perhitungan Metode Interpolasi
No ratings yet
Analisis Perhitungan Metode Interpolasi
7 pages
Lecture 2 Unit 1
No ratings yet
Lecture 2 Unit 1
60 pages
Regno: 19mis1106 Name: Sam Melvin M Course Code - Swe4012 - Machine Learning Lab Slot: L11+L12 Faculty: Dr. M. Premalatha
No ratings yet
Regno: 19mis1106 Name: Sam Melvin M Course Code - Swe4012 - Machine Learning Lab Slot: L11+L12 Faculty: Dr. M. Premalatha
37 pages
Machine Learning Assignment-1
No ratings yet
Machine Learning Assignment-1
7 pages
Decap776 P 1
No ratings yet
Decap776 P 1
6 pages
ML - Part - A
No ratings yet
ML - Part - A
10 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
4 pages
URECA Final Research Paper Goh Kai Leong U1840005E
No ratings yet
URECA Final Research Paper Goh Kai Leong U1840005E
10 pages
ML Unit 1'
No ratings yet
ML Unit 1'
13 pages
Determinants of Body Image in Perimenopausal and Postmenopausal Women
No ratings yet
Determinants of Body Image in Perimenopausal and Postmenopausal Women
16 pages
ML 5th
No ratings yet
ML 5th
8 pages
ML Unit1 (HKB)
No ratings yet
ML Unit1 (HKB)
7 pages
PDF 3
No ratings yet
PDF 3
14 pages
Unit 4 Agile
No ratings yet
Unit 4 Agile
7 pages
Feature and Feature Extractionlect2
No ratings yet
Feature and Feature Extractionlect2
28 pages
Air Pollution
No ratings yet
Air Pollution
20 pages
Lectr 16
No ratings yet
Lectr 16
33 pages
Cbsyllabus Bda 1
No ratings yet
Cbsyllabus Bda 1
4 pages
Statistical Method For On-Line Voltage Collapse Proximity Estimation
No ratings yet
Statistical Method For On-Line Voltage Collapse Proximity Estimation
8 pages
Introduction To Ai
No ratings yet
Introduction To Ai
3 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
9 pages
JCSSP 2025 817 826
No ratings yet
JCSSP 2025 817 826
10 pages
Presenttion 33
No ratings yet
Presenttion 33
2 pages
2 Simple Linear Regression
No ratings yet
2 Simple Linear Regression
22 pages
Machine Learning with Python: Foundations and Applications: ML, #1
From Everand
Machine Learning with Python: Foundations and Applications: ML, #1
Mohammed Nurudeen
No ratings yet

Unit 1 Machine Learning - PDF Lands

Uploaded by

Unit 1 Machine Learning - PDF Lands

Uploaded by

Machine Learning Machine Learning

Unit-1 Natural Language Processing (NLP):

ML is employed in NLP applications such as language translation, sentiment analysis,

There are three main types of machine learning:

Unsupervised Learning: Autonomous Vehicles:

Reinforcement Learning: Recommendation Systems:

Applications of Machine Learning: Classification of Machine Learning:

Train the Model:

Select a Model: Document the Process:

# Generate example data

Gradient Descent What is logistic regression in machine learning

z=β0+β1x1+β2x2+…+βnxn  x is the random variable.

You might also like