0% found this document useful (0 votes)

1 views14 pages

ML 01 (Shubham)

This document serves as a practical file for a Machine Learning course, detailing the fundamentals of machine learning, including its types (supervised, unsupervised, reinforcement), applications, and regression techniques. It explains regression as a supervised learning method for predicting continuous values, outlines various regression types, and provides a Python script for data handling and analysis. The document also highlights the importance of regression in various domains such as finance, healthcare, and marketing.

Uploaded by

pranav1256kam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

1 views14 pages

ML 01 (Shubham)

Uploaded by

pranav1256kam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

Practical File

Machine Learing
B.E. (IT) 6th Semester

Submitted To: Submitted by:

Dr.Mandeep Kaur ShubhamKumar

UE228093

IT Section-2

Department of Information Technology

University Institute of Engineering and Technology

Panjab University Chandigarh

Practical Number - 01
Introduction to Machine Learing
Machine Learning (ML) is a branch of artificial intelligence (AI) that enables
computers to learn from data and make decisions or predictions without being
explicitly programmed. Instead of following a set of predefined rules, machine
learning models identify patterns and improve their performance over time based on
experience.

Types of Machine Learning

1.Supervised Learning – The model is trained on labeled data, meaning it learns
from input-output pairs. Examples include:

· Classification: Predicting categories (e.g., spam detection in emails).

· Regression: Predicting continuous values (e.g., stock price prediction).
2.Unsupervised Learning – The model is given data without labeled outputs and
must find hidden patterns. Examples include:

· Clustering: Grouping similar data points (e.g., customer segmentation).

· Dimensionality Reduction: Simplifying data while preserving important
features (e.g., principal component analysis).

3.Reinforcement Learning – The model learns by interacting with an environment

and receiving rewards or penalties for actions. It is widely used in robotics and
gaming.

Applications of Machine Learning

· Healthcare: Disease prediction, medical image analysis.

· Finance: Fraud detection, algorithmic trading.
· E-commerce: Personalized recommendations.
· Autonomous Vehicles: Self-driving cars use ML for object recognition.
· Natural Language Processing (NLP): Chatbots, language translation.
What is Regression?
Regression is a type of supervised learning used to predict continuous numerical
values based on input data. It identifies relationships between independent variables
(features) and a dependent variable (target) to make predictions.

In machine learning regression, independent and dependent variables are key

concepts:

· IndependentVariable (Predictor, Feature, Input Variable):

These are the variables that you use to predict the outcome. They are the inputs
to the model.
Example: In predicting house prices, features like square footage, number of
bedrooms, and location are independent variables.
· DependentVariable (Response,Target,OutputVariable):
This is the variable that you are trying to predict. It depends on the
independent variables.
Example: The house price in a real estate prediction model is the dependent
variable.

Example: Simple Linear Regression

If you want to predict salary based on years of experience:

· Independent variable (X) = Years of experience

· Dependent variable (Y) = Salary

Mathematically, the regression equation is:

Y=mX+b

where:

· Y is the dependent variable (salary),

· X is the independent variable (years of experience),
· m is the coefficient (slope),
· b is the intercept.
How to Use Regression?
1. Identify the Problem

Determine if your problem involves predicting a continuous variable (e.g., price,

temperature, salary).

Choose regression when the goal is to find relationships between variables.

2. Collect and Prepare Data

Gather relevant data with independent and dependent variables.

Clean the data by handling missing values, removing outliers, and normalizing if
necessary.

Split data into training and testing sets (e.g., 80% train, 20% test).

3. Choose the Right Regression Model

Linear Regression: When the relationship between variables is linear.

Polynomial Regression: When the relationship is nonlinear.

Multiple Linear Regression: When multiple independent variables influence the

outcome.

Logistic Regression: For classification problems (e.g., spam or not spam).

Ridge/Lasso Regression: For regularization to avoid overfitting.

Decision Tree/Random Forest Regression: For complex, nonlinear relationships.

4. Train the Model

5. Evaluate the Model

6. Make Predictions
Where Do We Use Regression?
Regression is widely used in various domains where predicting a continuous
numerical value is required. Here are some key applications:

1. Finance & Economics 💰

· Stock Price Prediction: Predict future stock prices based on historical data.
· Risk Assessment: Estimate financial risks for loans and investments.
· House Price Prediction: Estimate real estate prices based on location, size,
and amenities.

2. Healthcare & Medicine 🏥

· Disease Progression Prediction: Estimate how a disease (e.g., diabetes)
progresses over time.
· Medical Cost Estimation: Predict hospital bills based on patient data.
· BMI Calculation: Predict BMI based on weight, height, and lifestyle habits.

3.Marketing & Sales 📊

· Sales Forecasting: Predict future sales based on past data and market trends.
· Customer Lifetime Value (CLV): Estimate how much revenue a customer
will generate over time.
· Advertising Effectiveness: Predict revenue impact from different ad
campaigns.

4. Manufacturing & Supply Chain 🏭

· Demand Forecasting: Predict future product demand to optimize inventory.
· Quality Control: Predict product defects based on production factors.
· Energy Consumption Prediction: Estimate power usage based on
operational conditions.

5. Environmental Science & Weather 🌍

· Weather Forecasting: Predict temperature, humidity, and rainfall.
· Pollution Level Prediction: Estimate air quality based on emissions and
climate factors.
· Natural Disaster Prediction: Forecast floods, earthquakes, and hurricanes.
6. Sports Analytics ⚽
· Performance Prediction: Estimate player performance based on training
data.
· Injury Risk Analysis: Predict probability of injuries based on physical
metrics.
· Game Score Prediction: Estimate the outcome of sports matches.

7. Education 🎓
· Student Performance Prediction: Predict grades based on study hours and
attendance.
· Dropout Rate Analysis: Estimate the likelihood of students dropping out.
· Tuition Fee Estimation: Predict costs based on various factors.
Types of Regression
1. Linear Regression 📈
✅ Use Case: Predict continuous values (e.g., house prices, salary).
✅ Equation:

Y=mX+b

✅ Example: Predicting salary based on years of experience.

🔹 Simple Linear Regression → One independent variable.

🔹 Multiple Linear Regression → Multiple independent variables.

2. Polynomial Regression 🔄
✅ Use Case: When data has a non-linear relationship but is still continuous.
✅ Equation:

Y=aX2+bX+c

✅ Example: Predicting population growth or temperature variations.

🔹 Used when the relationship is curved, not straight.

3. Logistic Regression (for Classification) 🚦
✅ Use Case: Binary or multi-class classification problems (e.g., spam detection,
disease prediction).
✅ Equation (Sigmoid Function):

P(Y)=1/1+e^−(b0+b1X)

✅ Example: Predicting whether an email is spam (Yes/No).

🔹 Types:

· Binary Logistic Regression → Two classes (e.g., pass/fail).

· Multinomial Logistic Regression → More than two classes (e.g., predicting
type of weather: sunny, rainy, snowy).
· Ordinal Logistic Regression → Ordered categories (e.g., rating: low,
medium, high).
4. Ridge Regression (L2 Regularization) 🏋️
✅ Use Case: Prevents overfitting by adding a penalty to large coefficients.
✅ Equation (Loss Function with Regularization Term):

∑(Y−Y^)2+λ∑β^2

✅ Example: Used in high-dimensional data (e.g., financial modeling, genetics).

🔹 Helps when multicollinearity (correlation between independent variables) is

present.

5. Lasso Regression (L1 Regularization) ✂️

✅ Use Case: Feature selection by reducing less important variables to zero.
✅ Equation (Loss Function with Regularization Term):

∑(Y−Y^)2+λ∑∣β∣

✅ Example: Selecting the most relevant factors affecting house prices.

🔹 Helps in feature selection by shrinking irrelevant coefficients to zero.

Here's a Python script to perform the requested tasks:

Steps Covered in the Script:

1.Read a CSV file
You can read a CSV file in Python using the pandas library. Here’s how you can
do it:

· pd.read_csv("file.csv") loads the CSV file into a DataFrame.

2.Perform descriptive exploration (head, summary statistics)

· df.head() shows the first 5 rows of the dataset.

• The df.describe() function in Pandas provides summary statistics of a

DataFrame’s numerical columns.
3.Plot feature distributions (histograms, scatter plots)

Histogram Graph
Scatter Plot

4.Check linear relationship between two features

5.Split the dataset into 70% test and 30% train

Yes! train_test_split from sklearn.model_selection is used to split a dataset into

training and testing subsets.

· train_test_split() randomly divides New_Data into:

· 70% test data (test_data)
· 30% training data (train_data)
· test_size=0.3 → 30% of the data is used for testing.
· random_state=42 ensures that the split is reproducible (same split every time
you run it).

Machine Learning
100% (3)
Machine Learning
46 pages
Statistical Prediction and Machine Learning
100% (4)
Statistical Prediction and Machine Learning
314 pages
Machine Learning
No ratings yet
Machine Learning
158 pages
Unit 2 Notes - Final
No ratings yet
Unit 2 Notes - Final
32 pages
Unit 6
No ratings yet
Unit 6
107 pages
Unit 3 DSA
No ratings yet
Unit 3 DSA
69 pages
Module 2 Modified
No ratings yet
Module 2 Modified
67 pages
2494508-Machine Learning Module Notes
No ratings yet
2494508-Machine Learning Module Notes
41 pages
Pma 5
No ratings yet
Pma 5
39 pages
MLT Unit 2 Linear Regression
No ratings yet
MLT Unit 2 Linear Regression
26 pages
ML Combined
No ratings yet
ML Combined
254 pages
Week-14 Lecture 28
No ratings yet
Week-14 Lecture 28
34 pages
ML Report 1
No ratings yet
ML Report 1
23 pages
Session 1 Coding - Supervised Learning Recap and Code
No ratings yet
Session 1 Coding - Supervised Learning Recap and Code
25 pages
Ds Module 4
No ratings yet
Ds Module 4
73 pages
S&ML Unit 5 - Q & A
No ratings yet
S&ML Unit 5 - Q & A
15 pages
ML 01 (Pranavv)
No ratings yet
ML 01 (Pranavv)
14 pages
Smada PDF
No ratings yet
Smada PDF
17 pages
FDS-Unit III-ECE
No ratings yet
FDS-Unit III-ECE
16 pages
Machinelearning Algorithm Basics2 NOTES
No ratings yet
Machinelearning Algorithm Basics2 NOTES
72 pages
Unit 5
No ratings yet
Unit 5
18 pages
ML Final
No ratings yet
ML Final
92 pages
Aychew Chernet
No ratings yet
Aychew Chernet
8 pages
Lab Experiment 4 - AI
No ratings yet
Lab Experiment 4 - AI
7 pages
Comm-05-Random Variables and Processes
No ratings yet
Comm-05-Random Variables and Processes
90 pages
Machine Learning
No ratings yet
Machine Learning
7 pages
DMML Unit4
No ratings yet
DMML Unit4
77 pages
Lec 2
No ratings yet
Lec 2
6 pages
Mad Cthulhu - English
100% (2)
Mad Cthulhu - English
20 pages
ML 7th Sem AIML ITE Notes Complete LONG (1) - 34-62
No ratings yet
ML 7th Sem AIML ITE Notes Complete LONG (1) - 34-62
29 pages
Commonly Used Machine Learning Algorithms
No ratings yet
Commonly Used Machine Learning Algorithms
27 pages
Undergraduate Fundamentals of Machine Learning Author William J. Deuschle
No ratings yet
Undergraduate Fundamentals of Machine Learning Author William J. Deuschle
143 pages
Regression Vs Classification in Machine Learning Explained!
No ratings yet
Regression Vs Classification in Machine Learning Explained!
10 pages
Insolation PDF
No ratings yet
Insolation PDF
472 pages
228w1f0065 ML
No ratings yet
228w1f0065 ML
15 pages
2-Machine Learning Algorithms
No ratings yet
2-Machine Learning Algorithms
16 pages
4 ML
No ratings yet
4 ML
41 pages
Broadly, There Are 3 Types of Machine Learning Algorithms.
No ratings yet
Broadly, There Are 3 Types of Machine Learning Algorithms.
33 pages
Assignment Group C
No ratings yet
Assignment Group C
8 pages
Machine Learning Basic Principles
No ratings yet
Machine Learning Basic Principles
124 pages
ML 2 ND Unit
No ratings yet
ML 2 ND Unit
50 pages
ML Week 4
No ratings yet
ML Week 4
5 pages
Regression Logistic Unit3 Notes
No ratings yet
Regression Logistic Unit3 Notes
6 pages
Regression Dataset Example
No ratings yet
Regression Dataset Example
14 pages
Regression: UNIT - V Regression Model
100% (1)
Regression: UNIT - V Regression Model
21 pages
ML QB
No ratings yet
ML QB
13 pages
Stoichiometry
No ratings yet
Stoichiometry
60 pages
Unit - Iii Data Analysis
No ratings yet
Unit - Iii Data Analysis
39 pages
Lab Mannual of ML
No ratings yet
Lab Mannual of ML
43 pages
SDL Unit 1
No ratings yet
SDL Unit 1
7 pages
Commonly Used Machine Learning Algorithms
No ratings yet
Commonly Used Machine Learning Algorithms
38 pages
AI Lab7
No ratings yet
AI Lab7
13 pages
Commonly Used Machine Learning Algorithms (With Python and R Codes)
No ratings yet
Commonly Used Machine Learning Algorithms (With Python and R Codes)
19 pages
Machine Learning (Chapter1)
No ratings yet
Machine Learning (Chapter1)
8 pages
AGE 200 Qualitative and Quantitative Techniques in Geography
No ratings yet
AGE 200 Qualitative and Quantitative Techniques in Geography
54 pages
LP III Lab Manual
100% (1)
LP III Lab Manual
8 pages
Wa0023.
No ratings yet
Wa0023.
22 pages
H2 Maths Detailed Summary
No ratings yet
H2 Maths Detailed Summary
82 pages
CS601 - Machine Learning - Unit 1 - Notes - 1672759748
No ratings yet
CS601 - Machine Learning - Unit 1 - Notes - 1672759748
13 pages
Week 9 - PROG 8510 Week 9
No ratings yet
Week 9 - PROG 8510 Week 9
27 pages
Module 2
No ratings yet
Module 2
5 pages
Machine Learning: Bilal Khan
100% (2)
Machine Learning: Bilal Khan
20 pages
Turbulence and Spectra For Wind Field Simulation: Jakob Mann
No ratings yet
Turbulence and Spectra For Wind Field Simulation: Jakob Mann
110 pages
Unit1 6thsemCS
No ratings yet
Unit1 6thsemCS
22 pages
Aiml Unit 3
No ratings yet
Aiml Unit 3
9 pages
2 18 Covariance
No ratings yet
2 18 Covariance
34 pages
Jurusan Fisika Fakultas MIPA Universitas Negeri Padang: Mairizwan@unp - Ac.id
No ratings yet
Jurusan Fisika Fakultas MIPA Universitas Negeri Padang: Mairizwan@unp - Ac.id
38 pages
Forecast - Notes
100% (1)
Forecast - Notes
24 pages
Understanding Fixed Limit Gages
No ratings yet
Understanding Fixed Limit Gages
17 pages
ACRM Non Fragility Test
No ratings yet
ACRM Non Fragility Test
15 pages
Probability
No ratings yet
Probability
3 pages
What Is Statistics
No ratings yet
What Is Statistics
6 pages
Bachelor of Business Administration (BBA) : Q.T. in Business
No ratings yet
Bachelor of Business Administration (BBA) : Q.T. in Business
4 pages
ML Notes
No ratings yet
ML Notes
47 pages
Improvement of The Van Der Waals Equation of State
No ratings yet
Improvement of The Van Der Waals Equation of State
13 pages
Introduction To Biostatistics: Data Collection Descriptive Statistics
No ratings yet
Introduction To Biostatistics: Data Collection Descriptive Statistics
33 pages
Vacuum Requirements For Cryogenic Vessels: A Journal of From Practical and Useful Vacuum Technology
No ratings yet
Vacuum Requirements For Cryogenic Vessels: A Journal of From Practical and Useful Vacuum Technology
2 pages
Dew Point
No ratings yet
Dew Point
2 pages
Extreme Climate Phenomena and Bond Returns
No ratings yet
Extreme Climate Phenomena and Bond Returns
29 pages
Nanyang Girls' High School Second Block Test 2013 Secondary Four Integrated Mathematics 1
No ratings yet
Nanyang Girls' High School Second Block Test 2013 Secondary Four Integrated Mathematics 1
10 pages
Sumit Kumar
No ratings yet
Sumit Kumar
58 pages
Sas Arma Forecast
No ratings yet
Sas Arma Forecast
11 pages
Pranavsql
No ratings yet
Pranavsql
26 pages
Ibandronate
No ratings yet
Ibandronate
6 pages
Simple Regression B
No ratings yet
Simple Regression B
7 pages
247978
No ratings yet
247978
16 pages
Interstellar Operations Beta - Strategic BattleForce
No ratings yet
Interstellar Operations Beta - Strategic BattleForce
40 pages
PDFF
No ratings yet
PDFF
15 pages
Germination Traits Explain The Success of Direct Seeding Restoration in The Seasonal Tropics of Brazil Vieira Laumann Maxmiller
No ratings yet
Germination Traits Explain The Success of Direct Seeding Restoration in The Seasonal Tropics of Brazil Vieira Laumann Maxmiller
8 pages
ML File 17 March
No ratings yet
ML File 17 March
18 pages
A.C. Joshi Library Panjab University, Chandigarh
No ratings yet
A.C. Joshi Library Panjab University, Chandigarh
1 page
ML 2 Marks Quick Revision
No ratings yet
ML 2 Marks Quick Revision
3 pages
Check Balance and Imbalance Using Stack
No ratings yet
Check Balance and Imbalance Using Stack
2 pages
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
Machine Learning Interview Questions
From Everand
Machine Learning Interview Questions
Tech Interviews
4.5/5 (2)
Statistical Classification: Fundamentals and Applications
From Everand
Statistical Classification: Fundamentals and Applications
Fouad Sabry
No ratings yet

ML 01 (Shubham)

Uploaded by

ML 01 (Shubham)

Uploaded by

Practical File

Submitted To: Submitted by:

Dr.Mandeep Kaur ShubhamKumar

Department of Information Technology

Panjab University Chandigarh

Types of Machine Learning

· Classification: Predicting categories (e.g., spam detection in emails).

· Clustering: Grouping similar data points (e.g., customer segmentation).

3.Reinforcement Learning – The model learns by interacting with an environment

Applications of Machine Learning

· Healthcare: Disease prediction, medical image analysis.

In machine learning regression, independent and dependent variables are key

· IndependentVariable (Predictor, Feature, Input Variable):

Example: Simple Linear Regression

· Independent variable (X) = Years of experience

Mathematically, the regression equation is:

· Y is the dependent variable (salary),

Determine if your problem involves predicting a continuous variable (e.g., price,

Choose regression when the goal is to find relationships between variables.

2. Collect and Prepare Data

Gather relevant data with independent and dependent variables.

3. Choose the Right Regression Model

Linear Regression: When the relationship between variables is linear.

Polynomial Regression: When the relationship is nonlinear.

Multiple Linear Regression: When multiple independent variables influence the

Logistic Regression: For classification problems (e.g., spam or not spam).

Ridge/Lasso Regression: For regularization to avoid overfitting.

Decision Tree/Random Forest Regression: For complex, nonlinear relationships.

4. Train the Model

5. Evaluate the Model

1. Finance & Economics 💰

2. Healthcare & Medicine 🏥

3.Marketing & Sales 📊

4. Manufacturing & Supply Chain 🏭

5. Environmental Science & Weather 🌍

✅ Example: Predicting salary based on years of experience.

🔹 Simple Linear Regression → One independent variable.

✅ Example: Predicting population growth or temperature variations.

🔹 Used when the relationship is curved, not straight.

✅ Example: Predicting whether an email is spam (Yes/No).

· Binary Logistic Regression → Two classes (e.g., pass/fail).

✅ Example: Used in high-dimensional data (e.g., financial modeling, genetics).

🔹 Helps when multicollinearity (correlation between independent variables) is

5. Lasso Regression (L1 Regularization) ✂️

✅ Example: Selecting the most relevant factors affecting house prices.

🔹 Helps in feature selection by shrinking irrelevant coefficients to zero.

Steps Covered in the Script:

· pd.read_csv("file.csv") loads the CSV file into a DataFrame.

· df.head() shows the first 5 rows of the dataset.

• The df.describe() function in Pandas provides summary statistics of a

4.Check linear relationship between two features

Yes! train_test_split from sklearn.model_selection is used to split a dataset into

· train_test_split() randomly divides New_Data into:

You might also like