AI Lab File - C


Uploaded by Debabrata Pain


INDEX

Exp No.  Aim of Experiment                                                     Date of Allotment  Date of Evaluation
1        Exploring Pandas DataFrame                                            03-01-2023         10-01-2023
2        Data Cleansing                                                        10-01-2023         17-01-2023
3        Statistical Summary and basic plotting                                17-01-2023         24-01-2023
4        Predict CO2 Emission using Simple Linear Regression                   24-01-2023         31-01-2023
5        Predict CO2 Emission using Gradient Descent                           31-01-2023         07-02-2023
6        Predict CO2 Emission using Multilinear Regression.                    07-02-2023         14-02-2023
7        Perform logistic regression on the “ChurnData.csv”.                   14-02-2023         21-02-2023
8        Linear Discriminant Analysis on the IRIS dataset                      21-02-2023         28-02-2023
9        Using Support Vector Machine build a breast cancer detection system.  28-02-2023         07-03-2023
10       Using decision tree, build an ML model on the “diabetes” dataset.     07-03-2023         14-03-2023
11       Classify the “Iris Dataset” using KNN                                 14-03-2023         21-03-2023
12       Cluster the dataset using K-Means Clustering                          21-03-2023         28-03-2023

Shivendra Singh
A2305220681
Practical-1 Date: 03-01-2023
Aim: Exploring Pandas DataFrame

You are given a CSV file named 'student.csv', whose first few records are as below.
Write the Python code/command for the following questions.

(a) Load this file into a python Data Frame

Output:

(b) Find the number of rows and columns in it.

Output:

(c) Print column names
Output:

(d) Change column name ‘Name’ to the new name ‘FirstName’
Output:

(e) Print last 5 rows from the bottom

Output:

(f) Print the details of student with lowest marks

Output:

(g) Find total marks of all female students

Output:

(h) List names of all the male students
Output:

(i) Find mean age of the class

Output:

(j) Line plot marks of the class

Output:

(k) Find the index of the record of the oldest student in the class
Output:

(l) Sort and print the data on the basis of Name followed by Age.
Output:

(m) Change the name 'Nihar' to 'Jason Bourne' in the Name column of the DataFrame.
Output:

(n) Change and print the order of the columns (Name, Sex, Age, Marks, Grade).
Output:

(o) Count and print the number of students sex-wise and display the result with suitable column headers.
Output:

(p) Delete and print row where age=46

Output:

(q) Print the data types of individual columns of the data frame
Output:

(r) Convert and print the datatype of a given column Age (int to float).
Output:

(s) Create a new column named “UpdatedMarks” which has 5.5% more marks than the existing Marks column.
Output:

(t) Delete the “Marks” column

Output:
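
The original code and outputs for this practical were screenshots and are not part of the text. As a minimal sketch of how questions (a) to (t) might be answered with pandas: the records below are hypothetical stand-ins for student.csv, and the 'M'/'F' coding of the Sex column is an assumption. Only the column names Name, Sex, Age, Marks and Grade come from question (n).

```python
import io
import pandas as pd

# Hypothetical records standing in for student.csv (the real file is not shown).
CSV_TEXT = """Name,Age,Sex,Marks,Grade
Nihar,21,M,72.0,B
Asha,20,F,88.0,A
Ravi,46,M,55.0,C
Meera,22,F,64.0,B
Kabir,23,M,91.0,A
Tara,19,F,47.0,D
"""

df = pd.read_csv(io.StringIO(CSV_TEXT))        # (a) on the real file: pd.read_csv("student.csv")
n_rows, n_cols = df.shape                       # (b) number of rows and columns
column_names = list(df.columns)                 # (c) column names
df = df.rename(columns={"Name": "FirstName"})   # (d) rename a column
last_five = df.tail(5)                          # (e) last 5 rows
lowest = df.loc[df["Marks"].idxmin()]           # (f) record with the lowest marks
female_total = df.loc[df["Sex"] == "F", "Marks"].sum()        # (g) total marks of females
male_names = df.loc[df["Sex"] == "M", "FirstName"].tolist()   # (h) names of males
mean_age = df["Age"].mean()                     # (i) mean age
oldest_index = df["Age"].idxmax()               # (k) index of the oldest student
sorted_df = df.sort_values(["FirstName", "Age"])              # (l) sort by name, then age
df["FirstName"] = df["FirstName"].replace("Nihar", "Jason Bourne")  # (m) change a value
df = df[["FirstName", "Sex", "Age", "Marks", "Grade"]]        # (n) reorder columns
sex_counts = (df["Sex"].value_counts()
              .rename_axis("Sex").reset_index(name="Count"))  # (o) count per sex
deleted_rows = df[df["Age"] == 46]              # (p) rows that will be deleted
df = df[df["Age"] != 46].copy()
column_types = df.dtypes                        # (q) data type of each column
df["Age"] = df["Age"].astype(float)             # (r) int to float
df["UpdatedMarks"] = df["Marks"] * 1.055        # (s) 5.5% more than Marks
df = df.drop(columns="Marks")                   # (t) delete the Marks column
print(df)
```

Question (j) is left out of the sketch because plotting needs matplotlib; with it installed, df["Marks"].plot(kind="line") draws the line plot before the Marks column is dropped.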

Conclusion:
Hence, the basics of machine learning were studied using Python.

Evaluation:

Practical-2 Date: 10-01-2023

AIM: Data Cleansing

The given CSV file named “RSData.csv” contains real estate data for a particular city.
Write Python commands/code to answer the following questions on this dataset.

propertyid  stno  stname             owneroccupied  numbedrooms  numbathrooms  areasqft
100001000   10    Kranti Road        Y              3            1             1000
100002000   17    Bhagat Singh Road  N              3            1.5           --
100003000         Bhagat Singh Road  N              n/a          1             850
100004000   20    Azad Mard          12             1            NaN           700
            23    Azad Mard          Y              3            2             1600
100006000   20    Azad Mard          Y              NA           1             800
100007000   NA    Shivaji Road                      2            Surya         950
100008000   13    Netaji Marg        Y              1            1
100009000   15    Netaji Marg        Y              na           2             1800

1. Write the command to read this data in the data frame.
Output:

2. List number of rows and columns in this dataset

Output:

3. Check whether there is any missing value in the entire dataset.

Output:

4. Which column(s) do not have any missing value?
Output:

5. Which columns have maximum number of missing values?

Output:

6. How many rows do not have any missing value(s)?
Output:

7. If there is any missing value in the “areasqft” replace it with 900

Output:

8. Fill the street number of record at index 2 with 77
Output:

9. If there is any missing value in the “number of bedrooms” column, then replace it with the
median value of this column.
Output:

10. Give your comment on the “owner occupied” column.

Output:

11. It is believed that the data entry operator might have entered integer values in the “owner
occupied” column. Count the number of such entries and replace them with the numpy standard nan
(np.nan).
Output:
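
The original answers were screenshots and did not survive in the text. The following is a minimal sketch of one way to work through questions 1 to 11 with pandas, using the nine records from the table in this practical. Two things are assumptions rather than facts from the source: that "--" and "na" should be treated as missing-value markers alongside the markers pandas recognises by default, and that the record for property 100007000 has an empty "owneroccupied" cell.

```python
import io
import numpy as np
import pandas as pd

# The nine records from the table in this practical; empty cells are left blank.
CSV_TEXT = """propertyid,stno,stname,owneroccupied,numbedrooms,numbathrooms,areasqft
100001000,10,Kranti Road,Y,3,1,1000
100002000,17,Bhagat Singh Road,N,3,1.5,--
100003000,,Bhagat Singh Road,N,n/a,1,850
100004000,20,Azad Mard,12,1,NaN,700
,23,Azad Mard,Y,3,2,1600
100006000,20,Azad Mard,Y,NA,1,800
100007000,NA,Shivaji Road,,2,Surya,950
100008000,13,Netaji Marg,Y,1,1,
100009000,15,Netaji Marg,Y,na,2,1800
"""

# 1. Read the data; the na_values list is an assumption about what counts as missing.
df = pd.read_csv(io.StringIO(CSV_TEXT), na_values=["--", "na", "n/a", "NA"])

n_rows, n_cols = df.shape                                  # 2. rows and columns
has_missing = bool(df.isnull().values.any())               # 3. any missing value at all?
missing_per_col = df.isnull().sum()
complete_cols = missing_per_col[missing_per_col == 0].index.tolist()   # 4.
worst_cols = missing_per_col[
    missing_per_col == missing_per_col.max()].index.tolist()           # 5.
complete_rows = int(df.notnull().all(axis=1).sum())        # 6. rows with no missing value

df["areasqft"] = df["areasqft"].fillna(900)                # 7. fill missing area with 900
df.loc[2, "stno"] = 77                                     # 8. street number at index 2
df["numbedrooms"] = df["numbedrooms"].fillna(df["numbedrooms"].median())  # 9.

# 10./11. "owneroccupied" should hold Y/N; entries that parse as numbers are
# treated as data-entry errors, counted, and replaced with np.nan.
numeric_mask = pd.to_numeric(df["owneroccupied"], errors="coerce").notna()
n_numeric = int(numeric_mask.sum())
df.loc[numeric_mask, "owneroccupied"] = np.nan
print(df)
```

The text value "Surya" in "numbathrooms" is left untouched here because none of the questions ask for it to be fixed; it keeps that column from being read as numeric.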

Conclusion:
Hence, the concept of data cleaning is studied and implemented successfully.

Evaluation:

Practical-3 Date: 17-01-2023

AIM: Statistical Summary and basic plotting

Load the “mtcar.csv” dataset and write python code to perform the following operations
model              mpg   cyl  disp   hp   drat  wt     qsec   vs  am  gear  carb
Mazda RX4          21    6    160    110  3.9   2.62   16.46  0   1   4     4
Mazda RX4 Wag      21    6    160    110  3.9   2.875  17.02  0   1   4     4
Datsun 710         22.8  4    108    93   3.85  2.32   18.61  1   1   4     1
Hornet 4 Drive     21.4  6    258    110  3.08  3.215  19.44  1   0   3     1
Hornet Sportabout  18.7  8    360    175  3.15  3.44   17.02  0   0   3     2
Valiant            18.1  6    225    105  2.76  3.46   20.22  1   0   3     1
Duster 360         14.3  8    360    245  3.21  3.57   15.84  0   0   3     4
Merc 240D          24.4  4    146.7  62   3.69  3.19   20     1   0   4     2
Merc 230           22.8  4    140.8  95   3.92  3.15   22.9   1   0   4     2

(a) Generate various summary statistics such as mean, standard deviation, minimum value,
maximum value, and the 1st, 2nd and 3rd quartiles for all the numerical attributes.
Output:

(b) Count the number of non-NaN items per feature
Output:

(c) Calculate mean absolute deviation.

Output:

(d) Calculate median for any of the numeric type column

Output:

(e) Calculate the mean for any of the numeric type columns

Output:

(f) Calculate mode for “hp” column
Output:

(g) Calculate skewness for “disp”

Output:

(h) Find the coefficient of correlation between all the numeric attributes.
Output:

(i) Scatter plot a graph between “hp” vs “mpg”

Output:

(j) Plot a density diagram (KDE, kernel density estimation) for “displacement”
Output:

(k) Plot a bar diagram for “CarName” vs “mpg”

Output:

(l) Box plot the “hp” attribute.
Output:
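
The outputs for this practical were images and are missing from the text. As a minimal sketch of the statistical questions (a) to (h), the code below uses only the nine rows listed in the table of this practical, so its numbers describe that subset and not the full mtcar.csv file.

```python
import io
import pandas as pd

# The nine rows shown in this practical (a subset of the full file).
CSV_TEXT = """model,mpg,cyl,disp,hp,drat,wt,qsec,vs,am,gear,carb
Mazda RX4,21.0,6,160.0,110,3.90,2.620,16.46,0,1,4,4
Mazda RX4 Wag,21.0,6,160.0,110,3.90,2.875,17.02,0,1,4,4
Datsun 710,22.8,4,108.0,93,3.85,2.320,18.61,1,1,4,1
Hornet 4 Drive,21.4,6,258.0,110,3.08,3.215,19.44,1,0,3,1
Hornet Sportabout,18.7,8,360.0,175,3.15,3.440,17.02,0,0,3,2
Valiant,18.1,6,225.0,105,2.76,3.460,20.22,1,0,3,1
Duster 360,14.3,8,360.0,245,3.21,3.570,15.84,0,0,3,4
Merc 240D,24.4,4,146.7,62,3.69,3.190,20.00,1,0,4,2
Merc 230,22.8,4,140.8,95,3.92,3.150,22.90,1,0,4,2
"""

cars = pd.read_csv(io.StringIO(CSV_TEXT))   # on the real file: pd.read_csv("mtcar.csv")
numeric = cars.select_dtypes("number")      # drop the text column "model"

summary = numeric.describe()                # (a) mean, std, min, max and quartiles
non_nan_counts = cars.count()               # (b) non-NaN items per feature
# (c) Mean absolute deviation, computed by hand because DataFrame.mad()
# was removed in pandas 2.0.
mean_abs_dev = (numeric - numeric.mean()).abs().mean()
median_mpg = cars["mpg"].median()           # (d) median of one numeric column
mean_mpg = cars["mpg"].mean()               # (e) mean of one numeric column
mode_hp = cars["hp"].mode().tolist()        # (f) mode of "hp"
skew_disp = cars["disp"].skew()             # (g) skewness of "disp"
corr = numeric.corr()                       # (h) correlation between numeric attributes
print(summary)
```

The plotting questions (i) to (l) are not included because they need matplotlib (and scipy for the KDE). With those installed, cars.plot(kind="scatter", x="hp", y="mpg"), cars["disp"].plot(kind="kde"), cars.plot(kind="bar", x="model", y="mpg") and cars["hp"].plot(kind="box") are the usual pandas calls.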

Conclusion:
Hence, the concept of plotting of different types of graphs is studied and implemented
successfully.

Evaluation:

Practical-4 Date: 24-01-2023

AIM: Predict CO2 Emission using Simple Linear Regression

You are given the following data.

(a) Scatter plot “EngineSize vs CO2Emissions”

(b) Predict the value of CO2 emission on the basis of engine size for EngineSize = 2.4, using the
data given in the “CO2Small.csv” file.

(c) Calculate R^2.

(d) Calculate the above details on the larger dataset i.e., file “CO2Full Data.csv”

(e) Print the final equation of the line

(f) Plot final line (best fit line) along with other data points.

(g) Give your comments by comparing the R^2 of both the datasets.
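
The code and plots for this practical were images and are missing from the text. A minimal sketch of questions (b), (c) and (e) follows, assuming a least-squares fit with NumPy. The seven data points are hypothetical, since the contents of “CO2Small.csv” are not shown, and the helper names fit_line and r_squared are illustrative only.

```python
import numpy as np

# Hypothetical sample; the contents of CO2Small.csv are not shown in the source.
engine_size = np.array([1.5, 2.0, 2.4, 3.0, 3.5, 4.0, 5.0])
co2 = np.array([160.0, 185.0, 200.0, 230.0, 250.0, 270.0, 320.0])


def fit_line(x, y):
    """Least-squares slope and intercept for y = slope * x + intercept."""
    slope, intercept = np.polyfit(x, y, deg=1)
    return slope, intercept


def r_squared(x, y, slope, intercept):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    residuals = y - (slope * x + intercept)
    ss_res = np.sum(residuals ** 2)
    ss_tot = np.sum((y - y.mean()) ** 2)
    return 1.0 - ss_res / ss_tot


slope, intercept = fit_line(engine_size, co2)
predicted_at_2_4 = slope * 2.4 + intercept            # (b) prediction for EngineSize = 2.4
r2 = r_squared(engine_size, co2, slope, intercept)    # (c) R^2 of the fit

print(f"CO2 = {slope:.2f} * EngineSize + {intercept:.2f}")   # (e) equation of the line
print(f"Prediction at 2.4: {predicted_at_2_4:.1f}, R^2 = {r2:.3f}")
```

For question (d), the same two functions can be applied to the columns of the larger file, and comparing the two R^2 values answers question (g).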

Conclusion:
Hence, the prediction of CO2 Emission using Simple Linear Regression was implemented
successfully.

Evaluation:

Practical-5 Date: 31-01-2023

AIM: Predict CO2 Emission using Gradient Descent

(a) Scatter plot “EngineSize vs CO2Emissions”

(b) Calculate R^2 value

(c) Print the final equation of the line

(d) Plot final line (best fit line) along with other data points.
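
The code for this practical was an image and is not in the text. The following is a minimal sketch of batch gradient descent on the mean squared error of a straight line, covering questions (b) and (c). The data points are hypothetical, and the learning rate and iteration count are assumptions tuned to this small sample; they would need adjusting (or the feature scaling) for the real data.

```python
import numpy as np

# Hypothetical sample; the lab's CO2 data file is not reproduced in the source.
engine_size = np.array([1.5, 2.0, 2.4, 3.0, 3.5, 4.0, 5.0])
co2 = np.array([160.0, 185.0, 200.0, 230.0, 250.0, 270.0, 320.0])


def gradient_descent(x, y, learning_rate=0.05, n_iters=20000):
    """Minimise the mean squared error of y = m * x + c by batch gradient descent."""
    m, c = 0.0, 0.0
    n = len(x)
    for _ in range(n_iters):
        error = (m * x + c) - y
        grad_m = (2.0 / n) * np.sum(error * x)   # partial derivative of MSE w.r.t. m
        grad_c = (2.0 / n) * np.sum(error)       # partial derivative of MSE w.r.t. c
        m -= learning_rate * grad_m
        c -= learning_rate * grad_c
    return m, c


m, c = gradient_descent(engine_size, co2)
predictions = m * engine_size + c
# (b) R^2 = 1 - SS_res / SS_tot
r2 = 1.0 - np.sum((co2 - predictions) ** 2) / np.sum((co2 - co2.mean()) ** 2)
# (c) final equation of the line
print(f"CO2 = {m:.2f} * EngineSize + {c:.2f}, R^2 = {r2:.3f}")
```

On data this small, the result should agree with a closed-form least-squares fit, which is a convenient way to check that the descent has converged.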

Conclusion:
Hence, the prediction of CO2 Emission using Gradient Descent was implemented
successfully.

Evaluation:

Practical-6 Date: 07-02-2023
AIM: Predict CO2 Emission using Multilinear Regression.

a. Using the above data build a multi-variable linear regression.

b. What is the accuracy level of your model?

• The accuracy of the model is measured by the R^2 score

c. Which attributes have you used in this model?

• ENGINESIZE
• CYLINDERS
• FUELCONSUMPTION_CITY
• FUELCONSUMPTION_HWY
• FUELCONSUMPTION_COMB

d. Write your observation/comment about this data and the model.

• The ENGINESIZE and CYLINDERS attributes are positively correlated with CO2
emissions. The fuel consumption attributes are reported here as negatively related to
emissions; since burning more fuel normally produces more CO2, a negative sign is more
likely to come from fitted coefficients distorted by the strong correlation among the three
fuel consumption columns than from the raw correlation, and should be checked against the
correlation matrix.
• The linear regression model assumes a linear relationship between the input attributes
and the output variable, which may not be entirely accurate in this case.
• There may be other factors that influence CO2 emissions that are not captured by the
input attributes in this dataset.
• Overall, the model seems to provide a reasonably accurate prediction of CO2
emissions based on the available data.
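
The model code for this practical was an image and is not in the text. A minimal sketch of a multi-variable linear regression with scikit-learn follows, using the five attributes listed under (c). The data is synthetic with made-up relationships, the target column name CO2EMISSIONS is an assumption, and the numbers it prints say nothing about the lab's real dataset.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.metrics import r2_score
from sklearn.model_selection import train_test_split

FEATURES = ["ENGINESIZE", "CYLINDERS", "FUELCONSUMPTION_CITY",
            "FUELCONSUMPTION_HWY", "FUELCONSUMPTION_COMB"]
TARGET = "CO2EMISSIONS"  # assumed name of the target column

# Synthetic stand-in data with made-up relationships; replace with the lab's CSV.
rng = np.random.default_rng(0)
n = 200
engine = rng.uniform(1.0, 6.0, n)
cylinders = np.clip(np.round(1.5 * engine + rng.normal(0, 0.5, n)), 3, 12)
city = 5.0 + 2.0 * engine + rng.normal(0, 0.8, n)
hwy = 0.7 * city + rng.normal(0, 0.5, n)
comb = 0.55 * city + 0.45 * hwy + rng.normal(0, 0.1, n)
df = pd.DataFrame({
    "ENGINESIZE": engine,
    "CYLINDERS": cylinders,
    "FUELCONSUMPTION_CITY": city,
    "FUELCONSUMPTION_HWY": hwy,
    "FUELCONSUMPTION_COMB": comb,
    TARGET: 23.0 * comb + rng.normal(0, 5.0, n),
})

X_train, X_test, y_train, y_test = train_test_split(
    df[FEATURES], df[TARGET], test_size=0.2, random_state=0)

model = LinearRegression().fit(X_train, y_train)      # a. fit on the training split
r2 = r2_score(y_test, model.predict(X_test))          # b. R^2 on held-out data
coefficients = dict(zip(FEATURES, model.coef_))       # c. one weight per attribute

# d. Plain correlation of each attribute with the target, to compare its sign
# with the sign of the fitted coefficient for the same attribute.
correlations = df[FEATURES].corrwith(df[TARGET])

print(f"R^2 on test data: {r2:.3f}")
print(coefficients)
print(correlations)
```

Printing the correlations next to the coefficients is a design choice: when input columns are strongly correlated with each other, a coefficient can change sign even though the attribute itself moves in the same direction as the target.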

Conclusion:
Hence, the prediction of CO2 Emission using Multilinear Regression was implemented
successfully.

Evaluation:

Practical-7 Date: 14-02-2023

AIM: Perform logistic regression on the “ChurnData.csv”.

Consider only the following features from the file.

[['tenure', 'age', 'address', 'income', 'ed', 'employ', 'equip', 'callcard', 'wireless','churn']]

a. Print confusion matrix

b. Print classification report
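
The code and outputs for this practical were images and are not in the text. A minimal sketch with scikit-learn follows. ChurnData.csv is not reproduced in the source, so the sketch generates synthetic rows with the ten listed column names and a made-up churn rule; the split ratio, random seeds and the scaling step are likewise assumptions.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import classification_report, confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

COLUMNS = ["tenure", "age", "address", "income", "ed", "employ",
           "equip", "callcard", "wireless", "churn"]

# Synthetic stand-in: ChurnData.csv itself is not reproduced in the source.
rng = np.random.default_rng(1)
n = 300
data = pd.DataFrame({
    "tenure": rng.integers(1, 72, n),
    "age": rng.integers(18, 70, n),
    "address": rng.integers(0, 40, n),
    "income": rng.uniform(10, 200, n),
    "ed": rng.integers(1, 6, n),
    "employ": rng.integers(0, 40, n),
    "equip": rng.integers(0, 2, n),
    "callcard": rng.integers(0, 2, n),
    "wireless": rng.integers(0, 2, n),
})
# Made-up rule for illustration: customers with short tenure churn more often.
churn_prob = 1.0 / (1.0 + np.exp(0.08 * (data["tenure"] - 30)))
data["churn"] = (rng.uniform(0, 1, n) < churn_prob).astype(int)

churn_df = data[COLUMNS]   # on the real file: pd.read_csv("ChurnData.csv")[COLUMNS]
X = churn_df.drop(columns="churn")
y = churn_df["churn"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=4, stratify=y)

# Scale features using statistics from the training split only.
scaler = StandardScaler().fit(X_train)
clf = LogisticRegression().fit(scaler.transform(X_train), y_train)
y_pred = clf.predict(scaler.transform(X_test))

cm = confusion_matrix(y_test, y_pred, labels=[0, 1])          # a. confusion matrix
report = classification_report(y_test, y_pred, zero_division=0)  # b. per-class metrics
print(cm)
print(report)
```

In the confusion matrix returned by scikit-learn, rows are the true classes and columns are the predicted classes, in the order given by labels.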

Conclusion:
Hence, logistic regression was implemented successfully.

Evaluation:

Practical-8 Date: 21-02-2023

AIM: Linear Discriminant Analysis on the IRIS dataset
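
The code and output for this practical were images and are not in the text. A minimal sketch follows, assuming the copy of the Iris dataset bundled with scikit-learn and a 70/30 stratified split; neither choice is stated in the source.

```python
from sklearn.datasets import load_iris
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y)

# With 3 classes, LDA can project onto at most 2 discriminant axes.
lda = LinearDiscriminantAnalysis(n_components=2)
X_train_2d = lda.fit_transform(X_train, y_train)   # supervised projection to 2-D
accuracy = accuracy_score(y_test, lda.predict(X_test))

print("Explained variance ratio:", lda.explained_variance_ratio_)
print(f"Test accuracy: {accuracy:.3f}")
```

The same fitted object serves both purposes here: fit_transform gives the reduced 2-D representation (useful for a scatter plot of the classes), and predict uses LDA as a classifier.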

Conclusion:
Hence, the Linear Discriminant Analysis on the IRIS dataset was done successfully.

Evaluation:

Practical-9 Date: 28-02-2023

AIM: Using Support Vector Machine build a breast cancer detection system. Use the breast
cancer dataset available with the datasets package of sklearn library.

a. Print the accuracy, precision and recall for the model built.
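
The code and output for this practical were images and are not in the text. A minimal sketch follows, using the breast cancer dataset from sklearn.datasets as the aim specifies. The linear kernel, the scaling step and the 75/25 split are assumptions, not the author's stated settings.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.metrics import accuracy_score, precision_score, recall_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# In this dataset the target is coded 0 = malignant, 1 = benign.
X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y)

# SVMs are sensitive to feature scale, so standardise before fitting.
model = make_pipeline(StandardScaler(), SVC(kernel="linear"))
model.fit(X_train, y_train)
y_pred = model.predict(X_test)

accuracy = accuracy_score(y_test, y_pred)
# precision_score and recall_score treat label 1 (benign) as the positive class
# by default; pass pos_label=0 to report them for the malignant class instead.
precision = precision_score(y_test, y_pred)
recall = recall_score(y_test, y_pred)
print(f"Accuracy: {accuracy:.3f}, Precision: {precision:.3f}, Recall: {recall:.3f}")
```

For a detection system, recall on the malignant class is usually the figure of most interest, which is why the pos_label option is pointed out in the comments.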

Conclusion:
Hence, using Support Vector Machine, a breast cancer detection system was built
successfully.

Evaluation:

Practical-10 Date: 07-03-2023

AIM: Using decision tree, build an ML model on the “diabetes” dataset.
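
The code and output for this practical were images and are not in the text, and the source does not say which “diabetes” dataset was used. The sketch below assumes the diabetes dataset bundled with scikit-learn, whose target is a continuous disease-progression score, so it fits a decision tree regressor. If the lab used a dataset with a yes/no outcome, DecisionTreeClassifier would be the matching estimator. The depth limit and split ratio are also assumptions.

```python
from sklearn.datasets import load_diabetes
from sklearn.metrics import r2_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeRegressor

# Assumption: scikit-learn's bundled diabetes data (442 rows, 10 features,
# continuous target). The lab's actual file is not identified in the source.
X, y = load_diabetes(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)

# A shallow tree is used to limit overfitting on this small dataset.
tree = DecisionTreeRegressor(max_depth=3, random_state=0)
tree.fit(X_train, y_train)

predictions = tree.predict(X_test)
train_r2 = r2_score(y_train, tree.predict(X_train))
test_r2 = r2_score(y_test, predictions)
print(f"Tree depth: {tree.get_depth()}")
print(f"R^2 on training data: {train_r2:.3f}, on test data: {test_r2:.3f}")
```

Comparing the training and test scores gives a quick indication of overfitting; increasing max_depth typically raises the first while lowering the second.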

Conclusion:
Hence, using a decision tree, an ML model on the “diabetes” dataset was built successfully.

Evaluation:

Practical-11 Date: 14-03-2023

AIM: Classify the “Iris Dataset” using KNN. Print Precision, Recall, F1 and Support.
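
The code and output for this practical were images and are not in the text. A minimal sketch follows, assuming the Iris data bundled with scikit-learn, k = 5 neighbours and a 70/30 stratified split; none of these settings is stated in the source.

```python
from sklearn.datasets import load_iris
from sklearn.metrics import classification_report, precision_recall_fscore_support
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

iris = load_iris()
X_train, X_test, y_train, y_test = train_test_split(
    iris.data, iris.target, test_size=0.3, random_state=0, stratify=iris.target)

knn = KNeighborsClassifier(n_neighbors=5)   # k = 5 is an assumed setting
knn.fit(X_train, y_train)
y_pred = knn.predict(X_test)

# One value per class for each metric; support is the number of true samples per class.
precision, recall, f1, support = precision_recall_fscore_support(y_test, y_pred)

# The same four quantities as a formatted table.
print(classification_report(y_test, y_pred, target_names=iris.target_names))
```

precision_recall_fscore_support is convenient when the numbers are needed in code, while classification_report is the shorter route when they only need to be printed.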

Conclusion:
Hence, classification of the “Iris Dataset” using KNN was implemented successfully.

Evaluation:

Practical-12 Date: 21-03-2023

AIM: You are given the “Mall-Customer” dataset. Cluster the dataset using K-Means Clustering.
Show the steps for estimation of optimum value of k.

First few records of the dataset are given below.

The above code performs the following steps:
1. Loads the Mall-Customer dataset
2. Selects the columns to use for clustering
3. Scales the features using StandardScaler
4. Estimates the optimum value of k using the elbow method and the silhouette score
5. Clusters the dataset using the optimum value of k
6. Visualizes the clusters

The optimum value of k in k-means clustering can be estimated using the following methods:
Elbow method: Plot the within-cluster sum of squares (WCSS) against the number of clusters (k). The
WCSS is the sum of squared distances between each point in a cluster and its centroid. The plot will
have a shape like an elbow, and the optimum value of k will be at the "elbow" or the point where
the rate of decrease in WCSS slows down significantly. This method gives a visual representation of
the best k value for clustering.

Silhouette method: The silhouette score measures how similar a data point is to its own cluster
compared to other clusters. It ranges from -1 to 1, where a score closer to 1 indicates that the point
is well-matched to its own cluster and poorly matched to neighbouring clusters. Compute the
average silhouette score for different values of k and choose the k with the highest average score.
This method is more quantitative than the elbow method and can be used to find a more precise
value of k.
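
The code this section describes was an image and is not in the text. The listed steps and the two selection methods can be sketched as follows with scikit-learn. The Mall-Customer records are not reproduced in the source, so the sketch uses a synthetic two-column dataset with five well-separated groups; the group centres, the range of k and the random seeds are all assumptions.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for two numeric columns of the Mall-Customer data
# (for example income and spending score); the real records are not shown.
CENTERS = [[20, 20], [20, 80], [55, 50], [90, 20], [90, 80]]
X_raw, _ = make_blobs(n_samples=200, centers=CENTERS, cluster_std=5.0, random_state=42)

# Scale the features so both columns contribute equally to distances.
X = StandardScaler().fit_transform(X_raw)

k_values = list(range(2, 11))
wcss = []          # within-cluster sum of squares, for the elbow method
silhouettes = []   # average silhouette score, for the silhouette method
for k in k_values:
    model = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
    wcss.append(model.inertia_)                           # WCSS of this clustering
    silhouettes.append(silhouette_score(X, model.labels_))

# Silhouette method: pick the k with the highest average score.
best_k = k_values[int(np.argmax(silhouettes))]

# Cluster the data with the chosen k.
labels = KMeans(n_clusters=best_k, n_init=10, random_state=0).fit_predict(X)

for k, w, s in zip(k_values, wcss, silhouettes):
    print(f"k={k}: WCSS={w:.2f}, silhouette={s:.3f}")
print("Chosen k:", best_k)
```

For the elbow method, plotting wcss against k_values (for example with matplotlib) shows the bend visually; the sketch prints the values instead so that it runs without a plotting library.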
Conclusion:
Hence, clustering of the dataset using K-Means Clustering was done successfully.

Evaluation:
