
DEBRE BERHAN UNIVERSITY

COLLEGE OF COMPUTING
DEPARTMENT OF SOFTWARE ENGINEERING
FUNDAMENTALS OF MACHINE LEARNING
COURSE CODE: SEng4091

NAME                        ID NO.
1. Hassen Muhammed          DBUR/0280/13
2. Firdiwek Sisay           DBUR/1510/13
3. Yewoynhareg Mulugeta     DBUR/0035/13
4. Khadar Muhammed          DBUR/3689/13
5. Haileyesus Demes         DBUR/0241/13

Submitted to: Kinde B. (PhD)

Submission date: 06/07/2024
1. Introduction to data preprocessing
#Group members id
#Hassen Muhammed DBUR/0280/13
#Firdiwek Sisay DBUR/1510/13
#Yewoynhareg Mulugeta DBUR/0035/13
#Khadar Muhammed DBUR/3689/13
#Haileyesus Demes DBUR/0241/13

# Import necessary libraries


import pandas as pd
import matplotlib.pyplot as plt

# Load the dataset


# Replace "iris.data" below with the actual path to the file on your
# local machine in case you want to run the code.
# dataset_path = 'path_to_iris.data'
column_names = ['Hassen', 'Firdiwek', 'Yewoyn hareg',
                'Hayleyesus', 'Khadar']
iris_data = pd.read_csv("iris.data", header=None,
                        names=column_names)
# Display the number of rows and columns
rows, columns = iris_data.shape
print(f"Number of rows: {rows}")
print(f"Number of columns: {columns}")

# Display the first five records
print("\nFirst five records:")
print(iris_data.head())
# Display the last five records
print("\nLast five records:")
print(iris_data.tail())

# Display the first ten records
print("\nFirst ten records:")
print(iris_data.head(10))

# Display the statistical summary of the dataset


print("\nStatistical summary:") where is the import statement code?
print(iris_data.describe())

# Display the count of each class in the dataset


print("\nClass count:") where is the import statement code?
print(iris_data['Khadar'].value_counts())
# Extract the independent features (all except the class label)
X = iris_data.drop(columns=['Khadar'])
print(iris_data.iloc[:, :-1].values)
[[5.1 3.5 1.4 0.2]
[4.9 3. 1.4 0.2]
[4.7 3.2 1.3 0.2]
[4.6 3.1 1.5 0.2]
[5. 3.6 1.4 0.2]
[5.4 3.9 1.7 0.4]
[4.6 3.4 1.4 0.3]
[5. 3.4 1.5 0.2]
[4.4 2.9 1.4 0.2]
[4.9 3.1 1.5 0.1]
[5.4 3.7 1.5 0.2]
[4.8 3.4 1.6 0.2]
[4.8 3. 1.4 0.1]
[4.3 3. 1.1 0.1]
[5.8 4. 1.2 0.2]
[5.7 4.4 1.5 0.4]
[5.4 3.9 1.3 0.4]
[5.1 3.5 1.4 0.3]
[5.7 3.8 1.7 0.3]
[5.1 3.8 1.5 0.3]
[5.4 3.4 1.7 0.2]
[5.1 3.7 1.5 0.4]
[4.6 3.6 1. 0.2]
[5.1 3.3 1.7 0.5]
[4.8 3.4 1.9 0.2]
[5. 3. 1.6 0.2]
[5. 3.4 1.6 0.4]
[5.2 3.5 1.5 0.2]
[5.2 3.4 1.4 0.2]
[4.7 3.2 1.6 0.2]
[4.8 3.1 1.6 0.2]
[5.4 3.4 1.5 0.4]
[5.2 4.1 1.5 0.1]
[5.5 4.2 1.4 0.2]
[4.9 3.1 1.5 0.1]
[5. 3.2 1.2 0.2]
[5.5 3.5 1.3 0.2]
[4.9 3.1 1.5 0.1]
[4.4 3. 1.3 0.2]
[5.1 3.4 1.5 0.2]
[5. 3.5 1.3 0.3]
[4.5 2.3 1.3 0.3]
[4.4 3.2 1.3 0.2]
[5. 3.5 1.6 0.6]
[5.1 3.8 1.9 0.4]
[4.8 3. 1.4 0.3]
[5.1 3.8 1.6 0.2]
[4.6 3.2 1.4 0.2]
[5.3 3.7 1.5 0.2]
[5. 3.3 1.4 0.2]
[7. 3.2 4.7 1.4]
[6.4 3.2 4.5 1.5]
[6.9 3.1 4.9 1.5]
[5.5 2.3 4. 1.3]
[6.5 2.8 4.6 1.5]
[5.7 2.8 4.5 1.3]
[6.3 3.3 4.7 1.6]
[4.9 2.4 3.3 1. ]
[6.6 2.9 4.6 1.3]
[5.2 2.7 3.9 1.4]
[5. 2. 3.5 1. ]
[5.9 3. 4.2 1.5]
[6. 2.2 4. 1. ]
[6.1 2.9 4.7 1.4]
[5.6 2.9 3.6 1.3]
[6.7 3.1 4.4 1.4]
[5.6 3. 4.5 1.5]
[5.8 2.7 4.1 1. ]
[6.2 2.2 4.5 1.5]
[5.6 2.5 3.9 1.1]
[5.9 3.2 4.8 1.8]
[6.1 2.8 4. 1.3]
[6.3 2.5 4.9 1.5]
[6.1 2.8 4.7 1.2]
[6.4 2.9 4.3 1.3]
[6.6 3. 4.4 1.4]
[6.8 2.8 4.8 1.4]
[6.7 3. 5. 1.7]
[6. 2.9 4.5 1.5]
[5.7 2.6 3.5 1. ]
[5.5 2.4 3.8 1.1]
[5.5 2.4 3.7 1. ]
[5.8 2.7 3.9 1.2]
[6. 2.7 5.1 1.6]
[5.4 3. 4.5 1.5]
[6. 3.4 4.5 1.6]
[6.7 3.1 4.7 1.5]
[6.3 2.3 4.4 1.3]
[5.6 3. 4.1 1.3]
[5.5 2.5 4. 1.3]
[5.5 2.6 4.4 1.2]
[6.1 3. 4.6 1.4]
[5.8 2.6 4. 1.2]
[5. 2.3 3.3 1. ]
[5.6 2.7 4.2 1.3]
[5.7 3. 4.2 1.2]
[5.7 2.9 4.2 1.3]
[6.2 2.9 4.3 1.3]
[5.1 2.5 3. 1.1]
[5.7 2.8 4.1 1.3]
[6.3 3.3 6. 2.5]
[5.8 2.7 5.1 1.9]
[7.1 3. 5.9 2.1]
[6.3 2.9 5.6 1.8]
[6.5 3. 5.8 2.2]
[7.6 3. 6.6 2.1]
[4.9 2.5 4.5 1.7]
[7.3 2.9 6.3 1.8]
[6.7 2.5 5.8 1.8]
[7.2 3.6 6.1 2.5]
[6.5 3.2 5.1 2. ]
[6.4 2.7 5.3 1.9]
[6.8 3. 5.5 2.1]
[5.7 2.5 5. 2. ]
[5.8 2.8 5.1 2.4]
[6.4 3.2 5.3 2.3]
[6.5 3. 5.5 1.8]
[7.7 3.8 6.7 2.2]
[7.7 2.6 6.9 2.3]
[6. 2.2 5. 1.5]
[6.9 3.2 5.7 2.3]
[5.6 2.8 4.9 2. ]
[7.7 2.8 6.7 2. ]
[6.3 2.7 4.9 1.8]
[6.7 3.3 5.7 2.1]
[7.2 3.2 6. 1.8]
[6.2 2.8 4.8 1.8]
[6.1 3. 4.9 1.8]
[6.4 2.8 5.6 2.1]
[7.2 3. 5.8 1.6]
[7.4 2.8 6.1 1.9]
[7.9 3.8 6.4 2. ]
[6.4 2.8 5.6 2.2]
[6.3 2.8 5.1 1.5]
[6.1 2.6 5.6 1.4]
[7.7 3. 6.1 2.3]
[6.3 3.4 5.6 2.4]
[6.4 3.1 5.5 1.8]
[6. 3. 4.8 1.8]
[6.9 3.1 5.4 2.1]
[6.7 3.1 5.6 2.4]
[6.9 3.1 5.1 2.3]
[5.8 2.7 5.1 1.9]
[6.8 3.2 5.9 2.3]
[6.7 3.3 5.7 2.5]
[6.7 3. 5.2 2.3]
[6.3 2.5 5. 1.9]
[6.5 3. 5.2 2. ]
[6.2 3.4 5.4 2.3]
[5.9 3. 5.1 1.8]]

# Extract the dependent feature (the class label)


y = iris_data['Khadar']
print(iris_data.iloc[:, 4].values)
['Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa'
'Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa'
'Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa'
'Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa'
'Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa'
'Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa'
'Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa'
'Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa'
'Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa'
'Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa' 'Iris-setosa'
'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor'
'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor'
'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor'
'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor'
'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor'
'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor'
'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor'
'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor'
'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor'
'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor'
'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor'
'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor' 'Iris-versicolor'
'Iris-versicolor' 'Iris-versicolor' 'Iris-virginica' 'Iris-virginica'
'Iris-virginica' 'Iris-virginica' 'Iris-virginica' 'Iris-virginica'
'Iris-virginica' 'Iris-virginica' 'Iris-virginica' 'Iris-virginica'
'Iris-virginica' 'Iris-virginica' 'Iris-virginica' 'Iris-virginica'
'Iris-virginica' 'Iris-virginica' 'Iris-virginica' 'Iris-virginica'
'Iris-virginica' 'Iris-virginica' 'Iris-virginica' 'Iris-virginica'
'Iris-virginica' 'Iris-virginica' 'Iris-virginica' 'Iris-virginica'
'Iris-virginica' 'Iris-virginica' 'Iris-virginica' 'Iris-virginica'
'Iris-virginica' 'Iris-virginica' 'Iris-virginica' 'Iris-virginica'
'Iris-virginica' 'Iris-virginica' 'Iris-virginica' 'Iris-virginica'
'Iris-virginica' 'Iris-virginica' 'Iris-virginica' 'Iris-virginica'
'Iris-virginica' 'Iris-virginica' 'Iris-virginica' 'Iris-virginica'
'Iris-virginica' 'Iris-virginica' 'Iris-virginica' 'Iris-virginica']

# Plotting the histograms


X.hist(figsize=(10, 8))
plt.suptitle("Histograms of Iris Dataset Features")
plt.show()

# Plotting the density plots
X.plot(kind='density', subplots=True, layout=(2, 2),
       sharex=False, figsize=(10, 8))
plt.suptitle("Density Plots of Iris Dataset Features")
plt.show()

# Plotting the boxplots
X.plot(kind='box', subplots=True, layout=(2, 2), sharex=False,
       sharey=False, figsize=(10, 8))
plt.suptitle("Boxplots of Iris Dataset Features")
plt.show()

2. Advanced data preprocessing

import numpy as np
import pandas as pd

from sklearn.impute import SimpleImputer, KNNImputer


from sklearn.preprocessing import MinMaxScaler, StandardScaler

# Load the IRIS dataset


file_path = '/project2.csv'
column_names = ['khadar','furd','hassan','hylayasu','hareg']
iris = pd.read_csv(file_path, header=None, names=column_names)

# Step 2: Introduce missing values into each feature


# Read the CSV file
missing_data = pd.read_csv(file_path)
# Introduce missing values into each feature
# (note: sample(frac=0) selects no rows, so this loop does not actually blank
# any cells; the NaN entries printed below already exist in the CSV file)
for col in missing_data.columns:
    iris.loc[missing_data.sample(frac=0).index, col] = np.nan
print(missing_data[:10])

khadar hassan furdwik hylayassu hareg


0 NaN 3.5 1.4 0.2 Iris-setosa
1 4.9 3.0 1.4 0.2 Iris-setosa
2 NaN 3.2 NaN 0.2 Iris-setosa
3 4.6 3.1 NaN 0.2 Iris-setosa
4 5.0 3.6 1.4 0.2 Iris-setosa
5 5.4 NaN 1.7 0.4 Iris-setosa
6 4.6 3.4 1.4 0.3 Iris-setosa
7 5.0 3.4 1.5 0.2 Iris-setosa
8 4.4 2.9 1.4 0.2 Iris-setosa
9 4.9 3.1 1.5 0.1 Iris-setosa
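If the intent is for the loop itself to introduce the missing values, rather than rely on gaps already present in the CSV, a minimal sketch along these lines could be used. The 10% fraction and the demo frame are assumptions for illustration:

# Hypothetical sketch (not part of the submitted code): blank roughly 10% of
# each numeric column at random, working on a copy so the original frame is
# left untouched.
demo = missing_data.copy()
for col in demo.columns[:-1]:                      # skip the class-label column
    blank_idx = demo[col].sample(frac=0.1).index   # ~10% of rows, chosen at random
    demo.loc[blank_idx, col] = np.nan
print(demo.isna().sum())                           # NaN count per column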
# Step 3: Impute missing values with mean
imputer_mean = SimpleImputer(strategy='mean')
iris_mean_imputed = pd.read_csv(file_path)
iris_mean_imputed.iloc[:, :-1] = imputer_mean.fit_transform(
    iris_mean_imputed.iloc[:, :-1])
print(iris_mean_imputed[:10])
khadar hassan furdwik hylayassu hareg
0 5.856081 3.500000 1.400000 0.2 Iris-setosa
1 4.900000 3.000000 1.400000 0.2 Iris-setosa
2 5.856081 3.200000 3.790541 0.2 Iris-setosa
3 4.600000 3.100000 3.790541 0.2 Iris-setosa
4 5.000000 3.600000 1.400000 0.2 Iris-setosa
5 5.400000 3.048322 1.700000 0.4 Iris-setosa
6 4.600000 3.400000 1.400000 0.3 Iris-setosa
7 5.000000 3.400000 1.500000 0.2 Iris-setosa
8 4.400000 2.900000 1.400000 0.2 Iris-setosa
9 4.900000 3.100000 1.500000 0.1 Iris-setosa
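The repeated 5.856081 in the first column is that column's mean computed from its observed values, which is exactly what SimpleImputer(strategy='mean') substitutes for each NaN. A toy illustration with made-up numbers:

# Toy illustration (hypothetical data): the NaN is replaced by (1.0 + 4.0) / 2 = 2.5.
import numpy as np
from sklearn.impute import SimpleImputer
toy = np.array([[1.0], [np.nan], [4.0]])
print(SimpleImputer(strategy='mean').fit_transform(toy))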
# Step 4: Adjust precision to 2 decimal places
df_mean_imputed = iris_mean_imputed.copy()
df_mean_imputed = df_mean_imputed.round(2)
print(df_mean_imputed[:10])
khadar hassan furdwik hylayassu hareg
0 5.86 3.50 1.40 0.2 Iris-setosa
1 4.90 3.00 1.40 0.2 Iris-setosa
2 5.86 3.20 3.79 0.2 Iris-setosa
3 4.60 3.10 3.79 0.2 Iris-setosa
4 5.00 3.60 1.40 0.2 Iris-setosa
5 5.40 3.05 1.70 0.4 Iris-setosa
6 4.60 3.40 1.40 0.3 Iris-setosa
7 5.00 3.40 1.50 0.2 Iris-setosa
8 4.40 2.90 1.40 0.2 Iris-setosa
9 4.90 3.10 1.50 0.1 Iris-setosa
# Step 5: Impute missing values with the most frequent value
imputer = SimpleImputer(strategy='most_frequent')
df_most_frequent = pd.read_csv("/project2.csv")
df_most_frequent = pd.DataFrame(imputer.fit_transform(iris),
                                columns=column_names)

# Display the first few rows after imputation


print(df_most_frequent[:10])
khadar furd hassan hylayasu hareg
0 khadar hassan furdwik hylayassu hareg
1 5 3.5 1.4 0.2 Iris-setosa
2 4.9 3 1.4 0.2 Iris-setosa
3 5 3.2 1.5 0.2 Iris-setosa
4 4.6 3.1 1.5 0.2 Iris-setosa
5 5 3.6 1.4 0.2 Iris-setosa
6 5.4 3 1.7 0.4 Iris-setosa
7 4.6 3.4 1.4 0.3 Iris-setosa
8 5 3.4 1.5 0.2 Iris-setosa
9 4.4 2.9 1.4 0.2 Iris-setosa
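Row 0 of this output is the file's own header line appearing as data, a side effect of reading the CSV with header=None although it already contains a header row. A minimal sketch of avoiding that, assuming the same file layout, is to let pandas consume the header:

# Hypothetical fix: let pandas use the file's own header row (the default
# header=0), so it is not carried along as a data row during imputation.
iris_with_header = pd.read_csv(file_path, header=0)
df_most_frequent_fixed = pd.DataFrame(
    SimpleImputer(strategy='most_frequent').fit_transform(iris_with_header),
    columns=iris_with_header.columns)
print(df_most_frequent_fixed[:10])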
# Step 6: Impute missing values with a constant value of 100
imputer = SimpleImputer(strategy='constant', fill_value=100)
df_constant = pd.read_csv('/content/project2.csv')
df_constant = pd.DataFrame(imputer.fit_transform(df_constant),
                           columns=df_constant.columns)

# Display the first few rows after imputation
print(df_constant[:10])

khadar hassan furdwik hylayassu hareg


0 100 3.5 1.4 0.2 Iris-setosa
1 4.9 3.0 1.4 0.2 Iris-setosa
2 100 3.2 100 0.2 Iris-setosa
3 4.6 3.1 100 0.2 Iris-setosa
4 5.0 3.6 1.4 0.2 Iris-setosa

# Step 7: Impute missing values with KNN where N=2


imputer_knn = KNNImputer(n_neighbors=2)
iris_knn_imputed = pd.read_csv('/content/project2.csv')
iris_knn_imputed.iloc[:, :-1] = imputer_knn.fit_transform(
    iris_knn_imputed.iloc[:, :-1])
print(iris_knn_imputed[:10])

khadar hassan furdwik hylayassu hareg


0 5.35 3.5 1.4 0.2 Iris-setosa
1 4.90 3.0 1.4 0.2 Iris-setosa
2 4.85 3.2 1.4 0.2 Iris-setosa
3 4.60 3.1 1.5 0.2 Iris-setosa
4 5.00 3.6 1.4 0.2 Iris-setosa
5 5.40 3.4 1.7 0.4 Iris-setosa
6 4.60 3.4 1.4 0.3 Iris-setosa
7 5.00 3.4 1.5 0.2 Iris-setosa
8 4.40 2.9 1.4 0.2 Iris-setosa
9 4.90 3.1 1.5 0.1 Iris-setosa
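As a sanity check on what KNNImputer(n_neighbors=2) does: each missing entry is filled with the mean of that column over the two rows closest to it, using a NaN-aware Euclidean distance on the observed columns. A toy example with made-up numbers:

# Toy illustration (hypothetical data, not the iris file):
import numpy as np
from sklearn.impute import KNNImputer

toy = np.array([[1.0, 2.0],
                [2.0, np.nan],
                [3.0, 6.0],
                [8.0, 8.0]])
print(KNNImputer(n_neighbors=2).fit_transform(toy))
# The NaN becomes (2.0 + 6.0) / 2 = 4.0, the mean of the second column over
# the two nearest rows, [1.0, 2.0] and [3.0, 6.0].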
# Step 8: Delete records with missing values
df_no_missing = missing_data.dropna()
print(df_no_missing[:10])
khadar hassan furdwik hylayassu hareg
1 4.9 3.0 1.4 0.2 Iris-setosa
4 5.0 3.6 1.4 0.2 Iris-setosa
6 4.6 3.4 1.4 0.3 Iris-setosa
7 5.0 3.4 1.5 0.2 Iris-setosa
8 4.4 2.9 1.4 0.2 Iris-setosa
9 4.9 3.1 1.5 0.1 Iris-setosa
10 5.4 3.7 1.5 0.2 Iris-setosa
11 4.8 3.4 1.6 0.2 Iris-setosa
12 4.8 3.0 1.4 0.1 Iris-setosa
13 4.3 3.0 1.1 0.1 Iris-setosa
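Note that dropna() with no arguments removes any row containing at least one missing value, which is why indices 0, 2, 3 and 5 are absent above. If only fully empty rows should go, or only certain columns should be checked, pandas exposes that too; a small sketch (the column name is taken from the output headers above):

# dropna() options: how='all' keeps rows unless every value is missing;
# subset=[...] restricts the NaN check to the listed columns.
rows_all_missing_dropped = missing_data.dropna(how='all')
rows_checked_on_one_col = missing_data.dropna(subset=['hassan'])
print(len(missing_data), len(rows_all_missing_dropped), len(rows_checked_on_one_col))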
# Step 9: Min-Max Normalization
scaler_min_max = MinMaxScaler()
iris_min_max_normalized = df_mean_imputed.drop(columns=['hareg']).copy()
iris_min_max_normalized.iloc[:, :] = scaler_min_max.fit_transform(
    iris_min_max_normalized.iloc[:, :])
iris_min_max_normalized['hareg'] = df_mean_imputed['hareg']

print(iris_min_max_normalized[:10])

0 0.433333 0.625000 0.067797 0.041667 Iris-setosa


1 0.166667 0.416667 0.067797 0.041667 Iris-setosa
2 0.433333 0.500000 0.472881 0.041667 Iris-setosa
3 0.083333 0.458333 0.472881 0.041667 Iris-setosa
4 0.194444 0.666667 0.067797 0.041667 Iris-setosa
5 0.305556 0.437500 0.118644 0.125000 Iris-setosa
6 0.083333 0.583333 0.067797 0.083333 Iris-setosa
7 0.194444 0.583333 0.084746 0.041667 Iris-setosa
8 0.027778 0.375000 0.067797 0.041667 Iris-setosa
9 0.166667 0.458333 0.084746 0.000000 Iris-setosa
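For reference, MinMaxScaler applies x' = (x - min) / (max - min) column by column, so every feature lands in [0, 1]. A quick hand check with assumed numbers that mirror the first feature (minimum 4.3, maximum 7.9):

# Tiny hand check of the min-max formula (values assumed for illustration):
import numpy as np
col = np.array([4.3, 5.0, 7.9])
print((col - col.min()) / (col.max() - col.min()))  # ~[0.0, 0.194, 1.0]
# 0.194 is consistent with the scaled values printed above.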
# Step 10: Z-Score Normalization
scaler_z_score = StandardScaler()
df_z_score_scaled = pd.DataFrame(
    scaler_z_score.fit_transform(df_mean_imputed.drop(columns='hareg')),
    columns=df_mean_imputed.columns[:-1])
df_z_score_scaled['hareg'] = df_mean_imputed['hareg']

print(df_z_score_scaled[:10])

khadar hassan furdwik hylayassu hareg


0 0.004729 1.058877 -1.376256 -1.312977 Iris-setosa
1 -1.169357 -0.113312 -1.376256 -1.312977 Iris-setosa
2 0.004729 0.355564 -0.000307 -1.312977 Iris-setosa
3 -1.536259 0.121126 -0.000307 -1.312977 Iris-setosa
4 -1.047056 1.293314 -1.376256 -1.312977 Iris-setosa
5 -0.557854 0.003907 -1.203542 -1.050031 Iris-setosa
6 -1.536259 0.824439 -1.376256 -1.181504 Iris-setosa
7 -1.047056 0.824439 -1.318685 -1.312977 Iris-setosa
8 -1.780860 -0.347749 -1.376256 -1.312977 Iris-setosa
9 -1.169357 0.121126 -1.318685 -1.444450 Iris-setosa
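Similarly, StandardScaler applies z = (x - mean) / std per column (using the population standard deviation), so each scaled feature has mean 0 and standard deviation 1. A toy check with made-up numbers:

# Toy check of z-score standardization (hypothetical values):
import numpy as np
col = np.array([4.0, 5.0, 6.0])
print((col - col.mean()) / col.std())  # [-1.2247, 0.0, 1.2247]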
df = pd.DataFrame({'Age': [42, 15, 67, 55, 1, 29, 75, 89, 4, 10, 15, 38,
                           22, 77]})

print("Before Transformation: ")
print(df)
Before Transformation:
Age
0 42
1 15
2 67
3 55
4 1
5 29
6 75
7 89
8 4
9 10
10 15
11 38
12 22
13 77
Label = pd.cut(x=df['Age'], bins=[0, 3, 7, 17, 63, 99],
               labels=['Baby', 'Child', 'Teenage', 'Adult', 'Elderly'])

# Print the result after binning the continuous ages into categories
print("After: ")
print(Label)
After:
0 Adult
1 Teenage
2 Elderly
3 Adult
4 Baby
5 Adult
6 Elderly
7 Elderly
8 Child
9 Teenage
10 Teenage
11 Adult
12 Adult
13 Elderly
Name: Age, dtype: category
Categories (5, object): ['Baby' < 'Child' < 'Teenage' < 'Adult' <
'Elderly']
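pd.cut uses right-closed intervals by default, so bins=[0, 3, 7, 17, 63, 99] means (0, 3] is Baby, (3, 7] Child, (7, 17] Teenage, (17, 63] Adult and (63, 99] Elderly; that is why 15 maps to Teenage and 67 to Elderly above. A quick check on a few boundary ages:

# Quick boundary check for the bin edges used above:
import pandas as pd
print(pd.cut([3, 17, 18, 63, 64], bins=[0, 3, 7, 17, 63, 99],
             labels=['Baby', 'Child', 'Teenage', 'Adult', 'Elderly']))
# -> ['Baby', 'Teenage', 'Adult', 'Adult', 'Elderly']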
# Check the number of values in each bin
print("Categories: ")
print(Label.value_counts())
Categories:
Age
Adult 5
Elderly 4
Teenage 3
Baby 1
Child 1
Name: count, dtype: int64
data = pd.concat([df, Label], axis=1)
print ("\n \n \n Merged Data \n \n", data)

Merged Data

Age Age
0 42 Adult
1 15 Teenage
2 67 Elderly
3 55 Adult
4 1 Baby
5 29 Adult
6 75 Elderly
7 89 Elderly
8 4 Child
9 10 Teenage
10 15 Teenage
11 38 Adult
12 22 Adult
13 77 Elderly
