#Creating A Dataset #Creating Target Variable: Import As Import As

1. The document creates a training dataset of 40 individuals (20 male, 20 female) with gender, height, weight, and foot size, plus a separate set of 8 labeled test instances.
2. It then calculates the mean and variance of each feature, grouped by gender.
3. A probability function is defined that evaluates the Gaussian density of a feature value given the mean and variance for a particular gender.
4. This function is used to compute, for each test instance, the unnormalized posterior (the class prior times the product of the feature densities) for the male and female classes, and to predict the more likely gender. The accuracy of these predictions is reported.
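The decision rule summarized above can be written compactly. The notation below is an assumption introduced here for illustration, not taken from the script: $x_j$ is the value of feature $j$, $\mu_{y,j}$ and $\sigma^2_{y,j}$ are the per-class mean and variance of that feature, and $P(y)$ is the class prior.

```latex
\hat{y} \;=\; \arg\max_{y \in \{\text{male},\,\text{female}\}}
  \; P(y) \prod_{j} \frac{1}{\sqrt{2\pi\sigma^2_{y,j}}}
  \exp\!\left( -\frac{(x_j - \mu_{y,j})^2}{2\sigma^2_{y,j}} \right)
```

The product over features reflects the naive assumption that the features are conditionally independent given the class. The shared evidence term is omitted because it does not change which class attains the maximum.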

Uploaded by

badeni


import numpy as np

import pandas as pd
#creating a dataset
person = pd.DataFrame()
#creating target variable
person['Gender'] = ['male','male','male','male','female','female','female','female']
#creating our feature variables
person['Height'] = [6,5.92,5.58,5.92,5,5.5,5.42,5.75]
person['Weight'] = [180,190,170,165,100,150,130,150]
person['Foot_Size'] = [12,11,12,10,6,8,7,9]
#view the test instances
print('\n Test Instance: ')
print(" ")
print(person)
#create an empty data frame for the training data
data = pd.DataFrame()
#creating the target and feature values (20 male rows, then 20 female rows)
data['Gender'] = ['male'] * 20 + ['female'] * 20
data['Height'] = [5.82,5.77,5.87,5.99,6.07,6.13,6.06,5.99,6.21,5.81,
                  5.57,5.15,6.02,5.93,5.91,5.63,5.86,5.93,5.59,5.77,
                  5.60,5.40,5,5.75,5.7,5.2,5.1,5.73,5.74,5,
                  5.8,5.77,5.82,5.60,5.40,5,5.75,5.43,5.12,5.55]
data['Weight'] = [172,171,180,163,169,181,185,168,166,164,
                  175,172,167,140,174,183,133,111,162,177,
                  154,134,137,150,155,136,132,140,154,146,
                  141,145,142,158,155,155,152,150,139,160]
data['Foot_Size'] = [10,11,12,11,12,11,12,13,13,10,
                     11,13,12,12,6,7,12,13,8,9,
                     7,6,5,9,5,6,5,7,6,5,
                     5,9,5,7,6,6,9,12,9,10]
#view the training data
print("\n Dataset")
print("")
print(data)
n_male = data['Gender'][data['Gender'] == 'male'].count()
n_male
n_female = data['Gender'][data['Gender'] == 'female'].count()
n_female
#total rows
total_ppl = data['Gender'].count()
total_ppl
#no of males divided by the total rows
p_male = n_male / total_ppl #(20/40)
p_male
p_female = n_female / total_ppl #(20/40)
p_female
# group the data by gender & calculate the means of each feature
# for eg - male height mean = (sum of the 20 male heights) / 20
data_means = data.groupby('Gender').mean()
data_means
#calculate of mean
print('\n Dataset Mean')
print(" ")
print(data_means)
# calculate the data variance
# pandas var() returns the sample variance by default:
# variance = summation of((x - mean) ** 2) / (n - 1)
data_variance = data.groupby('Gender').var()
print(data_variance)
#mean for male
male_height_mean = data_means['Height'][data_means.index == 'male'].values[0]
male_weight_mean = data_means['Weight'][data_means.index == 'male'].values[0]
male_footsize_mean = data_means['Foot_Size'][data_means.index == 'male'].values[0]
print("male_height_mean: ", male_height_mean)
print("male_weight_mean: ", male_weight_mean)
print("male_footsize_mean: ", male_footsize_mean)
#variance for male
male_height_variance = data_variance['Height'][data_variance.index == 'male'].values[0]
male_weight_variance = data_variance['Weight'][data_variance.index == 'male'].values[0]
male_footsize_variance = data_variance['Foot_Size'][data_variance.index == 'male'].values[0]
print("male_height_variance: ",male_height_variance)
print("male_weight_variance: ",male_weight_variance)
print("male_footsize_variance: ",male_footsize_variance)
# for female now
# mean for female
female_height_mean = data_means['Height'][data_means.index == 'female'].values[0]
female_weight_mean = data_means['Weight'][data_means.index == 'female'].values[0]
female_footsize_mean = data_means['Foot_Size'][data_means.index == 'female'].values[0]
print("female_height_mean: ", female_height_mean)
print("female_weight_mean: ", female_weight_mean)
print("female_footsize_mean: ", female_footsize_mean)
#variance for female
female_height_variance = data_variance['Height'][data_variance.index == 'female'].values[0]
female_weight_variance = data_variance['Weight'][data_variance.index == 'female'].values[0]
female_footsize_variance = data_variance['Foot_Size'][data_variance.index == 'female'].values[0]
print("female_height_variance: ",female_height_variance)
print("female_weight_variance: ",female_weight_variance)
print("female_footsize_variance: ",female_footsize_variance)
# create a function which calculates p(x|y)
def p_x_given_y(x, mean_y, variance_y):
    #input the arguments into a Gaussian probability density function
    p = 1/(np.sqrt(2*np.pi*variance_y)) * np.exp((-(x-mean_y) ** 2)/(2*variance_y))
    return p
count = 0
# numerator of the posterior for each test instance, for both classes
for i in range(len(person)):
    print('\n Probability male: ')
    prob_male = (p_male
                 * p_x_given_y(person['Height'][i], male_height_mean, male_height_variance)
                 * p_x_given_y(person['Weight'][i], male_weight_mean, male_weight_variance)
                 * p_x_given_y(person['Foot_Size'][i], male_footsize_mean, male_footsize_variance))
    print(prob_male)
    print('\n Probability female: ')
    prob_female = (p_female
                   * p_x_given_y(person['Height'][i], female_height_mean, female_height_variance)
                   * p_x_given_y(person['Weight'][i], female_weight_mean, female_weight_variance)
                   * p_x_given_y(person['Foot_Size'][i], female_footsize_mean, female_footsize_variance))
    print(prob_female)
    # predict the class with the larger posterior numerator
    if prob_male > prob_female:
        print(f"target label: male for {i} ")
        if person['Gender'][i] == 'male':
            count += 1
    else:
        print(f"target label: Female for {i} ")
        if person['Gender'][i] == 'female':
            count += 1
# accuracy over all test instances (not a hard-coded 8)
print(f"Accuracy {(count / len(person)) * 100}")
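The same classifier can be sketched more compactly by keeping the per-class statistics in data frames and comparing log-scores instead of raw products. This is a minimal sketch under stated assumptions, not the script's own method: the helper names `fit_gaussian_nb` and `predict_gaussian_nb` are hypothetical, the eight-row `person` table is reused here as training data purely to keep the example short, and the observation `sample` is an illustrative value that does not appear in the script. Summing log-densities is a swapped-in variant that avoids underflow when many small densities are multiplied; it selects the same class as the product form.

```python
import numpy as np
import pandas as pd

# Eight labeled rows from the script's `person` table, used here as training data
# (an assumption made only to keep this example short).
train = pd.DataFrame({
    'Gender': ['male', 'male', 'male', 'male',
               'female', 'female', 'female', 'female'],
    'Height': [6, 5.92, 5.58, 5.92, 5, 5.5, 5.42, 5.75],
    'Weight': [180, 190, 170, 165, 100, 150, 130, 150],
    'Foot_Size': [12, 11, 12, 10, 6, 8, 7, 9],
})

def fit_gaussian_nb(frame, target):
    """Return class priors, per-class feature means, and sample variances."""
    grouped = frame.groupby(target)
    priors = grouped.size() / len(frame)
    means = grouped.mean()
    variances = grouped.var()  # pandas default ddof=1, i.e. divide by n - 1
    return priors, means, variances

def predict_gaussian_nb(row, priors, means, variances):
    """Pick the class with the largest log-prior plus summed Gaussian log-density."""
    scores = {}
    for label in priors.index:
        mu = means.loc[label]
        var = variances.loc[label]
        x = row[mu.index]  # select the features in the same order as the statistics
        log_density = -0.5 * (np.log(2 * np.pi * var) + (x - mu) ** 2 / var)
        scores[label] = np.log(priors[label]) + log_density.sum()
    return max(scores, key=scores.get)

priors, means, variances = fit_gaussian_nb(train, 'Gender')

# Hypothetical unlabeled observation (not part of the original script).
sample = pd.Series({'Height': 6, 'Weight': 130, 'Foot_Size': 8})
print(predict_gaussian_nb(sample, priors, means, variances))  # prints "female"
```

To classify every row of a test frame, the same function can be applied row by row, for example with `test[['Height', 'Weight', 'Foot_Size']].apply(predict_gaussian_nb, axis=1, args=(priors, means, variances))`, and the result compared against the true labels to obtain the accuracy.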
