Copy of ML - Assignment

Uploaded by

Decoy Mail


import numpy as np

import pandas as pd

pd.options.display.width = 1000
pd.options.display.max_rows = 500
pd.options.display.max_columns = 500  # widen the display so each row prints on one line, which is easier to read

from google.colab import drive

drive.mount('/content/drive')

Drive already mounted at /content/drive; to attempt to forcibly remount, call drive.mount("/content/drive", force_remount=True).

df = pd.read_csv('/content/drive/MyDrive/Colab Notebooks/pokemon_data.csv')
sample = df[df['Legendary']==True].head(10)
print(sample)
sample['Attack']

       #                 Name    Type 1    Type 2   HP  Attack  Defense  Sp. Atk  Sp. Def  Speed  Generation  Legendary
156  144             Articuno       Ice    Flying   90      85      100       95      125     85           1       True
157  145               Zapdos  Electric    Flying   90      90       85      125       90    100           1       True
158  146              Moltres      Fire    Flying   90     100       90      125       85     90           1       True
162  150               Mewtwo   Psychic       NaN  106     110       90      154       90    130           1       True
163  150  MewtwoMega Mewtwo X   Psychic  Fighting  106     190      100      154      100    130           1       True
164  150  MewtwoMega Mewtwo Y   Psychic       NaN  106     150       70      194      120    140           1       True
262  243               Raikou  Electric       NaN   90      85       75      115      100    115           2       True
263  244                Entei      Fire       NaN  115     115       85       90       75    100           2       True
264  245              Suicune     Water       NaN  100      75      115       90      115     85           2       True
269  249                Lugia   Psychic    Flying  106      90      130       90      154    110           2       True

156 85
157 90
158 100
162 110
163 190
164 150
262 85
263 115
264 75
269 90
Name: Attack, dtype: int64

p = sample[sample['Attack']>100]
p

       #                 Name    Type 1    Type 2   HP  Attack  Defense  Sp. Atk  Sp. Def  Speed  Generation  Legendary
162  150               Mewtwo   Psychic       NaN  106     110       90      154       90    130           1       True
163  150  MewtwoMega Mewtwo X   Psychic  Fighting  106     190      100      154      100    130           1       True
164  150  MewtwoMega Mewtwo Y   Psychic       NaN  106     150       70      194      120    140           1       True
263  244                Entei      Fire       NaN  115     115       85       90       75    100           2       True

n = sample[sample['Attack']<=100]
n
# Of the 10 rows, 6 are negative (Attack <= 100) and 4 are positive (Attack > 100)

       #                 Name    Type 1    Type 2   HP  Attack  Defense  Sp. Atk  Sp. Def  Speed  Generation  Legendary
156  144             Articuno       Ice    Flying   90      85      100       95      125     85           1       True
157  145               Zapdos  Electric    Flying   90      90       85      125       90    100           1       True
158  146              Moltres      Fire    Flying   90     100       90      125       85     90           1       True
262  243               Raikou  Electric       NaN   90      85       75      115      100    115           2       True
264  245              Suicune     Water       NaN  100      75      115       90      115     85           2       True
269  249                Lugia   Psychic    Flying  106      90      130       90      154    110           2       True
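
The class counts quoted in the comment above (4 positive, 6 negative) can be checked with a boolean mask instead of counting rows by eye. This is a minimal sketch that assumes only the ten Attack values printed earlier; the names attack, is_positive, n_pos and n_neg are illustrative and do not appear in the original notebook.

```python
import pandas as pd

# Stand-in for sample['Attack']: the ten values printed earlier in the notebook.
attack = pd.Series([85, 90, 100, 110, 190, 150, 85, 115, 75, 90], name="Attack")

# Boolean mask: True where the row counts as "positive" (Attack > 100).
is_positive = attack > 100

# Summing a boolean Series counts the True entries.
n_pos = int(is_positive.sum())
n_neg = int((~is_positive).sum())

print("positive:", n_pos, "negative:", n_neg)
```

The same mask could be reused to build p and n, which keeps the threshold in one place.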

# Let's calculate the entropy of the root node, which holds p = 4 positive and n = 6 negative rows (total = 10)

# Entropy for root node:
# -(p/total)*np.log2(p/total) - (n/total)*np.log2(n/total)

enR = -(4/10)*np.log2(4/10)-(6/10)*np.log2(6/10)
print(enR)

0.9709505944546686
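
The hand-typed expression above can be wrapped in a small helper so the same formula is reused for every node. This is a sketch under the assumption that each node is described by its positive and negative row counts; binary_entropy is a hypothetical name, not something defined in the notebook.

```python
import numpy as np

def binary_entropy(pos, neg):
    """Shannon entropy, in bits, of a node with `pos` positive and `neg` negative rows."""
    total = pos + neg
    entropy = 0.0
    for count in (pos, neg):
        if count > 0:  # by convention 0 * log2(0) contributes 0
            frac = count / total
            entropy -= frac * np.log2(frac)
    return float(entropy)

# Root node of the notebook: 4 positive, 6 negative rows.
root_entropy = binary_entropy(4, 6)
print(root_entropy)
```

Skipping zero counts avoids the NaN that np.log2(0) would otherwise introduce for a pure node.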

# The root node holds 4 positive and 6 negative rows

# Now let's build the left node and right node by splitting on the 'Generation' column

LN = n[n['Generation']==1] ,p[p['Generation']==1]
print(LN)
# 6 rows in total: 3 negative and 3 positive

(      #                 Name    Type 1    Type 2   HP  Attack  Defense  Sp. Atk  Sp. Def  Speed  Generation  Legendary
156  144             Articuno       Ice    Flying   90      85      100       95      125     85           1       True
157  145               Zapdos  Electric    Flying   90      90       85      125       90    100           1       True
158  146              Moltres      Fire    Flying   90     100       90      125       85     90           1       True,
       #                 Name    Type 1    Type 2   HP  Attack  Defense  Sp. Atk  Sp. Def  Speed  Generation  Legendary
162  150               Mewtwo   Psychic       NaN  106     110       90      154       90    130           1       True
163  150  MewtwoMega Mewtwo X   Psychic  Fighting  106     190      100      154      100    130           1       True
164  150  MewtwoMega Mewtwo Y   Psychic       NaN  106     150       70      194      120    140           1       True)

RN = n[n['Generation']==2],p[p['Generation']==2]
print(RN)
# 4 rows in total: 3 negative and 1 positive

(      #                 Name    Type 1    Type 2   HP  Attack  Defense  Sp. Atk  Sp. Def  Speed  Generation  Legendary
262  243               Raikou  Electric       NaN   90      85       75      115      100    115           2       True
264  245              Suicune     Water       NaN  100      75      115       90      115     85           2       True
269  249                Lugia   Psychic    Flying  106      90      130       90      154    110           2       True,
       #                 Name    Type 1    Type 2   HP  Attack  Defense  Sp. Atk  Sp. Def  Speed  Generation  Legendary
263  244                Entei      Fire       NaN  115     115       85       90       75    100           2       True)
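
The per-node counts read off the two printed tuples (3 positive and 3 negative on the left, 1 positive and 3 negative on the right) can also be produced in one step by grouping on the split column. This is a sketch that assumes a toy frame holding only the Attack and Generation values printed above; toy, positive and per_node are illustrative names.

```python
import pandas as pd

# Toy stand-in for `sample`, limited to the two columns the split needs.
toy = pd.DataFrame({
    "Attack":     [85, 90, 100, 110, 190, 150, 85, 115, 75, 90],
    "Generation": [1, 1, 1, 1, 1, 1, 2, 2, 2, 2],
})

# Label each row: positive when Attack > 100, as in the notebook.
toy["positive"] = toy["Attack"] > 100

# One row per child node: number of positives and total rows.
per_node = toy.groupby("Generation")["positive"].agg(positives="sum", total="count")
per_node["negatives"] = per_node["total"] - per_node["positives"]

print(per_node)
```

Grouping this way scales to splits with more than two branches without writing one filter per branch.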

# Entropy of LN: it holds p = 3 positive and n = 3 negative rows (the full sample had 6 negative and 4 positive)

# Entropy for left node

entropyLN = -(3/6)*np.log2(3/6)-(3/6)*np.log2(3/6)
print(entropyLN)

1.0

# RN holds p = 1 positive and n = 3 negative rows, total = 4

# Entropy for right node

entropyRN = -(1/4)*np.log2(1/4)-(3/4)*np.log2(3/4)
print(entropyRN)

0.8112781244591328

# Now compare each child node's entropy with the root: E(root) - E(node).
# Note: the information gain of a split is usually defined as E(root) minus the
# size-weighted average of the child entropies; the values below are per-node differences.
# Left node entropy is 1.0, right node entropy is ~0.8113, and root node entropy is ~0.9710.

print('for left node')

IG_left = enR - entropyLN
print(IG_left)

print('\n\t')

print('for right node')

IG_right = enR - entropyRN
print(IG_right)

for left node

-0.02904940554533142

for right node

0.15967246999553575
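
The two numbers above are differences between the root entropy and each child's entropy taken separately, which is why the left one comes out negative. The information gain of a split is conventionally the root entropy minus the average of the child entropies weighted by the share of rows in each child, and that quantity is never negative. This is a minimal sketch of that calculation using the counts from this notebook; binary_entropy and information_gain are hypothetical helper names.

```python
import numpy as np

def binary_entropy(pos, neg):
    """Shannon entropy, in bits, of a node with `pos` positive and `neg` negative rows."""
    total = pos + neg
    entropy = 0.0
    for count in (pos, neg):
        if count > 0:  # by convention 0 * log2(0) contributes 0
            frac = count / total
            entropy -= frac * np.log2(frac)
    return float(entropy)

def information_gain(parent, children):
    """Parent entropy minus the size-weighted mean of the child entropies.

    `parent` and each entry of `children` are (pos, neg) count pairs.
    """
    n_parent = sum(parent)
    weighted = sum((p + n) / n_parent * binary_entropy(p, n) for p, n in children)
    return binary_entropy(*parent) - weighted

# Root: 4 positive / 6 negative. Split on Generation:
# left node (Generation 1) has 3 / 3, right node (Generation 2) has 1 / 3.
ig_generation = information_gain((4, 6), [(3, 3), (1, 3)])
print(ig_generation)
```

With these counts the weighted gain is small and positive (about 0.046 bits), so splitting on Generation separates the two Attack classes only slightly.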

K = pd.DataFrame(sample)
print(K)
CorrelationSample = K['Attack'].corr(K['HP'])
print('\n\t')
print('Here is the correlation b/w Attack & HP\n')
print(CorrelationSample)

# Name Type 1 Type 2 HP Attack

Here is the correlation b/w Attack & HP

0.4980818208834152
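
Series.corr computes the Pearson correlation coefficient by default. To make clear what that number measures, the sketch below recomputes it from the definition (covariance divided by the product of the standard deviations) on the ten HP and Attack values printed earlier. The function name pearson and the variables hp, attack, r_manual and r_pandas are illustrative.

```python
import numpy as np
import pandas as pd

# HP and Attack of the ten sampled rows, in the order printed in the notebook.
hp = pd.Series([90, 90, 90, 106, 106, 106, 90, 115, 100, 106])
attack = pd.Series([85, 90, 100, 110, 190, 150, 85, 115, 75, 90])

def pearson(x, y):
    """Pearson correlation: sum of co-deviations over the product of deviation norms."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    dx = x - x.mean()
    dy = y - y.mean()
    return float((dx * dy).sum() / np.sqrt((dx ** 2).sum() * (dy ** 2).sum()))

r_manual = pearson(attack, hp)
r_pandas = attack.corr(hp)  # pandas default method is 'pearson'

print(r_manual, r_pandas)
```

Both values agree with the 0.498 reported above, a moderate positive relationship on a sample of only ten rows.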
