PAI Practical

The document contains details of Shivam (roll no. 23242) who is pursuing a Bachelor of Technology degree in Computer Science & Engineering from Dronacharya College of Engineering, Gurgaon. It includes a certificate stating that Shivam has completed the practical requirement for the degree by submitting a practical on "Big Data Lab" under supervision. It also contains a list of 10 practicals completed by Shivam along with signatures against each.

Department of AIML

PAI Practical File

NAME: Shivam
BRANCH: CSE(AI-ML)
SEM: 6TH
ROLL NO: 23242

Shivam (23242)
Department of CSE AIML
Certificate
Certified that this practical entitled “Big Data Lab”, submitted by Shivam (23242), a student
of the Computer Science & Engineering Department, Dronacharya College of
Engineering, Gurgaon, in partial fulfillment of the requirements for the award of the
Bachelor of Technology (Branch) degree of MDU, Rohtak, is a record of the student's own
study carried out under my supervision and guidance.

Sr. No.   Practical Name   Signature
1. Introduction of various Python libraries used for machine learning.
2. Write a program to perform data pre-processing techniques for effective machine learning.
3. Write a program to apply different feature encoding schemes on the given dataset.
4. Write a program to apply filter feature selection techniques.
10.

PROGRAM 1: Introduction of various python libraries used for machine learning.

Code:

[1]: import pandas as pd
import numpy as np

[2]: # reading data

data=pd.read_csv("data.csv")

[3]: data

[3]: Country Age Salary Purchased

0 France 44.0 72000.0 No
1 Spain 27.0 48000.0 Yes
2 Germany 30.0 54000.0 No
3 Spain 38.0 61000.0 No
4 Germany 40.0 NaN Yes
5 France 35.0 58000.0 Yes
6 Spain NaN 52000.0 No
7 France 48.0 79000.0 Yes
8 Germany 50.0 83000.0 No
9 France 37.0 67000.0 Yes

[4]: student_data = {"Name":['Prateek','Ronak','Geetanshu','Naman','Ankit'],
                "exam_no":[18,25,45,34,36],
                "Result":['pass','fail','pass','pass','fail']}

df = pd.DataFrame(student_data)
df

[4] : Name exam_no Result

0 Prateek 18 pass
1 Ronak 25 fail
2 Geetanshu 45 pass
3 Naman 34 pass
4 Ankit 36 fail

[6]: # access data with the help of label
df.loc[2,['Name']]

[6]: Name Geetanshu
Name: 2, dtype: object

[7]: df.iloc[2,0]

[7] : 'Geetanshu'
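
In the two cells above, `loc` and `iloc` return the same value only because the default index labels happen to equal the row positions. The sketch below is an illustration rather than part of the original program: it re-indexes the same records by `exam_no` (an assumed choice) so that label-based and position-based access can be told apart.

```python
import pandas as pd

# Same toy records as in the listing above
student_data = {
    "Name": ["Prateek", "Ronak", "Geetanshu", "Naman", "Ankit"],
    "exam_no": [18, 25, 45, 34, 36],
    "Result": ["pass", "fail", "pass", "pass", "fail"],
}
df = pd.DataFrame(student_data)

# Re-index by exam_no so that labels and positions no longer coincide
# (an illustrative choice, not something the original program does)
by_exam = df.set_index("exam_no")

by_label = by_exam.loc[45, "Name"]   # row whose index label is 45
by_position = by_exam.iloc[2, 0]     # third row, first column

# Boolean masks also go through .loc
passed = df.loc[df["Result"] == "pass", "Name"].tolist()

print(by_label, by_position, passed)
```

Here `loc[45, ...]` looks up the label 45, while `iloc[2, ...]` counts rows from zero, so both reach the same record by different routes.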


PROGRAM 2: Write a program to perform data pre-processing techniques for effective machine learning.

[1]:# import pandas
import pandas as pd

[47]:#read csv file

df=pd.read_csv('data.csv')

[30]:# print first 5 elements

df.head()

[30]: Country Age Salary Purchased

0 France 44.0 72000.0 No
1 Spain 27.0 48000.0 Yes
2 Germany 30.0 54000.0 No
3 Spain 38.0 61000.0 No
4 Germany 40.0 NaN Yes

[6]:# import numpy

import numpy as np

[7]:# import StringIO

from io import StringIO
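
`StringIO` is imported above but not used in the cells that follow. One common use, sketched here as an assumption about the intent, is to hand CSV text held in a string to `pd.read_csv` without creating a file. The CSV text below copies the first five rows of the dataset shown earlier; the name `sample` is hypothetical.

```python
from io import StringIO

import pandas as pd

# First five rows of the dataset shown above, written as CSV text.
# The empty field on the last row stands for the missing Salary.
csv_text = """Country,Age,Salary,Purchased
France,44,72000,No
Spain,27,48000,Yes
Germany,30,54000,No
Spain,38,61000,No
Germany,40,,Yes
"""

# StringIO wraps the string in a file-like object that read_csv accepts
sample = pd.read_csv(StringIO(csv_text))

print(sample)
print(sample.isnull().sum())
```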

[31]:# check for the null value

df.isnull()

[31]: Country Age Salary Purchased

0 False False False False
1 False False False False
2 False False False False
3 False False False False
4 False False True False
5 False False False False
6 False True False False
7 False False False False
8 False False False False
9 False False False False

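[59]: # assign 10 in place of null values
# (assigning the result back avoids the chained-assignment warning that
# inplace=True on a single column raises in recent pandas versions)
df["Age"] = df["Age"].fillna(10)
df["Salary"] = df["Salary"].fillna(10)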

[60]: # print updated dataset

df

[60]: Country Age Salary Purchased

0 France 44.0 72000.0 No
1 Spain 27.0 48000.0 Yes
2 Germany 30.0 54000.0 No
3 Spain 38.0 61000.0 No
4 Germany 40.0 10.0 Yes
5 France 35.0 58000.0 Yes
6 Spain 10.0 52000.0 No
7 France 48.0 79000.0 Yes
8 Germany 50.0 83000.0 No
9 France 37.0 67000.0 Yes

[34]: # check for null values after the update

df.isnull().sum()

[34]: Country 0
Age 0
Salary 0
Purchased 0
dtype: int64
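
Filling with the constant 10 removes the nulls, but it leaves an Age of 10 and a Salary of 10 in the data, far from the other values in those columns. A possible alternative, sketched here under the assumption that a per-column mean is acceptable for this dataset, is to fill each numeric column with its own mean. The frame is rebuilt inline from the table shown above so the example runs without `data.csv`.

```python
import numpy as np
import pandas as pd

# Inline copy of the data.csv contents shown above
df = pd.DataFrame({
    "Country": ["France", "Spain", "Germany", "Spain", "Germany",
                "France", "Spain", "France", "Germany", "France"],
    "Age": [44, 27, 30, 38, 40, 35, np.nan, 48, 50, 37],
    "Salary": [72000, 48000, 54000, 61000, np.nan,
               58000, 52000, 79000, 83000, 67000],
    "Purchased": ["No", "Yes", "No", "No", "Yes",
                  "Yes", "No", "Yes", "No", "Yes"],
})

missing_before = df.isnull().sum()

# Fill each numeric column with its own mean (mean() skips NaN by default)
filled = df.copy()
for col in ["Age", "Salary"]:
    filled[col] = filled[col].fillna(filled[col].mean())

missing_after = filled.isnull().sum()

print(missing_before, missing_after, filled.loc[[4, 6]], sep="\n\n")
```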

[35]: # import SimpleImputer from sklearn

from sklearn.impute import SimpleImputer

[36]: # set model attributes

imr = SimpleImputer(strategy="constant",fill_value= 10 )

[37]: # Fit the data into the model

imr = imr.fit(df.values)

[54]: imputed_data = imr.transform(df.values)

[55]: # print data after transformation

imputed_data

[55]: array([['France', 44.0, 72000.0, 'No'],
       ['Spain', 27.0, 48000.0, 'Yes'],
       ['Germany', 30.0, 54000.0, 'No'],
       ['Spain', 38.0, 61000.0, 'No'],
       ['Germany', 40.0, 10, 'Yes'],
       ['France', 35.0, 58000.0, 'Yes'],
       ['Spain', 10, 52000.0, 'No'],
       ['France', 48.0, 79000.0, 'Yes'],
       ['Germany', 50.0, 83000.0, 'No'],
       ['France', 37.0, 67000.0, 'Yes']], dtype=object)
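
Fitting the imputer on `df.values` mixes text and numeric columns, which is why the result above is an object array. A minimal sketch of a variation, assuming only the numeric columns need imputing and that the median is a reasonable statistic here, restricts `SimpleImputer` to `Age` and `Salary` and keeps the result in a DataFrame. The names `num_cols` and `imputed` are hypothetical.

```python
import numpy as np
import pandas as pd
from sklearn.impute import SimpleImputer

# Inline copy of the data.csv contents shown above
df = pd.DataFrame({
    "Country": ["France", "Spain", "Germany", "Spain", "Germany",
                "France", "Spain", "France", "Germany", "France"],
    "Age": [44, 27, 30, 38, 40, 35, np.nan, 48, 50, 37],
    "Salary": [72000, 48000, 54000, 61000, np.nan,
               58000, 52000, 79000, 83000, 67000],
    "Purchased": ["No", "Yes", "No", "No", "Yes",
                  "Yes", "No", "Yes", "No", "Yes"],
})

num_cols = ["Age", "Salary"]

# strategy="median" learns one fill value per column during fit
imr = SimpleImputer(strategy="median")

imputed = df.copy()
imputed[num_cols] = imr.fit_transform(df[num_cols])

print(imr.statistics_)          # the learned per-column medians
print(imputed.loc[[4, 6]])      # the two rows that had missing values
```

The fitted `statistics_` can be reused with `imr.transform` on new data, which a direct `fillna` call does not offer.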

PROGRAM 3: Write a program to apply different feature encoding schemes on the given dataset.

[57]: # summary statistics of the numeric columns
df.describe()

[57]: Age Salary

count 9.000000 9.000000
mean 38.777778 63777.777778
std 7.693793 12265.579662
min 27.000000 48000.000000
25% 35.000000 54000.000000
50% 38.000000 61000.000000
75% 44.000000 72000.000000
max 50.000000 83000.000000

[42]: # import and apply LabelEncoder to the data
from sklearn.preprocessing import LabelEncoder
df_le = df.copy()  # work on a copy so df keeps its original Country strings
class_le = LabelEncoder()
df_le['Country'] = class_le.fit_transform(df_le['Country'].values)
df_le

[42]: Country Age Salary Purchased

0 0 44.0 72000.0 No
1 2 27.0 48000.0 Yes
2 1 30.0 54000.0 No
3 2 38.0 61000.0 No
4 1 40.0 10.0 Yes
5 0 35.0 58000.0 Yes
6 2 10.0 52000.0 No
7 0 48.0 79000.0 Yes
8 1 50.0 83000.0 No
9 0 37.0 67000.0 Yes
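
The integer codes in the output above follow the alphabetical order of the labels. A short sketch, using only the `Country` values from the dataset, shows how the fitted encoder's `classes_` attribute exposes that mapping and how `inverse_transform` recovers the original strings. The names `mapping` and `decoded` are hypothetical.

```python
from sklearn.preprocessing import LabelEncoder

# Country column of the dataset shown above
countries = ["France", "Spain", "Germany", "Spain", "Germany",
             "France", "Spain", "France", "Germany", "France"]

class_le = LabelEncoder()
codes = class_le.fit_transform(countries)

# classes_ is sorted; a label's position in it is its integer code
mapping = {label: code for code, label in enumerate(class_le.classes_)}

# inverse_transform maps the codes back to the original labels
decoded = class_le.inverse_transform(codes)

print(mapping)
print(list(codes))
print(list(decoded[:3]))
```

Because the codes are ordered integers, a model may read Spain (2) as "greater than" France (0); that is one reason one-hot encoding is often used for nominal features.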

[48]: df

[48]: Country Age Salary Purchased

0 France 44.0 72000.0 No
1 Spain 27.0 48000.0 Yes

2 Germany 30.0 54000.0 No
3 Spain 38.0 61000.0 No
4 Germany 40.0 NaN Yes
5 France 35.0 58000.0 Yes
6 Spain NaN 52000.0 No
7 France 48.0 79000.0 Yes
8 Germany 50.0 83000.0 No
9 France 37.0 67000.0 Yes

[61]: df_new=pd.get_dummies(df)

[62]: df_new

[62]: Age Salary Country_France Country_Germany Country_Spain \

0 44.0 72000.0 1 0 0
1 27.0 48000.0 0 0 1
2 30.0 54000.0 0 1 0
3 38.0 61000.0 0 0 1
4 40.0 10.0 0 1 0
5 35.0 58000.0 1 0 0
6 10.0 52000.0 0 0 1
7 48.0 79000.0 1 0 0
8 50.0 83000.0 0 1 0
9 37.0 67000.0 1 0 0

Purchased_No Purchased_Yes
0 1 0
1 0 1
2 1 0
3 1 0
4 0 1
5 0 1
6 1 0
7 0 1
8 1 0
9 0 1
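
Calling `pd.get_dummies(df)` encodes every text column, so the target `Purchased` is also split into two columns. The sketch below is one possible refinement rather than the original program: it encodes only `Country` through the `columns` parameter, requests integer output with `dtype=int`, and maps the binary target to 0/1. It uses a four-row excerpt of the dataset to stay short.

```python
import pandas as pd

# Four-row excerpt of the dataset shown above
df = pd.DataFrame({
    "Country": ["France", "Spain", "Germany", "Spain"],
    "Age": [44.0, 27.0, 30.0, 38.0],
    "Purchased": ["No", "Yes", "No", "No"],
})

# Encode only the feature column; dtype=int gives 0/1 instead of booleans
encoded = pd.get_dummies(df, columns=["Country"], dtype=int)

# drop_first=True omits the first category (France), which the
# remaining two columns already imply
reduced = pd.get_dummies(df, columns=["Country"], dtype=int, drop_first=True)

# Map the binary target to 0/1 directly instead of two dummy columns
encoded["Purchased"] = encoded["Purchased"].map({"No": 0, "Yes": 1})

print(encoded)
print(list(reduced.columns))
```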

[63]: df_le['Country']

[63]: 0 0
1 2
2 1
3 2
4 1
5 0
6 2
7 0
8 1
9 0

PROGRAM 4: Write a program to apply filter feature selection techniques.
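
A minimal sketch of two filter techniques follows. It is an illustration under stated assumptions, not the author's program: it uses scikit-learn's `VarianceThreshold` and `SelectKBest` with the ANOVA F-test (`f_classif`), the built-in Iris dataset as a stand-in, and an arbitrary threshold of 0.2 and `k=2`. Filter methods score each feature on its own statistics, without training a model.

```python
from sklearn.datasets import load_iris
from sklearn.feature_selection import SelectKBest, VarianceThreshold, f_classif

# Iris is a stand-in dataset (an assumption, chosen because it ships
# with scikit-learn and has numeric features with a class label)
iris = load_iris()
X, y = iris.data, iris.target
names = iris.feature_names

# Filter 1: drop features whose variance is below a chosen threshold.
# The threshold 0.2 is illustrative only.
vt = VarianceThreshold(threshold=0.2)
X_vt = vt.fit_transform(X)
kept_by_variance = [n for n, keep in zip(names, vt.get_support()) if keep]

# Filter 2: keep the k features with the highest ANOVA F-score
# against the class label
skb = SelectKBest(score_func=f_classif, k=2)
X_best = skb.fit_transform(X, y)
kept_by_score = [n for n, keep in zip(names, skb.get_support()) if keep]

print("Variance filter kept:", kept_by_variance)
print("F-scores:", dict(zip(names, skb.scores_.round(1))))
print("SelectKBest kept:", kept_by_score)
```

`get_support()` returns a boolean mask over the input features, which is how the selected column names are recovered after the transform.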

