0% found this document useful (0 votes)

20 views13 pages

Data Mining Journal 1 Kashan

Uploaded by

Kashan Riaz

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views13 pages

Data Mining Journal 1 Kashan

Uploaded by

Kashan Riaz

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

Bahria University,

Karachi Campus

COURSE:
Data Mining
TERM: SPRING 2024, CLASS: BSE- 6(A)

Submitted By:
KASHAN RIAZ 02-131212-075
_______________________________________________
(Name) (Enroll. No.)

Submitted To:

Engr. Hamza/Engr. Misbah

Signed Remarks: Score:_____

INDEX
SNO DATE LAB NO LAB OBJECTIVE SIGN

1 17-2-24 1 GUI in Python and data mining

libraries
SNO DATE LAB NO LAB OBJECTIVE SIGN
Bahria University,
Karachi Campus

LAB EXPERIMENT NO.

_1_
LIST OF TASKS
TASK NO OBJECTIVE
1 Library Management System
2 You work for an e-commerce company and have been given a dataset with
information on customer orders over the past year. Load the data into
Pandas, and analyze it using methods like .info(), and .describe(), Which
products have the highest/lowest sales? Which customer segments spend
the most?
3 You are a data analyst at a real estate company. You have been given a dataset of housing
sale prices in different regions over the past 5 years. Load the data into Pandas and
preprocess it by handling missing values and formatting columns.
4 ▪ You are a data analyst working for an automobile company. You have been provided with
the Vega dataset which contains details on different vehicle models like price, engine
size, horsepower, dimensions etc.

▪ Requirements:
• Load the Vega dataset into a Pandas data frame.
• Using plotting libraries like Matplotlib and Seaborn, Altair create visualizations to
understand relationships between different vehicle features. Some examples:
• Scatterplot of engine size vs. horsepower
• Histogram of price distribution
• Grouping by body style and analyzing statistics

Submitted On:
Date: _17-2-24___
Task No. 01: Library Management System
Solution and output:
import pandas as pd
books_df = pd.DataFrame(columns=['Title', 'Author', 'Genre', 'Publishing Year'])
members_df = pd.DataFrame(columns=['Name', 'Email', 'Contact Number', 'Membership Status'])

def add_book(title, author, genre, year):

global books_df
books_df = books_df.append({'Title': title, 'Author': author, 'Genre': genre, 'Publishing Year': year},
ignore_index=True)

def edit_book(title, new_data):

global books_df
book_index = books_df[books_df['Title'] == title].index[0]
books_df.loc[book_index] = new_data

def delete_book(title):
global books_df
books_df = books_df.drop(books_df[books_df['Title'] == title].index)

def add_member(name, email, contact_number, membership_status):

global members_df
members_df = members_df.append({'Name': name, 'Email': email, 'Contact Number': contact_number,
'Membership Status': membership_status}, ignore_index=True)

def edit_member(name, new_data):

global members_df
member_index = members_df[members_df['Name'] == name].index[0]
members_df.loc[member_index] = new_data

def search_book(title):
global books_df
return books_df[books_df['Title'] == title]

def search_member(name):
global members_df
return members_df[members_df['Name'] == name]

print("Initial Data:")
print("Books Data Frame:")
print(books_df)
print("\nMembers Data Frame:")
print(members_df)

add_book("1984", "George Orwell", "Dystopian Fiction", 1949)

add_book("To Kill a Mockingbird", "Harper Lee", "Fiction", 1960)
print("\nAfter Adding Books:")
print(books_df)

Kashan Riaz 02-131212-075

edit_book("1984", {'Title': 'Nineteen Eighty-Four', 'Author': 'George Orwell', 'Genre': 'Dystopian Fiction',
'Publishing Year': 1949})
print("\nAfter Editing '1984' Book:")
print(books_df)

delete_book("To Kill a Mockingbird")

print("\nAfter Deleting 'To Kill a Mockingbird' Book:")
print(books_df)

add_member("John Doe", "[email protected]", "1234567890", "Active")

add_member("Jane Smith", "[email protected]", "0987654321", "Active")
print("\nAfter Adding Members:")
print(members_df)

edit_member("John Doe", {'Name': 'John Smith', 'Email': '[email protected]', 'Contact Number': '1112223333',
'Membership Status': 'Active'})
print("\nAfter Editing 'John Doe' Member:")
print(members_df)

searched_book = search_book("Nineteen Eighty-Four")

print("\nSearched Book:")
print(searched_book)

searched_member = search_member("John Smith")

print("\nSearched Member:")
print(searched_member)

Kashan Riaz 02-131212-075

Task No. 02: Customer Database For e-commerce company
Solution and output:
import pandas as pd

df = pd.read_csv("Train.csv")

print("Dataset Information:")

print(df.info())

print("\nSummary Statistics:")

print(df.describe())

product_sales = df.groupby('ID')['Cost_of_the_Product'].sum().sort_values(ascending=False)

print("\nProducts with Highest Sales:")

print(product_sales.head(5))

print("\nProducts with Lowest Sales:")

print(product_sales.tail(5))

customer_segments = df.groupby('Customer_rating')['Cost_of_the_Product'].sum().sort_values(ascending=False)

print("\nCustomer Segments with Highest Spending:")

print(customer_segments.head(5))

Kashan Riaz 02-131212-075

Kashan Riaz 02-131212-075
Task No. 03: Housing Database
Solution and output:
import pandas as pd

df = pd.read_csv("Housing.csv")

print("Dataset Information:")

print(df.info())

print("\nMissing Values:")

print(df.isnull().sum())

df['price'] = df['price'].astype(float)

print("\nPreprocessed Dataset:")

print(df.head())

Kashan Riaz 02-131212-075

Task No. 04: Automobile Database
Solution and output:
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

df = pd.read_csv("Vega.csv")

print("Dataset Information:")
print(df.info())

plt.figure(figsize=(10, 6))
sns.scatterplot(data=df, x='displacement', y='horsepower')
plt.title('Engine Displacement vs. Horsepower')
plt.xlabel('Engine Displacement')
plt.ylabel('Horsepower')
plt.grid(True)
plt.show()

plt.figure(figsize=(10, 6))
sns.histplot(data=df, x='mpg', bins=20, kde=True)
plt.title('Fuel Efficiency Distribution')
plt.xlabel('Miles per Gallon (MPG)')
plt.ylabel('Frequency')
plt.grid(True)
plt.show()

grouped_origin = df.groupby('origin').agg({'mpg': 'mean', 'weight': 'mean',

'acceleration': 'mean'})
print("\nGrouped by Origin Statistics:")
print(grouped_origin)

plt.figure(figsize=(10, 6))
sns.boxplot(data=df, x='origin', y='mpg')
plt.title('Fuel Efficiency Distribution by Origin')
plt.xlabel('Origin')
plt.ylabel('Miles per Gallon (MPG)')
plt.grid(True)
plt.show()

Kashan Riaz 02-131212-075

Kashan Riaz 02-131212-075
Kashan Riaz 02-131212-075

Shandon Cytospin 3 Operator Guide
No ratings yet
Shandon Cytospin 3 Operator Guide
68 pages
Lead Small Teams
No ratings yet
Lead Small Teams
92 pages
Portfolio Management in Kotak Securites
0% (1)
Portfolio Management in Kotak Securites
92 pages
Planning and Design of Radiology & Imaging Sciences
100% (1)
Planning and Design of Radiology & Imaging Sciences
39 pages
Brighton Spec ASME 80-10 2017 PDF
No ratings yet
Brighton Spec ASME 80-10 2017 PDF
1 page
Tle 75602
No ratings yet
Tle 75602
70 pages
Class 12 Practical File Informatics Practices
No ratings yet
Class 12 Practical File Informatics Practices
22 pages
Repetitve Nerve Stimulation (RNS) : By: Syed Irshad Murtaza Neurophysiology Dept AKUH Karachi Date:12-06-2013
No ratings yet
Repetitve Nerve Stimulation (RNS) : By: Syed Irshad Murtaza Neurophysiology Dept AKUH Karachi Date:12-06-2013
33 pages
Manual Alesis Qx25 Quickstart Guide Revb
No ratings yet
Manual Alesis Qx25 Quickstart Guide Revb
40 pages
IP Practical 2023-24 (1 To 34)
100% (1)
IP Practical 2023-24 (1 To 34)
32 pages
Kendriya Vidyalaya Sangathan, Mumbai Region 1 Pre-Board Examination 2019-20
No ratings yet
Kendriya Vidyalaya Sangathan, Mumbai Region 1 Pre-Board Examination 2019-20
11 pages
RMK Engineering College Digital India Activities
No ratings yet
RMK Engineering College Digital India Activities
2 pages
DHP Journal
No ratings yet
DHP Journal
29 pages
La Liberación Del Libro. Una Crítica Del Sistema de Precio Fijo. Pedro Schwartz.
No ratings yet
La Liberación Del Libro. Una Crítica Del Sistema de Precio Fijo. Pedro Schwartz.
79 pages
FIT ZONE Nutrition Plan For MEN by Guru Mann
100% (1)
FIT ZONE Nutrition Plan For MEN by Guru Mann
8 pages
Practical File IP
No ratings yet
Practical File IP
27 pages
Dejene Chala Stat606 Screening Quiz Programming Part
No ratings yet
Dejene Chala Stat606 Screening Quiz Programming Part
12 pages
Ip Practical File
No ratings yet
Ip Practical File
20 pages
1 s2.0 S0263224113006519 Main
No ratings yet
1 s2.0 S0263224113006519 Main
11 pages
Vaginal Exam Learning Guide
No ratings yet
Vaginal Exam Learning Guide
2 pages
What Is Behavioral Finance
No ratings yet
What Is Behavioral Finance
10 pages
HHHH
No ratings yet
HHHH
22 pages
Dsbda Lab Manual Merged
No ratings yet
Dsbda Lab Manual Merged
117 pages
Class 12 Practical File Informatics Practices
No ratings yet
Class 12 Practical File Informatics Practices
28 pages
Grade 06 History 1st Term Test Paper With Answers 2019 Sinhala Medium North Western Province
83% (6)
Grade 06 History 1st Term Test Paper With Answers 2019 Sinhala Medium North Western Province
7 pages
PT - English 1 - Q3
No ratings yet
PT - English 1 - Q3
4 pages
Indian Railway
No ratings yet
Indian Railway
29 pages
DBDAL LAB - MANUAL - Final
No ratings yet
DBDAL LAB - MANUAL - Final
93 pages
Action Plan in English
No ratings yet
Action Plan in English
4 pages
IP Project On Car Rental System in India
100% (4)
IP Project On Car Rental System in India
33 pages
Pragya File
No ratings yet
Pragya File
31 pages
Data Analysis Lab - Final - 23-24
No ratings yet
Data Analysis Lab - Final - 23-24
11 pages
Data Exploration Preparation
No ratings yet
Data Exploration Preparation
12 pages
Belt Conveyor (V1)
No ratings yet
Belt Conveyor (V1)
45 pages
Employment Application Form..
No ratings yet
Employment Application Form..
3 pages
Chapter 7
No ratings yet
Chapter 7
19 pages
Week 1 To Week 9
No ratings yet
Week 1 To Week 9
30 pages
Cue Words Relaxation
No ratings yet
Cue Words Relaxation
4 pages
CSV and Coding PDF
No ratings yet
CSV and Coding PDF
37 pages
Job Network Transfer
No ratings yet
Job Network Transfer
4 pages
Amity International School SESSION: 2024-25 Informatics Practices (065) Class Xii Practical List
No ratings yet
Amity International School SESSION: 2024-25 Informatics Practices (065) Class Xii Practical List
5 pages
Practice Exam For Final Exam Acct301 With Answers
No ratings yet
Practice Exam For Final Exam Acct301 With Answers
9 pages
Reserch Proposal Raneesha
No ratings yet
Reserch Proposal Raneesha
22 pages
Practical File 12th
No ratings yet
Practical File 12th
19 pages
Alienation From David-McClellan-The-Thought-of-Karl-Marx
No ratings yet
Alienation From David-McClellan-The-Thought-of-Karl-Marx
17 pages
Pandas NumPy Practice Questions
No ratings yet
Pandas NumPy Practice Questions
2 pages
Bizhub C25 Spec
No ratings yet
Bizhub C25 Spec
8 pages
Practical File Infomatics Practices 2024-25
No ratings yet
Practical File Infomatics Practices 2024-25
39 pages
DW Lab File
No ratings yet
DW Lab File
18 pages
Question Bank-BDA (Module 1&2) 2
No ratings yet
Question Bank-BDA (Module 1&2) 2
5 pages
Class 12 Practical File Informatics Practices
No ratings yet
Class 12 Practical File Informatics Practices
22 pages
Create A Pandas Series From A Dictionary of Values and An Ndarray
No ratings yet
Create A Pandas Series From A Dictionary of Values and An Ndarray
15 pages
Lab 0 - (Part 1) Lab Environment Setup
No ratings yet
Lab 0 - (Part 1) Lab Environment Setup
5 pages
NumPy and Pandas Tutorial
No ratings yet
NumPy and Pandas Tutorial
8 pages
4BUIS014W Business Computing-Portfolio
No ratings yet
4BUIS014W Business Computing-Portfolio
7 pages
Vamshi ml-1,2
No ratings yet
Vamshi ml-1,2
25 pages
Final Dev Record
No ratings yet
Final Dev Record
49 pages
Biotechnology and It's Application by Hare Krishna Deepak
No ratings yet
Biotechnology and It's Application by Hare Krishna Deepak
42 pages
12 Ip Practical List With Solution Complete
No ratings yet
12 Ip Practical List With Solution Complete
5 pages
Screenshot 2023-12-27 at 7.05.37 PM
No ratings yet
Screenshot 2023-12-27 at 7.05.37 PM
23 pages
DSBDAL
No ratings yet
DSBDAL
87 pages
ML Lab Manual 1-10
No ratings yet
ML Lab Manual 1-10
58 pages
Python - Pandas - Numpy Interview Q&A
No ratings yet
Python - Pandas - Numpy Interview Q&A
12 pages
L6 and 7-Data Preprocessing-Coding
No ratings yet
L6 and 7-Data Preprocessing-Coding
34 pages
Text 3
No ratings yet
Text 3
3 pages
Index
No ratings yet
Index
4 pages
Beginner's Guide To Kirigami 24 Skill Building Projects For The Absolute Beginner Exclusive Download
100% (12)
Beginner's Guide To Kirigami 24 Skill Building Projects For The Absolute Beginner Exclusive Download
15 pages
Data Science Sample
No ratings yet
Data Science Sample
5 pages
S7 Practice Questions
No ratings yet
S7 Practice Questions
7 pages
FDS Record-1-4
No ratings yet
FDS Record-1-4
18 pages
DS Question Bank Unit-1 Part-2
No ratings yet
DS Question Bank Unit-1 Part-2
3 pages
Experiment 8
No ratings yet
Experiment 8
9 pages
Informatics Practices Record Class 12
No ratings yet
Informatics Practices Record Class 12
60 pages
B Tech-AIML-question Bank-2 Answer Key
No ratings yet
B Tech-AIML-question Bank-2 Answer Key
9 pages
Assignment 1
No ratings yet
Assignment 1
2 pages
Assignment 2
No ratings yet
Assignment 2
6 pages
DAVPy 2024GE
No ratings yet
DAVPy 2024GE
12 pages
NumPy and Pandas
No ratings yet
NumPy and Pandas
12 pages
HCLTech
No ratings yet
HCLTech
5 pages
Holidays Homework - Ip
No ratings yet
Holidays Homework - Ip
5 pages
Practice Questions2
No ratings yet
Practice Questions2
2 pages
Unit 4 - Working With Graphs - Python
No ratings yet
Unit 4 - Working With Graphs - Python
49 pages
DSBDA Manual
No ratings yet
DSBDA Manual
76 pages
DSBDAlab Manual
No ratings yet
DSBDAlab Manual
116 pages
Oddstudents
No ratings yet
Oddstudents
35 pages
NumPy and Pandas Step
No ratings yet
NumPy and Pandas Step
9 pages
Even Students
No ratings yet
Even Students
36 pages

Data Mining Journal 1 Kashan

Uploaded by

Data Mining Journal 1 Kashan

Uploaded by

Bahria University,

Engr. Hamza/Engr. Misbah

Signed Remarks: Score:_____

1 17-2-24 1 GUI in Python and data mining

LAB EXPERIMENT NO.

def add_book(title, author, genre, year):

def edit_book(title, new_data):

def add_member(name, email, contact_number, membership_status):

def edit_member(name, new_data):

add_book("1984", "George Orwell", "Dystopian Fiction", 1949)

Kashan Riaz 02-131212-075

delete_book("To Kill a Mockingbird")

add_member("John Doe", "[email protected]", "1234567890", "Active")

searched_book = search_book("Nineteen Eighty-Four")

searched_member = search_member("John Smith")

Kashan Riaz 02-131212-075

print("\nProducts with Highest Sales:")

print("\nProducts with Lowest Sales:")

print("\nCustomer Segments with Highest Spending:")

Kashan Riaz 02-131212-075

Kashan Riaz 02-131212-075

grouped_origin = df.groupby('origin').agg({'mpg': 'mean', 'weight': 'mean',

Kashan Riaz 02-131212-075

You might also like