Discretization Problem Statement21

Uploaded by

baneeru11


DISCRETIZATION

Instructions:

Please fill in your answers inline in the Word document. Submit Python and R code
files wherever applicable.

Please ensure you update all the details:

Name: hari machavarapu

Batch Id: dswdcmb 150622h

Topic: Data Pre-Processing

Problem Statement:
Everything in the analytics world revolves around data. Good data helps you make
useful predictions that improve your business. Sometimes using the original data
as it is does not lead to accurate solutions, and the data must be converted from
one form to another to get better predictions. Explore the various techniques for
transforming data to improve model performance. You can go through this link:
https://360digitmg.com/mindmap-data-science
1) Convert the continuous data in the iris dataset into discrete classes.
Prepare the dataset by applying preprocessing techniques so that the resulting
data improves model performance.

Sepal.Length  Sepal.Width  Petal.Length  Petal.Width  Species
5.1           3.5          1.4           0.2          setosa
4.9           3            1.4           0.2          setosa
4.7           3.2          1.3           0.2          setosa
4.6           3.1          1.5           0.2          setosa
5             3.6          1.4           0.2          setosa
5.4           3.9          1.7           0.4          setosa
4.6           3.4          1.4           0.3          setosa
5             3.4          1.5           0.2          setosa
4.4           2.9          1.4           0.2          setosa
4.9           3.1          1.5           0.1          setosa
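As a rough illustration of what "discrete classes" can mean, the sketch below bins the Petal.Length values of the ten sample rows in two common ways: equal-width bins with `pd.cut` and equal-frequency bins with `pd.qcut`. The choice of two classes and the labels "Low" and "High" are assumptions made for illustration; the problem statement does not prescribe a binning scheme.

```python
import pandas as pd

# Petal.Length values copied from the ten sample rows above.
petal_length = pd.Series(
    [1.4, 1.4, 1.3, 1.5, 1.4, 1.7, 1.4, 1.5, 1.4, 1.5],
    name="Petal.Length",
)

# Equal-width binning: the range [min, max] is split into 2 intervals
# of the same length, regardless of how many rows land in each.
equal_width = pd.cut(petal_length, bins=2, labels=["Low", "High"])

# Equal-frequency binning: the split point is the median, so each class
# receives roughly the same number of rows.
equal_freq = pd.qcut(petal_length, q=2, labels=["Low", "High"])

# Compare how the two schemes distribute the same ten values.
print(equal_width.value_counts())
print(equal_freq.value_counts())
```

On skewed features the two schemes can give quite different class sizes: here the single large value 1.7 stretches the equal-width range, so most rows end up in "Low", while the median split is more balanced.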

CODE-
# Discretize each continuous iris feature into "Low"/"High" around its mean.
import pandas as pd

# Load the dataset (author's local path; adjust for your machine).
data = pd.read_csv("C:/Users/hudso/Downloads/DataSets-Data Pre Processing/DataSets/iris.csv")

# Summary statistics and a first look at the rows.
print(data.describe())
print(data.head())

# Feature columns as named in this copy of iris.csv (no dots in the names).
features = ["SepalLength", "SepalWidth", "PetalLength", "PetalWidth"]

for col in features:
    # Two bins: [min, mean] -> "Low" and (mean, max] -> "High".
    bins = [data[col].min(), data[col].mean(), data[col].max()]
    # include_lowest=True keeps the row holding the minimum from becoming NaN.
    data[col + "_new"] = pd.cut(data[col], bins=bins, labels=["Low", "High"], include_lowest=True)

# Show all 150 rows with the new columns, then count the rows in each class.
print(data.head(150))
for col in features:
    print(data[col + "_new"].value_counts())
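One detail worth checking when the bin edges are `[min, mean, max]`: by default `pd.cut` builds right-closed intervals, so the first bin is `(min, mean]` and the row holding the minimum falls outside every bin. The sketch below shows this on the Sepal.Length values of the ten sample rows; the variable names are illustrative only.

```python
import pandas as pd

# Sepal.Length values from the ten sample rows of the problem statement.
s = pd.Series([5.1, 4.9, 4.7, 4.6, 5.0, 5.4, 4.6, 5.0, 4.4, 4.9])
edges = [s.min(), s.mean(), s.max()]

# Default behaviour: intervals are (min, mean] and (mean, max],
# so the minimum value 4.4 belongs to no interval and becomes NaN.
default_bins = pd.cut(s, bins=edges, labels=["Low", "High"])

# include_lowest=True closes the first interval on the left: [min, mean].
fixed_bins = pd.cut(s, bins=edges, labels=["Low", "High"], include_lowest=True)

print(default_bins.isna().sum())  # 1 row left unclassified
print(fixed_bins.isna().sum())    # 0 rows left unclassified
```

Comparing `value_counts()` against the number of rows is a quick way to notice such silently dropped values.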

Hints:
For each assignment, the solution should be submitted in the format below.
1. Work on each feature to create a data dictionary, as displayed in the image
below:

2. Hint: Refer to Iris.csv, which is a public dataset.

3. Research and perform all possible steps for obtaining the solution.
4. All code (executable programs) should run without errors.
5. The code should be modularized.
6. Each line of code should have a comment explaining the logic and why that
function is used.
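The hints on a data dictionary and on modularization could be approached along the following lines. This is only a sketch: the referenced image is not available, so the dictionary columns chosen here (`feature`, `dtype`, `missing`, `distinct`) are an assumption, and the helper names `build_data_dictionary` and `discretize_by_mean` are hypothetical.

```python
import pandas as pd


def build_data_dictionary(df: pd.DataFrame) -> pd.DataFrame:
    """Summarise each column: its dtype, missing count and distinct count."""
    return pd.DataFrame({
        "feature": list(df.columns),
        "dtype": [str(t) for t in df.dtypes],
        "missing": df.isna().sum().to_numpy(),
        "distinct": df.nunique().to_numpy(),
    })


def discretize_by_mean(series: pd.Series) -> pd.Series:
    """Label values at or below the mean "Low" and the rest "High"."""
    edges = [series.min(), series.mean(), series.max()]
    # include_lowest=True keeps the minimum value inside the first bin.
    return pd.cut(series, bins=edges, labels=["Low", "High"], include_lowest=True)


# First five sample rows from the problem statement (three columns only).
sample = pd.DataFrame({
    "Sepal.Length": [5.1, 4.9, 4.7, 4.6, 5.0],
    "Sepal.Width": [3.5, 3.0, 3.2, 3.1, 3.6],
    "Species": ["setosa"] * 5,
})

data_dictionary = build_data_dictionary(sample)
sepal_length_class = discretize_by_mean(sample["Sepal.Length"])

print(data_dictionary)
print(sepal_length_class.value_counts())
```

Keeping the binning in one function means the same rule is applied to every feature. Note that a constant column would produce duplicate bin edges, which `pd.cut` rejects with a `ValueError`, so such columns need separate handling.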

© 2013 - 2021 360DigiTMG. All Rights Reserved.
