0% found this document useful (0 votes)
7 views2 pages

Standardization Problem Statement

a

Uploaded by

paluruanjana
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views2 pages

Standardization Problem Statement

a

Uploaded by

paluruanjana
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

STANDARDIZATION & NORMALIZATION

Instructions:
Please share your answers filled inline in the word document. Submit Python code and R code
files wherever applicable.

Please ensure you update all the details:


Name: _________________________
Batch Id: _______________________
Topic: Data Pre-Processing

Problem Statement:
Data is one of the most important assets. Often the data are stored in distinct systems
with different formats and scales. These seemingly small differences in how the data is
stored can result in misinterpretations and inconsistencies in your analytics.
Inconsistency can make it impossible to deliver reliable information to management for
good decision-making. We have the preprocessing techniques to make the data
uniform. To explore the various techniques to have reliable uniform standard data, you
can go through this link:
https://360digitmg.com/mindmap-data-science

1) Prepare the dataset by performing the preprocessing techniques, to have the


standard scale to data.

© 360DigiTMG. All Rights Reserved.


Hints:
For each assignment, the solution should be submitted in the below format
1. Work on each feature to create a data dictionary as displayed in the image displayed
below:

2. Refer to the Seeds_data.csv file.


3. Research and perform all possible steps for obtaining the solution.
4. All the codes (executable programs) should execute without errors.
5. Code modularization should be followed.
6. Each line of code should have comments explaining the logic and why you are using that
function.

© 360DigiTMG. All Rights Reserved.

You might also like