Unit-4 Part 1 Preparing Model
Unit-4 Part 1 Preparing Model
Model
PROF. ATMIYA PATEL
Understanding about Data
Different Types of Data
Exploring Structure of Data
Two basic data types:
1. Numerical
2. Categorical
Standard dataset have data dictionary. Like UCI repository (University of California)
Link: https://archive.ics.uci.edu/ml/index.php
Exploring Numerical Data
Steps:
1. Understanding central tendency (Ex. Mean, Median)
2. Understanding data spread
a. Dispersion of data
b. Position of different data values
(a.) (b.)
Data Quality issues factors
Incorrect sample set selection: The data may not reflect normal pr regular quality due to this.
Ex. Use Festival sale data to predict the future sale.