Data Wrangling and Munging
Data Wrangling and Munging
IT4140
Section E
Lecture-18, 19
OUTLINE
▪ DATA WRANGLING
▪ DATA MUNGING
DATA WRANGLING
Data Wrangling
https://datascienceworkshops.com/blog/seven-command-line-tools-
for-data-science/
DATA WRANGLING-Type or Problem?-Case Study
• When data are completely random and, the fact that the data are
lacking in independent of the observed and unobserved data.
• Dirty data, also known as which data, are inaccurate, incomplete or
unrelevant data, especially in a computer system or database.
• Data is any record that inadvertently shares data with another
record in a Database.
• The data is easy to spot and it mostly occurs when transferring data
between systems.
• A file refers to every type of data that is not visible at all when using
a standard viewer, or under certain settings.
DATA WRANGLING Vs DATA MUNGING
• Data wrangling, sometimes referred to as data munging.
• Data munging process of transforming and mapping data from one "raw"
data form into another format with the intent of making it more
appropriate and valuable for a variety of downstream purposes such as
analytics.
• It is also preparing your data for a dedicated purpose (i.e. pre-process, data
split, data curation, data normalization etc.)
• Data munging is the process of removing errors and combining complex
data sets to make them more accessible and easier to analyse.
DATA WRANGLING or DATA MUNGING ?- Case study
▪ Calculate the range of the data set for setting the pattern of big data.
▪ When data are completely random and, the fact that the data are
lacking in independent of the observed and unobserved data.
▪ Subtract the minimum x value from the value of this data point to
decide the organisation of big data.
▪ Data is any record that inadvertently shares data with another record
in a Database.
▪ Repeat with additional data points for filling the null values in data
set.
▪ The process of creating, organizing and maintaining data sets so they
can be accessed and used by people looking for information.