REPORT - Assignment 1
REPORT - Assignment 1
Katleen Ezekeil Orata (c0848019) Hardik Solanki (0852302) Mohammad Imran Uddin (c0800487)
Artificial Intelligence & Machine Learning Artificial Intelligence & Machine Learning Artificial Intelligence & Machine Learning
Program Program Program
Lambton College Lambton College Lambton College
Toronto, Canada Toronto, Canada Toronto, Canada
[email protected] [email protected] [email protected]
Abstract—This electronic document is a “live” template and The.head() and.tail() functions are used by the students to
already defines the components of your paper [title, text, heads, investigate the head and the tail, or the first and last rows of
etc.] in its style sheet. *CRITICAL: Do Not Use Symbols, Special the dataset. This provides the students with a quick look at the
Characters, Footnotes, or Math in Paper Title or Abstract.
information, assisting them in developing a hypothesis and
(Abstract)
giving them an indication of the type of analysis they can
Keywords—data preprocessing, exploratory data analysis, conduct. The students occasionally discovered some
central tendency, dispersion, outlier, visualization duplicated rows using the.head() and.tail() functions, as seen
in Figure 2.
I. INTRODUCTION
The students are tasked to investigate a dataset (data.csv) that
contains 12,000 observations with 16 different attributes and
perform an exploratory data analysis. The purpose of the
assignment is for the students to investigate various Python
libraries that may be applied to the analysis, manage typical
data mistakes, and illustrate patterns and insights from the
data.
II. DATASET
The dataset contains 12,000 entries with a total of 16 columns.
Each column describes the various feature of the cars such as
make, model, engine fuel type, fuel type, popularity, etc.