Final DAProject
Final DAProject
Project
Toyota Carolla
2. Tool
Used
In this topic, I am going
INT
to use a data set called RO
Toyota Corolla which DU
contains information on CT
IO
the sale of used car in N
the Asia.
TOOL
extensive catalog of
statistical and graphical
methods. It includes
machine learning
USED
algorithms, linear
regression, time series,
statistical inference to
name a few.
and 10900$-11290$
- The car with the most KM is TOYOTA Corolla 1.6 16V HATCHB LINEA TERRA 2/3-Doors:
7704759 (km)
- The car with the least KM (only 1km) is:
+ TOYOTA Corolla 1.6-16v VVT-i Linea Terra Comfort AIRCO NIEUW 5DRS 4/5-Doors.
+ TOYOTA Corolla VERSO 2.0 D4D SOL (7) BNS MPV.
+ TOYOTA Corolla 1.4-16v VVT-i Linea Terra Comfort NIEUW AIRCO 4/5-Doors.
+ TOYOTA Corolla 1.6-16v VVT-i Linea Terra Comfort NIEUW AIRCO 5drs 4/5-Doors.
+ TOYOTA Corolla 1.4-16v VVT-i Linea Terra Comfort NIEUW AIRCO 4/5-Doors.
According to the chart, more than 1264
cars use Petrol fuel.
Over 155 cars use Diesel fuel.
Predict Price with 3 Model:
Regression Analysis, Random Forest, Gradient Boosting
Before using R to
predict, we split the data.
Result
CASE 1
Regression Analysis
Plot Regression
Analysis Model
Then we use this model which were created to predict price in data set
CASE 2
Random Forest
Importance feature
- Next, we can see that Age and KM have negative values, proving that it negatively affects the
price of a car. That’s mean if the car is older, it has a negative impact on the price, therefore
the older the car, the lower the price.
- We can see the column p-value (Pr(>|t|)), they are 4 column has a p value less than 0.05. We
can confirm that these properties are statistically significant. In this case, we find Age_08_04,
KM, Automatic statistically significant.
Reference
https://www.toyota.com/corolla/
https://en.wikipedia.org/wiki/Toyota_Corolla
THANK YOU FOR FOLLOWING