Analytics Projectv 1
Analytics Projectv 1
Student Name
University Name
Professor’s Name
Date
2
Enhancing Search Engine optimization (SEO) for Toyota Used Car Dealers
Problem Recognition
Introduction
The main topic that will guide this project is enhancing product quality and customer
satisfaction through predictive quality analytics at Toyota Motor Corporation. In this scenario,
however, the topic is too wide for any feasible projects under the scope of this study to be
conducted. Therefore, the project will focus on a specific area of study with the goal of
enhancing customer satisfaction and product quality at the company. The project will be limited
to Toyota customer preference analysis with an emphasis on addressing the business challenge of
enhancing search engine optimization (SEO) for Toyota used car dealers.
Motivation
Toyota Motor Corporation recognizes and highly values the need for enhancing customer
satisfaction and product quality, especially being a global leader in automotive manufacturing.
Notably, today’s competitive market requires a company to understand and meet customer
preferences in order to enhance brand loyalty and maintain market share (Narayana et al., 2021).
In the used car segment, the automotive industry faces many challenges in effectively reaching
potential customers online. For most Toyota dealers, optimizing search engine visibility is very
important. Improved SEO significantly not only increases visibility online but also the likelihood
of attracting new customers browsing websites such as KSL.com. The project aims to use
predictive quality analytics to identify key car features that significantly impact page views (PV)
As mentioned earlier, enhancing search engine optimization (SEO) in the used car
domain, is a significant challenge for dealers.The main problem is effectively aligning online
listings with the preferences of a customer with the goal of enhancing visibility and attracting
new customers. The dealers are often required to optimize the listing on websites in order to
increase page views and increase customer sales and engagement. However, they face challenges
identifying specific care features that can attract the interest of customers and potentially
translate this information into creating actionable SEO strategies (Papagiannis, 2020).
Understandably, the diversity and volume of car features across Toyoto’s products (vehicles)
makes this more difficult. Dealers must be able to discern the features and resonate well with
their potential customers and strategically incorporate relevant keywords in their listing to
enhance visibility.
On that note, the main issue addressed in this project is how Toyota used car dealers can
apply predictive quality analytics to identify important car features that can significantly improve
page views on websites like KSL.com. This will allow dealers to optimize their SEO strategies in
a bid to increase visibility, attract better leads, hence customer satisfaction and sales, hopefully
the information be used to improve the quality of Toyota Products at a manufacturing level.
The following statements characterize the problem that is the focus of the project:
● Toyota used car dealers find it hard to optimize online listings on websites such as
KSL.com. The issue is enhancing search engine visibility in order to increase page views.
● The diversity of Toyota vehicles, each with unique specifications and physical features
● Dealers need to use data-driven techniques to determine the features that resonate best
● Having poor SEO strategies can result in lower viability of listings resulting in reduced
● To address the above mentioned issues, there is a need to apply predictive quality
analytics to analyze customer data and predict the features most likely to increase page
views.
● The goal is to improve product quality perceptions and customer satisfaction through
Background Research
The concept of predictive analytics in SEO is a methodology that uses historical data and
Analyzing extensive datasets includes application of various SEO metrics such as keyword
rankings, website traffic, and user behavior, predictive analytics enables marketers to make
informed decisions. Rosas (n.d.) explains that predictive analytics allows users to navigate the
dynamic digital landscape with ease and confidence. In recent years, predictive analytics has
been transforming the automotive industry by using data analytics to anticipate future outcomes
across critical areas such as vehicle maintenance and customer engagement (Podolean, 2023).
This advanced technology allows dealers to improve operational efficiency and product quality.
to anticipate customer behavior and optimize their operational strategies accordingly. This can
include analyzing historical sales data and market trends to predict future consumer preferences
and tailor their marketing strategies to individual customer profiles (Turbo Marketing Solutions,
5
recommendations. Predictive analytics not only transforms how dealerships interact with
customers but also changes their approach to inventory management and sales forecasting.
Dealerships can accurately predict which models are likely to be in high demand at specific
Data Collection
Data Collection
In this project, the first step of the collection process involved identifying appropriate
data sources with information used Toyota car listings. After much deliberation, KSL.com was
selected as the primary data source because of its extensive coverage and user-friendly platform
for listing vehicles by both individuals and dealers. In relation to the project objectives, the
website also provided access to a diverse range of Toyota models and configurations. This was
necessary to capture a representative sample of customer preferences in the used car industry.
To gather information from the platform, web scraping techniques were applied using
Selenium WebDriver and BeautifulSoup for parsing HTML content. Selenium was used to
automate interactions with the website, allowing for the extraction of data identifiers (data_ids)
associated with each vehicle listing. The data_ids were then used as unique keys for accessing
detailed information about each vehicle. The two technologies ensured a targeted and systematic
After identification and extraction of data_ids, the next step involved systematically
collecting and preparing the HTML content of each listing for analysis. Again the Selenium
WebDriver was vital in automating the process of navigating to each listing's URL and capturing
6
the complete HTML content. The process also involved simulating user interactions, such as
clicking on 'See More' buttons to expand detailed information when necessary. This ensured
All the HTML responses captured were then saved in a designated local directory named
‘Toyota_Used’. The folder was used to store the scraped data. This structured approach of
storing data ensured well organized and easily accessible data for further processing and
analysis. The entire collection process prioritized maintaining integrity and completeness of
gathered information
Analysis
The analysis process began with acquiring the dataset containing the various attributes of
Toyota vehicles, such as year, model, mileage, price, and views, prepared in the previous step of
collection. As expected, the initial step was to load the data into a Pandas dataframe for
exploration purposes.
Initial steps involved loading the data into a Pandas DataFrame for exploration. This
phase was crucial as it enabled us to grasp the dataset's structure, identify missing values, and
discern potential challenges in data preprocessing. The screenshot below provides a slight
The next step was data cleaning and preprocessing to prepare the data for analysis and modeling.
In this scenario, missing values were marked as NaNs and handled appropriately. Categorical
variables like 'Cab Type' and 'Transmission' required special attention due to missing or
inconsistent entries. This was resolved through encoding and imputation methods. This was the
final cleaned dataset and proof that the dataset had no null values.
The next step of the analysis process was conducting Exploratory Data Analysis (EDA) to
uncover critical insights into the dataset's characteristics. In that regard, visualization techniques
such as histograms, density plots, and correlation matrices were employed to uncover patterns
and relationships among variables. The following charts were implemented in line with the
project objectives.
Figure 6: Scatter plot depicting the relationship between vehicle views and price
10
Figure 7: Pie chart illustrating the distribution of title types among Toyota vehicles
11
Figure 8: Scatter plot showing the correlation between vehicle views and price
The next step after the visualization process was to conduct feature engineering to
prepare the data for predictive modeling. As expected, categorical variables were transformed
into numerical formats using one-hot encoding. This was critical to enhance the dataset’s
capability with machine learning algorithms and ensuring that all features were represented for
modeling purposes.
12
In the next step, two primary regression models, Lasso Regression and Random Forest
Regression, were selected for predicting online views of Toyota vehicles. The Lasso Regression
was selected because of its ability to handle high-dimensional data and feature selection through
regularization. The parameters were fine-tuned using cross-validation techniques and model
Random Forest Regression approach was used to gather insights on the importance of
features and decision-making processes, particularly useful for understanding which vehicle
attributes significantly influenced online views. The results provided actional information that
was used to implement the final model with a focus on identifying influential factors like vehicle
Interpretation
The Lasso regression model has an optimal alpha value of 10.0. This shows the
regularization strength that consequently minimizes prediction error. The cross-validation scores
14
mean score of -0.1017. The mean score shows the overall performance in predicting Toyota used
car prices, which means the model is very good at predicting vehicle attributes. The selected
features for the model are Year, Model_Land Cruiser, Model_MR2, and Interior
Condition_Poor. They show significant coefficients, -105.24, 113.27, 71.28, and 531.36
respectively. The Interior Condition_Poor stands out with the highest coefficient, this shows its
Solution
To use the above uncovered insights effectively, dealers can strategically enhance their
search engine optimization (SEO) practices. SEO strategies should be centered around influential
Condition_Poor, in online listings and marketing strategies is important. Optimizing content and
keywords around these attributes will allow dealers to improve their visibility and relevance in
search engine results. This approach will not only attract potential customers who prioritize these
features but also enhance the dealership's overall online presence and competitive edge in the
market.
Conclusion
It is safe to conclude that integrating specific values derived from the Lasso regression,
optimal alpha of 10.0 and mean cross-validation score of -0.1017, into SEO strategies represents
a data-driven approach to addressing business challenges in Toyota's used car market segment.
When dealers focus on attributes such as Year, specific Toyota models like Land Cruiser and
MR2, and the condition attribute Interior Condition_Poor, they can align their marketing efforts
closely with customer preferences and trends. This approach improves customer satisfaction by
providing vehicles aligned with their preferences and strengthens the perceived quality of
Toyota's used car offerings. This project contributes to enhancing customer satisfaction and
References
Narayana, C. V., Likhitha, C. L., Bademiya, S., & Kusumanjali, K. (2021, August).
Machine learning techniques to predict the price of used cars: predictive analytics in retail
Papagiannis, N. (2020). Effective SEO and content marketing: the ultimate guide for
analytics-in-the-automotive-industry-opportunities-and-challenges
Rosas, C. (n.d.). Harnessing Predictive Analytics in SEO: Taking Out The Guesswork.
Turbo Marketing Solutions. (n.d.). Leveraging Predictive Analytics for Car Dealerships:
https://www.turbomarketingsolutions.com/single-post/leveraging-predictive-analytics-for-
car-dealerships-maximizing-sales-and-improving-customer-insight