
Data Analytics Project Sem4

The project analyzes the BlinkIT Grocery Data dataset to uncover sales patterns, customer preferences, and product performance using data analytics tools like Python, Tableau, and Power BI. Key findings reveal that product categories like Fruits & Vegetables perform well, sales vary by outlet type and size, and newer outlets show different sales distributions. The project emphasizes the importance of data analytics in optimizing business strategies and enhancing customer satisfaction in the e-commerce sector.

Uploaded by

sumanyumimroth
Copyright: © All Rights Reserved

MINOR PROJECT

Introduction
In today’s rapidly evolving digital economy, e-commerce platforms generate vast amounts of data that
can be leveraged to gain valuable business insights. Our project aims to explore and analyze sales
patterns, customer preferences, recent trends and product performance using various data analytics
techniques.

Our study is based on the BlinkIT Grocery Data dataset, which provides extensive details about product
categories, outlet types, pricing, and sales figures. By applying modern data analytics tools such as
Python (Pandas, Matplotlib, Seaborn), Tableau, Power BI, and Excel, we aim to extract meaningful
insights from the dataset.

The key objectives of our project are:

• To perform Exploratory Data Analysis (EDA) and summarize key statistics.
• To visualize trends in product category sales and outlet performance.
• To analyze the correlation between variables such as pricing, visibility, and sales.
• To build predictive models using regression and classification techniques.
• To provide actionable insights that can help businesses optimize their sales and marketing
strategies.

Through this project, we demonstrate the significance of data analytics in improving business decision-
making and driving growth in the competitive e-commerce industry.

Data Analytics Tools and Trends Analysis for E-commerce Dataset
Based on the BlinkIT Grocery Data dataset, the following data analytics tools and trend analyses
are suitable:

Suitable Data Analytics Tools

Python with Pandas & Matplotlib/Seaborn:
• Excellent for data cleaning, exploration, and visualization
• Free and widely used in academic settings
• Good for handling Excel data like this

Tableau/Power BI:
• Great for interactive visualizations
• User-friendly drag-and-drop interface
• Good for presenting findings to non-technical audiences

Excel Data Analysis Tools:
• The dataset is already in Excel format, so pivot tables and charts work directly
• Familiar interface for most students
• Limited for more complex analysis

Data Analytics Techniques Used

Given the structure of the dataset, here are some of the analytics techniques we used:

Descriptive Statistics and Exploratory Data Analysis (EDA):


Tools: Pandas, Matplotlib, Seaborn.
Purpose: Get insights into the distribution of sales, ratings, item visibility, etc.
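
A minimal sketch of this EDA step with Pandas. The rows below are synthetic placeholders, and the column names ("Item Type", "Item Visibility", "Sales", "Rating") are assumptions about how the file is laid out, so they may need renaming to match the actual spreadsheet.

```python
import pandas as pd

# Synthetic placeholder rows, NOT real BlinkIT data; column names are assumed.
df = pd.DataFrame({
    "Item Type": ["Fruits & Vegetables", "Frozen Foods", "Snack Foods", "Frozen Foods"],
    "Item Visibility": [0.10, 0.02, 0.05, 0.03],
    "Sales": [145.5, 115.3, 165.0, 126.5],
    "Rating": [5.0, 5.0, 4.0, None],
})

# Count, mean, standard deviation, min/max and quartiles for the numeric columns.
summary = df[["Item Visibility", "Sales", "Rating"]].describe()

# Number of missing values in each column.
missing = df.isna().sum()

# Number of rows per item type.
type_counts = df["Item Type"].value_counts()

print(summary)
print(missing)
print(type_counts)
```

With the real file, the DataFrame would presumably come from pd.read_excel or pd.read_csv instead of being typed in by hand.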

Data Visualization:
Tools: Matplotlib, Seaborn, Plotly.
Purpose: Visualize sales trends, outlet types, and item categories to draw patterns.
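
As an illustration of this kind of chart, the sketch below draws a bar chart of mean sales per outlet type with Matplotlib. The data is synthetic and the column names "Outlet Type" and "Sales" are assumed, so this shows the approach rather than the project's actual figure.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so the script also runs without a display
import matplotlib.pyplot as plt
import pandas as pd

# Synthetic placeholder rows, NOT real BlinkIT data; column names are assumed.
df = pd.DataFrame({
    "Outlet Type": ["Supermarket Type1", "Grocery Store",
                    "Supermarket Type1", "Supermarket Type2"],
    "Sales": [145.5, 60.2, 165.0, 126.5],
})

# Average sales per outlet type, highest first.
mean_sales = df.groupby("Outlet Type")["Sales"].mean().sort_values(ascending=False)

fig, ax = plt.subplots(figsize=(6, 4))
ax.bar(mean_sales.index, mean_sales.values)
ax.set_xlabel("Outlet Type")
ax.set_ylabel("Mean sales")
ax.set_title("Mean sales by outlet type (synthetic data)")
ax.tick_params(axis="x", rotation=20)
fig.tight_layout()
```

Calling fig.savefig with a file name writes the chart to disk; in a notebook the figure is displayed directly.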

Correlation Analysis:
Tools: Pandas, Seaborn.
Purpose: Explore relationships between variables like item visibility, item weight, and
sales.
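
A small sketch of the correlation step, assuming numeric columns named "Item Visibility", "Item Weight" and "Sales" (hypothetical names) and using synthetic rows:

```python
import pandas as pd

# Synthetic placeholder rows, NOT real BlinkIT data; column names are assumed.
df = pd.DataFrame({
    "Item Visibility": [0.10, 0.02, 0.05, 0.03, 0.08],
    "Item Weight": [9.3, 5.9, 17.5, 19.2, 8.9],
    "Sales": [145.5, 115.3, 165.0, 126.5, 55.2],
})

# Pairwise Pearson correlation between all numeric columns.
corr = df.corr(method="pearson")

# Correlation of each feature with Sales, strongest (by absolute value) first.
sales_corr = corr["Sales"].drop("Sales").sort_values(key=abs, ascending=False)

print(corr.round(2))
print(sales_corr.round(2))
```

Passing the corr matrix to seaborn.heatmap with annot=True is a common way to show the same table as a colored grid. A correlation coefficient only measures linear association, so a value near zero does not rule out a non-linear relationship.
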
Regression Analysis:
Tools: Scikit-learn.
Purpose: Build predictive models for sales based on features like item visibility, outlet
type, and outlet size.
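
One way such a model could be set up in Scikit-learn is sketched below. It is an illustrative linear regression on synthetic rows, not the project's fitted model, and the column names are assumptions. Categorical features are one-hot encoded before fitting.

```python
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import train_test_split

# Synthetic placeholder rows, NOT real BlinkIT data; column names are assumed.
df = pd.DataFrame({
    "Item Visibility": [0.10, 0.02, 0.05, 0.03, 0.08, 0.04, 0.06, 0.01],
    "Outlet Type": ["Supermarket Type1", "Grocery Store"] * 4,
    "Outlet Size": ["Medium", "Small", "High", "Small",
                    "Medium", "Small", "High", "Small"],
    "Sales": [145.5, 60.2, 165.0, 58.1, 150.3, 62.7, 170.9, 55.0],
})

# One-hot encode the categorical columns; drop_first avoids a redundant column per category.
X = pd.get_dummies(df[["Item Visibility", "Outlet Type", "Outlet Size"]],
                   drop_first=True, dtype=float)
y = df["Sales"]

# Hold out a quarter of the rows to evaluate on data the model has not seen.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)

model = LinearRegression().fit(X_train, y_train)
pred = model.predict(X_test)
mae = mean_absolute_error(y_test, pred)
print(f"Mean absolute error on held-out rows: {mae:.1f}")
```

With only a handful of synthetic rows the error value itself is meaningless; on the full dataset the same pattern gives an honest estimate of predictive quality.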

Clustering:
Tools: Scikit-learn.
Purpose: Group outlets or products based on similarity in sales, size, or customer ratings.
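
A hedged sketch of outlet clustering with k-means. The per-outlet averages are synthetic, the names "Outlet Identifier", "Mean Sales" and "Mean Rating" are hypothetical, and the choice of two clusters is arbitrary for illustration.

```python
import pandas as pd
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Synthetic per-outlet aggregates, NOT real BlinkIT data; names are assumed.
outlets = pd.DataFrame({
    "Outlet Identifier": ["OUT01", "OUT02", "OUT03", "OUT04"],
    "Mean Sales": [150.0, 155.0, 60.0, 58.0],
    "Mean Rating": [4.6, 4.5, 3.9, 4.0],
})

# Standardize so that sales (large numbers) do not dominate ratings (small numbers).
features = StandardScaler().fit_transform(outlets[["Mean Sales", "Mean Rating"]])

# Assign each outlet to one of two clusters.
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0)
outlets["Cluster"] = kmeans.fit_predict(features)
print(outlets)
```

The cluster numbers (0 or 1) carry no meaning on their own; only which outlets end up grouped together matters.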

Classification:
Tools: Scikit-learn.
Purpose: Classify outlets based on type (Supermarket Type1, Type2) or predict customer
ratings.
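
For illustration, a small decision tree that predicts outlet type from two numeric features. The training rows are synthetic and the column names are assumptions; a decision tree is just one of several classifiers that could be used here.

```python
import pandas as pd
from sklearn.tree import DecisionTreeClassifier

# Synthetic placeholder rows, NOT real BlinkIT data; column names are assumed.
train = pd.DataFrame({
    "Sales": [150.0, 155.0, 60.0, 58.0, 148.0, 62.0],
    "Item Visibility": [0.04, 0.05, 0.09, 0.10, 0.03, 0.08],
    "Outlet Type": ["Supermarket Type1", "Supermarket Type1", "Grocery Store",
                    "Grocery Store", "Supermarket Type1", "Grocery Store"],
})

feature_cols = ["Sales", "Item Visibility"]
clf = DecisionTreeClassifier(max_depth=2, random_state=0)
clf.fit(train[feature_cols], train["Outlet Type"])

# Two unseen rows to classify.
new_rows = pd.DataFrame({"Sales": [152.0, 59.0], "Item Visibility": [0.04, 0.09]})
pred = clf.predict(new_rows[feature_cols])
print(list(pred))
```

On real data the classifier should be evaluated on a held-out test set, since accuracy on the training rows overstates how well it generalizes.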

Key Trends to Explore


Product Category Performance:
Which item types (Fruits & Vegetables, Frozen Foods, etc.) have highest sales?

How does fat content (low fat vs regular) affect sales?
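
Both questions reduce to a group-by on sales. The sketch below uses synthetic rows and assumed column names; it also assumes fat-content labels may be spelled inconsistently (for example "LF" or "reg"), which should be checked against the real column before applying the mapping.

```python
import pandas as pd

# Synthetic placeholder rows, NOT real BlinkIT data; column names are assumed.
df = pd.DataFrame({
    "Item Type": ["Fruits & Vegetables", "Frozen Foods",
                  "Fruits & Vegetables", "Frozen Foods"],
    "Item Fat Content": ["Low Fat", "reg", "LF", "Regular"],
    "Sales": [145.5, 115.3, 165.0, 126.5],
})

# Assumed label variants; inspect df["Item Fat Content"].unique() on the real file first.
fat_map = {"LF": "Low Fat", "low fat": "Low Fat", "reg": "Regular"}
df["Item Fat Content"] = df["Item Fat Content"].replace(fat_map)

# Total sales per item type, highest first.
sales_by_type = df.groupby("Item Type")["Sales"].sum().sort_values(ascending=False)

# Total, average and row count of sales per fat-content group.
sales_by_fat = df.groupby("Item Fat Content")["Sales"].agg(["sum", "mean", "count"])

print(sales_by_type)
print(sales_by_fat)
```

Comparing the mean alongside the sum matters here, because a category can have high total sales simply by containing more items.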

Outlet Analysis:
How do sales vary by outlet type (Supermarket Type1, Grocery Store, etc.)?

Does outlet size or location tier affect sales?
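
A pivot table is a compact way to look at outlet type and location tier together. This is a sketch on synthetic rows with assumed column names ("Outlet Type", "Outlet Location Type", "Sales"):

```python
import pandas as pd

# Synthetic placeholder rows, NOT real BlinkIT data; column names are assumed.
df = pd.DataFrame({
    "Outlet Type": ["Supermarket Type1", "Grocery Store",
                    "Supermarket Type1", "Grocery Store"],
    "Outlet Location Type": ["Tier 1", "Tier 3", "Tier 3", "Tier 1"],
    "Sales": [150.0, 60.0, 140.0, 70.0],
})

# Mean sales for every (outlet type, location tier) pair;
# margins=True adds an "All" row and column with the overall means.
pivot = df.pivot_table(index="Outlet Type", columns="Outlet Location Type",
                       values="Sales", aggfunc="mean", margins=True)
print(pivot)
```

The same call with "Outlet Size" as the columns argument would answer the outlet-size question, assuming the file has such a column.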


Temporal Trends:
Is there a relationship between outlet establishment year and sales?

Are newer outlets performing better than older ones?
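
These two questions can be approached by grouping sales on the establishment year. In the sketch below the rows are synthetic, the column name "Outlet Establishment Year" is assumed, and the 2010 cutoff between "older" and "newer" outlets is an arbitrary choice for illustration.

```python
import pandas as pd

# Synthetic placeholder rows, NOT real BlinkIT data; column names are assumed.
df = pd.DataFrame({
    "Outlet Establishment Year": [1998, 1998, 2011, 2011, 2017, 2017],
    "Sales": [120.0, 130.0, 140.0, 150.0, 100.0, 110.0],
})

# Sales statistics for each establishment year.
by_year = df.groupby("Outlet Establishment Year")["Sales"].agg(["mean", "median", "count"])

# Split outlets into two eras; the cutoff year is an illustrative assumption.
is_newer = df["Outlet Establishment Year"] >= 2010
df["Outlet Era"] = is_newer.map({True: "newer", False: "older"})

# Full distribution summary (mean, spread, quartiles) per era.
by_era = df.groupby("Outlet Era")["Sales"].describe()

print(by_year)
print(by_era)
```

Looking at the spread and quartiles, not just the mean, is what reveals whether newer outlets have a different sales distribution.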

Pricing Analysis:
What is the relationship between item weight and sales?

How does item visibility correlate with sales?

Rating Analysis:
Since most ratings are 5, is there any variation worth exploring?

Relationship between rating and other variables
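
When ratings pile up at one value, it helps to look at the share of each rating and then compare sales across rating bands. A sketch under assumed column names ("Rating", "Sales"), with synthetic rows and arbitrarily chosen band edges:

```python
import pandas as pd

# Synthetic placeholder rows, NOT real BlinkIT data; column names are assumed.
df = pd.DataFrame({
    "Rating": [5.0, 5.0, 4.0, 5.0, 3.5, 5.0, 4.5, 2.0],
    "Sales": [150.0, 160.0, 120.0, 155.0, 110.0, 158.0, 140.0, 90.0],
})

# Fraction of rows at each rating value, ordered by rating.
share = df["Rating"].value_counts(normalize=True).sort_index()

# Band edges are illustrative; each band includes its upper edge.
df["Rating Band"] = pd.cut(df["Rating"], bins=[0, 3, 4, 5],
                           labels=["up to 3", "3 to 4", "4 to 5"])

# Mean sales within each rating band.
sales_by_band = df.groupby("Rating Band", observed=True)["Sales"].mean()

print(share)
print(sales_by_band)
```

If nearly all rows fall into the top band, the lower bands contain too few rows for their averages to be reliable, which is itself a finding worth reporting.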

For our project, we focused on two or three of these trends, each with clear visualizations
and interpretations. Product category performance and outlet type analysis made
particularly interesting starting points.

Making a graph for recent trends

Output: [graph not reproduced in this text version]

Conclusion
Our analysis of the BlinkIT Grocery Data dataset has provided valuable insights into product
performance, outlet trends, and pricing strategies. By using data visualization tools and statistical
techniques, we identified key trends that can help businesses make informed decisions.

Some of our key findings include:

• Product Category Performance: Certain product categories, such as Fruits & Vegetables, have
consistently high sales, while fat content influences purchasing behavior.
• Outlet Analysis: Sales vary significantly based on outlet type and location tier, with larger
supermarkets generally outperforming smaller grocery stores.
• Temporal Trends: Newer outlets tend to have a different sales distribution compared to older
ones, highlighting the importance of store expansion strategies.
• Pricing and Sales Correlation: Product visibility and pricing play a crucial role in influencing
customer purchases.
• Rating Analysis: Most ratings are concentrated at the higher end (5-star), but further
segmentation could reveal nuanced consumer preferences.

Overall, our project demonstrates how businesses can leverage data analytics to optimize their
operations, improve product offerings, and enhance customer satisfaction. By utilizing tools like
Python, Tableau, and Power BI, organizations can transform raw data into actionable insights,
ultimately driving success in the e-commerce sector.

This project has provided us with a deeper understanding of real-world data analytics applications,
equipping us with essential skills for future industry challenges.
