0% found this document useful (0 votes)

25 views17 pages

Intern Report

The document discusses predicting sales for Big Mart stores using machine learning algorithms. It describes the problem statement of predicting future sales and demand. It then provides a literature review of several papers that have used techniques like random forest, gradient boosting, SVM and neural networks for sales prediction in other domains.

Uploaded by

ruchitcsganesh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views17 pages

Intern Report

Uploaded by

ruchitcsganesh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 17

BIGMART SALES PREDICTION USING DATA SCIENCE 2023-24

CHAPTER 1

INTRODUCTION

Day by day competition among different shopping malls as well as big marts is getting more
serious and aggressive only due to the rapid growth of the global malls and on-line shopping.
Every mall or mart is trying to provide personalized and short-time offers for attracting more
customers depending upon the day, such that the volume of sales for each item can be predicted
for inventory management of the organization, logistics and transport service, etc. Present
machine learning algorithm are very sophisticated and provide techniques to predict or forecast
the future demand of sales for an organization, which also helps in overcoming the cheap
availability of computing and storage systems.

Big Mart is a Grocery Super Market Brand. Big Mart Brand has started out its journey with
free home delivery offerings of food and grocery. Big Mart lets in you to walk far away from
the drudgery of grocery shopping and welcome a clean comfortable way of browsing and
shopping for groceries. Discover new merchandise and shop for all of your food and grocery
desires from the comfort of your private home or workplace. No greater getting stuck in traffic
jams, procuring parking, standing in long queues and wearing heavy bags – get everything you
want when you want, right at the doorstep.

In this paper, we are addressing the problem of big mart sales prediction or forecasting of an
item on customer’s future demand in different big mart stores across various locations and
products based on the previous record. Different machine learning algorithms like linear
regression analysis, random forest, etc. are used for the prediction of sales volume. Since good
sales are the life of every organization the forecasting of sales plays an important role in any
shopping complex. Always a better prediction is helpful, to develop as well as to enhance the
strategies of business about the marketplace which is also helpful to improve the knowledge of
marketplace.

A standard sales prediction study can help in deeply analyzing the situations or the conditions
previously occurred and then, the inference can be applied about customer acquisition,

Dept. of CSE Page | 1

BIGMART SALES PREDICTION USING DATA SCIENCE 2023-24

funds inadequacy and strengths before setting a budget and marketing plans for the upcoming
year. In other words, sales prediction is based on the available resources from the past. In depth
knowledge of past is required for enhancing and improving the likelihood of marketplace
irrespective of any circumstances especially the external circumstance, which allows to prepare
the upcoming needs for the business. The basic and foremost technique used in predicting sale
is the statistical methods, which is also known as the traditional method, but these methods
take much more time for predicting a sale also these methods could not handle non linear data
so to over these problems in traditional methods machine learning techniques are deployed.
Machine learning techniques can not only handle non-linear data but also huge data-set
efficiently. To measure the performance of the models we can use the accuracy measure so that
accordingly we can decide which model predicts better.

This is a complete exploratory analysis on the Big Mart Sales. It’s a regression practice problem
where in we have to predict sales product-wise and store-wise

Dept. of CSE Page | 2

BIGMART SALES PREDICTION USING DATA SCIENCE 2023-24

CHAPTER 2

PROBLEM STATEMENT

Most of the business organizations heavily depend on a knowledge base and demand prediction
of sales trends. Sales forecasting is the process of estimating future sales. Accurate sales forecasts
enable companies to make informed business decisions and predict short-term and long-term
performance. Companies can base their forecasts on past sales data, industrywide comparisons,
and economic trends. Sales forecasts help sales teams achieve their goals by identifying early
warning signals in their sales pipeline and course correct before it’s too late. The goal is to
improve the accuracy from the existing project. So that the sales and profit could be increased for
the companies. Choosing an efficient algorithm from comparing different algorithms to improve
the prediction further more.
The primary issues that need to be addressed are:

1. Data Deluge: Small businesses often grapple with a deluge of sales data, creating a challenge in
manually sifting through and extracting meaningful insights from the substantial volume.

2. Trend Blindness: Without a structured analysis approach, small businesses find it challenging to
discern crucial sales trends, cyclic patterns, and fluctuations that play a pivotal role in influencing
their revenue.

3. Resource Allocation Inefficiency: In the absence of data-driven sales predictions, small

businesses may inefficiently allocate resources, leading to issues like excess inventory, stockouts,
or suboptimal utilization of marketing efforts.

4. Competitive Disadvantage: Smaller enterprises face a competitive disadvantage when they are
unable to leverage the full potential of their sales data, putting them at a strategic disadvantage
in the market.

Dept. of CSE Page | 3

BIGMART SALES PREDICTION USING DATA SCIENCE 2023-24

CHAPTER 3

LITERATURE SURVEY

➢ Sunitha cheriyan “Intelligent Sales Prediction Using Machine Learning Techniques.”

The detailed study and analysis of comprehensible predictive models to improve
future sales predictions are carried out in this research. Traditional forecast systems
are difficult to deal with the big data and accuracy of sales forecasting. The models
implemented for prediction are Random Forest, Gradient Boosting and Extremely
Randomized Trees (Extra Trees) Classifiers.Random Trees was confirmed to be a
very effective.[1]

➢ Shuyun Ren “Forecasting the Retail Sales of China’s Catering Industry Using Support
Vector Machines.”
The forecast of China's catering retail sales was studied in this paper. The seasonal
impact was considered in the forecasting. The retail sales were predicted using the
seasonal auto-regressive integrated moving average (ARIMA) model. ARIMA,
SVM. SVM method is obviously superior to the seasonal ARIMA method regardless
of the long-term forecasting or the shortterm forecasting.[2]

➢ Avinash kumar Sharma “An Intelligent Model For Predicting the Sales of a Product.”
The approach shown in this paper is a systematic, accurate and precise model building
to be used in computing and predicting current scenario and future projection of a
product in market respectively. Random forest algorithm, neural network. Neural
network.[3]

➢ Renesa Ray “Sales Prediction Using Machine Learning Algorithms.”

The aim of this paper is to propose a dimension for predicting the future sales of Big
Mart Companies keeping in view the sales of previous years. A comprehensive study
of sales prediction is done using Machine Learning models. Linear Regression, K-
Neighbours Regressor, XGBoost, Regressor and Random Forest Regressor. Random
Forest Algorithm is found to be the most suitable[4]

Dept. of CSE Page | 4

BIGMART SALES PREDICTION USING DATA SCIENCE 2023-24

➢ Pratik patil “Comparison of Different Machine Learning Algorithms for Multiple

Regression on Black Friday Sales Data.”
This study focuses on the field of prediction models to develop an accurate and
efficient algorithm to analyze the customer spending in the past and output the future
spending of the customers with same features. Regression, Decision
Tree,XGBoost.[5]

➢ Auddie heichmeir“Forecasting of Walmart Sales using Machine Learning Algorithm.”

The ability to predict data accurately is extremely valuable in a vast array of domains
such as stocks, sales, weather or even sports.,consisting of weekly retail sales numbers
from different departments in Walmart retail outlets all over the United States of
America.The models implemented for prediction are Random Forest, Gradient
Boosting and Extremely Randomized Trees (Extra Trees) Classifiers. Random Trees
was confirmed to be a very effective.[6]

➢ Narayana R “Sales Prediction For Big Mart”.

A retailer company wants a model that can predict accurate sales so that it can keep
track of customers future demand and update in advance the sale inventory. In this
work, we propose a technique to optimize the parameters and select the best tuning
hyper parameters, further ensemble with Xgboost techniques for forecasting the
future sales of a retailer company such as Big Mart and we found our model produces
the better result. Xgboost techniques. Experimental analysis found our technique
produce more accurate[7]

➢ Kaneko and Yada “A Deep Learning Approach for the Prediction of Retail Store
Sales.”
The purpose of this research is to construct a sales prediction model for retail stores
using the deep learning approach, which has gained significant attention in the rapidly
developing field of machine learning in recent years. Using such a model for analysis,
an approach to store management could be formulated . Logistic regression model
The accuracy decreased by around 13% when the logistic regression model was
used.[8]

Dept. of CSE Page | 5

BIGMART SALES PREDICTION USING DATA SCIENCE 2023-24

CHAPTER 4

OBJECTIVES

1. Data Gathering and Refinement:

Acquire historical sales data from the small business and meticulously prepare it for analysis
through comprehensive cleaning, transformation, and appropriate structuring.

2. Unearthing Sales Trends:

Apply advanced data science techniques to unearth pivotal sales trends, delving into aspects
like seasonality, cyclic patterns, and notable fluctuations that significantly influence business
revenue.

3. Precision in Sales Projections:

Design and implement robust data science models tailored for precise sales predictions.
Empower the small business to foresee future sales with a commendable level of accuracy
through the application of cutting-edge forecasting techniques.

4. Efficient Resource Deployment:

Guide the small business in streamlining inventory management, refining pricing strategies,
and optimizing marketing endeavors based on insights extracted from the comprehensive
analysis of sales data.

5. Insightful Customer Behavior Examination:

Conduct a thorough examination of customer behavior and preferences utilizing sales data,
enabling the customization of marketing strategies for enhanced customer engagement and
satisfaction.

6. Strategic Competitive Positioning:

Equip the small business with the tools to establish a strategic advantage by leveraging their
sales data for informed and strategic decision-making, fostering a position of strength in the
competitive landscape.

Dept. of CSE Page | 6

BIGMART SALES PREDICTION USING DATA SCIENCE 2023-24

CHAPTER 5

SYSTEM REQUIREMENTS

➢ HARDWARE REQUIREMENTS

• System : i3 Processor
• Hard Disk : 500 GB.
• Monitor : 15’’LED
• Ram : 4GB

➢ SOFTWARE REQUIREMENTS

• Operating system : Windows 7 or above, linux.

• Scripting Tool: Jupyter Notebook, Google colab
• Language:Python3

Dept. of CSE Page | 7

BIGMART SALES PREDICTION USING DATA SCIENCE 2023-24

CHAPTER 6

METHODOLOGY

Sales prediction is preferably a regression problem than a time series problem. Practice shows
that the use of regression procedures can often supply us better results comparing with time series
techniques. Machine learning algorithms make it possible to find patterns in the time series.
BigMart sales dataset consists of 2013 sales data for 1559 products throughout 10 special stores
in unique towns.

We have 2 dataset the train dataset which has 8523 rows and 12 features and the test dataset
which has 5681 rows and 11 columns. The train dataset has 1 extra column which is the target
variable. We will predict this target variable for the test dataset. Calculations done in the Python
environment using the main packages pandas, sklearn, numpy, matplotlib, seaborn etc. To
conduct the analysis, we will be using Jupyter Notebook.

The goal of the BigMart sales prediction ML challenge is to build a regression model for
expecting the sales of every of 1559 products for the following year in every of the 10 specific
BigMart stores. The BigMart sales dataset additionally includes certain attributes for each
product and store. This model allows BigMart to know the properties of products and stores that
play an essential position in growing their universal sales. We divided the entire analysis process
to following five stages:
1. Exploratory data analysis (EDA)
2. Data Pre-processing
3. Feature engineering & Feature Transformation
4. Modeling
5. Hyperparameter tuning and Evaluation

Each step is explained below in details.

1. Exploratory data analysis (EDA)
In this phase useful information about the data has been extracted from the dataset. That is

Dept. of CSE Page | 8

BIGMART SALES PREDICTION USING DATA SCIENCE 2023-24

trying to identify the information from hypotheses vs available data. Which shows that the
attributes Outlet size and Item weight face the problem of missing values, also the minimum
value of Item Visibility is zero which is not actually practically possible. Establishment year
of Outlet varies from 1985 to 2009. These values may not be appropriate in this form. So, we
need to convert them into how old a particular outlet is. There are 1559 unique products, as
well as 10 unique outlets, present in the dataset. The attribute Item type contains 16 unique
values. Where as two types of Item Fat Content are there but some of them are misspelled as
regular instead of ’Regular’ and low fat, LF instead of Low Fat
2. Data Cleaning
It was observed from the previous section that the attributes Outlet Size and Item Weight has
missing values. In our work in case of Outlet Size missing value we replace it by the mode
of that attribute and for the Item Weight missing values we replace by mean of that particular
attribute. The missing attributes are numerical where the replacement by mean and mode
diminishes the correlation among imputed attributes. For our model we are assuming that
there is no relationship between the measured attribute and imputed attribute
3. Feature Engineering & Feature Transformation
Some nuances were observed in the data-set during data exploration phase. So, this phase is
used in resolving all nuances found from the dataset and make them ready for building the
appropriate model. During this phase it was noticed that the Item visibility attribute had a
zero value, practically which has no sense. So, the mean value item visibility of that product
will be used for zero values attribute. This makes all products likely to sell. All categorical
attributes discrepancies are resolved by modifying all categorical attributes into appropriate
ones. In some cases, it was noticed that non-consumables and fat content property are not
specified. To avoid this, we create a third category of Item fat content i.e. none. In the Item
Identifier attribute, it was found that the unique ID starts with either DR or FD or NC. So, we
create a new attribute Item Type New with three categories like Foods, Drinks and Non-
consumables. Finally, for determining how old a particular outlet is, we add an additional
attribute Year to the dataset.
4. Model Building
After completing the previous phases, the dataset is now ready to build proposed model. Once
the model is built it is used as predictive model to forecast sales of Big Mart. In our work, we
make model based on different algorithms such as Random Forest algorithm, Linear
regression, Lasso Regression, Ridge regression, Decision tree etc. and compare it with other

Dept. of CSE Page | 9

BIGMART SALES PREDICTION USING DATA SCIENCE 2023-24

machine learning techniques. All models received features as input, which are then segregated
into training and test set. The test dataset is used for sales prediction.
5. Hyperparameter tuning and Evaluation
The next and final step in our project is the tuning of different parameters in every model and
saw improvement in model performance. While this is an important step in modeling, it is by
nomeans the only way to improve performance.

Dept. of CSE Page | 10

BIGMART SALES PREDICTION USING DATA SCIENCE 2023-24

CHAPTER 7

TESTING

TABLE :- MODEL TESTING

Maximum
AUC AUC Run-time Memory
Algorithm (Training) (Holdout) (Training) Utilization
(Of 16 GB)
XGBoost 0.88 0.86 16 min 12 sec 12%

Logistic Regression 0.66 0.50 52 sec 20%

Naïve Bayesian 0.64 0.59 59 sec 20%

Random Forest
(Depth controlled) 23 min 10 sec 29%
0.79 0.51

SVM (RBF 105 min 30 sec 21%

kernel) 0.68 0.52

LDA 0.74 0.52 6 min 51 sec 35%

KNN
(Euclidean distance) 0.52 0.5 180 min 12 seca 35%

Dept. of CSE Page | 11

BIGMART SALES PREDICTION USING DATA SCIENCE 2023-24

CHAPTER 8

RESULTS
It was found that our target variable ‘Item_Outlet_Sales’ is skewed to the right, towards the
higher sales, with higher concentration on lower sales.

FIG 8.1:Outlet item sales

From the current numeric variables, we can observe that the Item_Visibility is the feature
with the lowest correlation with our target variable. Therefore, the less visible the product is
in the store the higher the price will be. This is curious since from the initial assumptions this
variable was expected to have high impact in the sales increase. Moreover, this feature has a
negative correlation with all of the other features. Furthermore, the most positive correlation
belongs to Item_MRP.

FIG 8.2:Corelation map

Dept. of CSE Page | 12

BIGMART SALES PREDICTION USING DATA SCIENCE 2023-24

There seems to be a low number of stores with size equals to “High”. Most of the existent
stores seem to be either “Small” or “Medium. It was observed that lowest sales were produced
in smallest locations. However, in some cases it was found that medium size location
produced highest sales though it was type-3 (there are three type of super market e.g. super
market type-1, type-2 and type-3) super market instead of largest size location to increase the
product sales of Big mart in a particular outlet, more locations should be switched to Type 3
Supermarkets.

FIG 8.3:Impact of Outlet type

FIG 8.4:Impact of Outlet Size

Dept. of CSE Page | 13

BIGMART SALES PREDICTION USING DATA SCIENCE 2023-24

However, if we look at our results, we see that in fact it is stores from Tier 2 cities that present
the highest results, followed by Tier 3 cities and with Tier 1 cities with the lowest results of
the three type of locations.

FIG 8.5:Impact of Outlet Location

However, the proposed model gives better predictions among other models for future sales at
all locations. The Item Outlet Sales is strongly correlated with Item MRP. Less visible items
are sold more compared to more visibility that means it describes that the less visible products
are sold more compared to the higher visibility products which is not possible practically.

FIG 8.6:Item Visibility

Dept. of CSE Page | 14

BIGMART SALES PREDICTION USING DATA SCIENCE 2023-24
It is also observed that target attribute Item Outlet Sales is affected by sales of the Item Type.
Similarly, it is also observed that highest sales are made by OUT027 which is actually a
medium size outlet in the super market type-3.

FIG 8.7:Impact of Outlet on Item Sales

Dept. of CSE Page | 15

BIGMART SALES PREDICTION USING DATA SCIENCE 2023-24

CONCLUSION

In present era of digitally connected world every shop demand of product sales or user demands.
Extensive research in this area at enterprise level is happening for accurate sales prediction. As
the profit made by a company is directly proportional to the accurate predictions of sales, the Big
marts are desiring more accurate prediction algorithm so that the company will not su er any ff
losses. In this work, we have designed a predictive model by modifying Random Forest technique
and experimented it on the 2013 Big Mart dataset for predicting sales of the product from a
particular outlet. Experiments support that our technique produces more accurate prediction
compared to than other available techniques like decision trees, ridge regression etc.

Dept. of CSE Page | 16

BIGMART SALES PREDICTION USING DATA SCIENCE 2023-24

REFERENCES

[1] Sunitha Cheriyan, Shaniba Ibrahim, Saju Mohanan & Susan Treesa (2018) Intelligent
Sales Prediction Using Machine Learning Techniques.
[2] Xiangsheng Xie & Gang Hu (2008). Forecasting the Retail Sales of China’s Catering
Industry.
[3] Avinash kumar, Neha Gopal & Jatin Rajput(2020). An Intelligent Model For Predicting
the Sales of a Product.
[4] Purvika Bajaj, Renesa Ray, Shivani Shedge & Shravani Vidhate(2020). SALES
PREDICTION USING MACHINE LEARNING ALGORITHMS.
[5] Ching-Seh (Mike) Wu. Pratik Patil & Saravana Gunaseelan(2018). Comparison of
Different Machine Learning Algorithms for Multiple Regression on Black Friday Sales
Data.
[6 ] Nikhil Sunil Elias, Seema Singh(2019).FORECASTING of WALMART SALES using
MACHINE LEARNING ALGORITHMS.
[7] Yuta Kaneko & Katsutoshi Yada(2016). A Deep Learning Approach for the Prediction of
Retail Store Sales.
[8] Gopal Behera & Neeta Nain (2019). Sales Prediction For Big Mart.

https://www.kaggle.com/aakash2016/big-mart-sales-prediction
https://datahack.analyticsvidhya.com/contest/practice-problem-big-mart-sales-iii/
https://medium.com/diogo-menezes-borges/project-1-bigmart-sale-prediction-fdc04f07dc1e
https://rstudio-pubsstatic.s3.amazonaws.com/381886_981132516a8e437284327a405ca4d91a.html

Dept. of CSE Page | 17

MMPC-001 Management Functions and Organisational Processes
No ratings yet
MMPC-001 Management Functions and Organisational Processes
303 pages
Lesson 3-Analysis of Procedures Such As Survey, Interview and Observation
100% (4)
Lesson 3-Analysis of Procedures Such As Survey, Interview and Observation
16 pages
Course Material On Gns 302
100% (2)
Course Material On Gns 302
65 pages
Basepaper 1
No ratings yet
Basepaper 1
7 pages
Big Mart Sales Prediction Analysis: Dr.B.Santosh Kumar
No ratings yet
Big Mart Sales Prediction Analysis: Dr.B.Santosh Kumar
90 pages
Analysis of Machine Learning Model For Predicting Sales Forecasting
No ratings yet
Analysis of Machine Learning Model For Predicting Sales Forecasting
6 pages
Construction Engineering AND Management
No ratings yet
Construction Engineering AND Management
15 pages
Sales Prediction For Big Mart 3.0.pptx MM
No ratings yet
Sales Prediction For Big Mart 3.0.pptx MM
25 pages
Big Mart Sales Analysis
No ratings yet
Big Mart Sales Analysis
4 pages
Paper 9427
No ratings yet
Paper 9427
6 pages
Project Report Shruti
No ratings yet
Project Report Shruti
66 pages
Big Mart Outlets
100% (2)
Big Mart Outlets
11 pages
Predictive Analysis For Big Mart Sales Using Machine
100% (1)
Predictive Analysis For Big Mart Sales Using Machine
11 pages
Predicting The Future of Sales: A Machine Learning Analysis of Rossman Store Sales
No ratings yet
Predicting The Future of Sales: A Machine Learning Analysis of Rossman Store Sales
11 pages
Prediction of Big Mart Sales Using Machine Learning: (Peer-Reviewed, Open Access, Fully Refereed International Journal)
No ratings yet
Prediction of Big Mart Sales Using Machine Learning: (Peer-Reviewed, Open Access, Fully Refereed International Journal)
8 pages
BMSP-ML: Big Mart Sales Prediction Using Different Machine Learning Techniques
No ratings yet
BMSP-ML: Big Mart Sales Prediction Using Different Machine Learning Techniques
10 pages
Salespredmmmm
No ratings yet
Salespredmmmm
15 pages
Cie A2 Maths 9709 Statistics2 v2 Znotes
No ratings yet
Cie A2 Maths 9709 Statistics2 v2 Znotes
10 pages
Comparative Analysis of Supervised Machine Learnin
No ratings yet
Comparative Analysis of Supervised Machine Learnin
10 pages
Sales Analysis and Forecasting in Shopping Mart: Amit Kumar, Kartik Sharma, Anup Singh, Dravid Kumar
No ratings yet
Sales Analysis and Forecasting in Shopping Mart: Amit Kumar, Kartik Sharma, Anup Singh, Dravid Kumar
4 pages
Biostat
100% (1)
Biostat
66 pages
Bigmart Sales Using Machine Learning With Data Analysis
No ratings yet
Bigmart Sales Using Machine Learning With Data Analysis
5 pages
Big Mart Sales Analysis
No ratings yet
Big Mart Sales Analysis
4 pages
Bigmarket Sale Abstract
No ratings yet
Bigmarket Sale Abstract
1 page
Govind Sadashiv Ghurye: Module-14
No ratings yet
Govind Sadashiv Ghurye: Module-14
7 pages
Data Analysis On BigMart Sales
67% (3)
Data Analysis On BigMart Sales
17 pages
Application of Big Data Analysis in Sales Forecast
No ratings yet
Application of Big Data Analysis in Sales Forecast
7 pages
Thesis Statement Anchor Chart
100% (3)
Thesis Statement Anchor Chart
7 pages
Neba 2672024 AJPAS118179
No ratings yet
Neba 2672024 AJPAS118179
24 pages
Basepaper 3
No ratings yet
Basepaper 3
14 pages
ECSFS Report (670 - Kumar Shantanu)
No ratings yet
ECSFS Report (670 - Kumar Shantanu)
21 pages
Educational Research Quantitat R Robert Burke Johnson
No ratings yet
Educational Research Quantitat R Robert Burke Johnson
6 pages
IJNRD2406005
No ratings yet
IJNRD2406005
8 pages
DSP Research Paper by Shanmukh and Meher
No ratings yet
DSP Research Paper by Shanmukh and Meher
33 pages
Easychair Preprint: Mansi Panjwani, Rahul Ramrakhiani, Hitesh Jumnani, Krishna Zanwar and Rupali Hande
No ratings yet
Easychair Preprint: Mansi Panjwani, Rahul Ramrakhiani, Hitesh Jumnani, Krishna Zanwar and Rupali Hande
9 pages
Sales 1
No ratings yet
Sales 1
36 pages
Dilla University Senate Legislation Final - Dec - 2012
100% (2)
Dilla University Senate Legislation Final - Dec - 2012
219 pages
1142pm - 1.EPRA JOURNALS 14814
No ratings yet
1142pm - 1.EPRA JOURNALS 14814
6 pages
Mini PRJCT
No ratings yet
Mini PRJCT
11 pages
ForecastingRetailSalesusingMachine Learning Models
No ratings yet
ForecastingRetailSalesusingMachine Learning Models
34 pages
JICET-Abdullah Bin Tayyab
No ratings yet
JICET-Abdullah Bin Tayyab
11 pages
Amit Kumar: Bigmart Sales Prediction A Project Report
No ratings yet
Amit Kumar: Bigmart Sales Prediction A Project Report
47 pages
Sales Forecasting Elsvier
No ratings yet
Sales Forecasting Elsvier
19 pages
1july Presentation
No ratings yet
1july Presentation
18 pages
Chapter 1: Introduction: 1.1 Background Theory
No ratings yet
Chapter 1: Introduction: 1.1 Background Theory
36 pages
Sales Prediction Model For Big Mart: Parichay: Maharaja Surajmal Institute Journal of Applied Research
No ratings yet
Sales Prediction Model For Big Mart: Parichay: Maharaja Surajmal Institute Journal of Applied Research
11 pages
Major ppt-1
No ratings yet
Major ppt-1
13 pages
Karanja (2015)
No ratings yet
Karanja (2015)
52 pages
RP 3
No ratings yet
RP 3
12 pages
An Effective Predicting E Commerce Sales
No ratings yet
An Effective Predicting E Commerce Sales
11 pages
Ammmp2023 87 94
No ratings yet
Ammmp2023 87 94
8 pages
Geopolitics in The Foreign Office. British Representations of Argentina 1945-1961
No ratings yet
Geopolitics in The Foreign Office. British Representations of Argentina 1945-1961
19 pages
Finaal Project
No ratings yet
Finaal Project
13 pages
DLP 2 Importance of Quantitative Research
No ratings yet
DLP 2 Importance of Quantitative Research
2 pages
Sales Forecast Paper
No ratings yet
Sales Forecast Paper
8 pages
Final DMT Report PDF
No ratings yet
Final DMT Report PDF
27 pages
Emerging Strong Program TIF
No ratings yet
Emerging Strong Program TIF
35 pages
Grid Search Optimization (GSO) Based Future Sales Prediction For Big Mart
No ratings yet
Grid Search Optimization (GSO) Based Future Sales Prediction For Big Mart
7 pages
3 Main
No ratings yet
3 Main
9 pages
KOC3402 Group 8 Assignment Latest
No ratings yet
KOC3402 Group 8 Assignment Latest
18 pages
Seminar Report
No ratings yet
Seminar Report
25 pages
Final PBL of Aaryan & Satyam
No ratings yet
Final PBL of Aaryan & Satyam
19 pages
BigMart Sale Prediction Using Machine Learning
No ratings yet
BigMart Sale Prediction Using Machine Learning
2 pages
Synopsis-Big Mart Sales Prediction
No ratings yet
Synopsis-Big Mart Sales Prediction
3 pages
IJCRT2105404 Bigmart 4
No ratings yet
IJCRT2105404 Bigmart 4
4 pages
Chetan Research Paper
No ratings yet
Chetan Research Paper
7 pages
Improvizing Big Market Sales Prediction: Meghana N
No ratings yet
Improvizing Big Market Sales Prediction: Meghana N
7 pages
PPIR!1
No ratings yet
PPIR!1
9 pages
AOL-2-Mod-1 MA
No ratings yet
AOL-2-Mod-1 MA
17 pages
PPIR
No ratings yet
PPIR
8 pages
Assessing The Utilization of Digital Wallet On
No ratings yet
Assessing The Utilization of Digital Wallet On
4 pages
Community Scale Sustainability
No ratings yet
Community Scale Sustainability
25 pages
Academic Writing
No ratings yet
Academic Writing
16 pages
FinalPaper SalesPredictionModelforBigMart
No ratings yet
FinalPaper SalesPredictionModelforBigMart
14 pages
Sales Prediction
No ratings yet
Sales Prediction
37 pages
Power and Group Work in Physical Education: A Foucauldian Perspective
No ratings yet
Power and Group Work in Physical Education: A Foucauldian Perspective
15 pages
El-Angbawi Et Al-2015-Cochrane Database of Systematic Reviews
No ratings yet
El-Angbawi Et Al-2015-Cochrane Database of Systematic Reviews
30 pages
Apstats Unit2combined Practice Test
No ratings yet
Apstats Unit2combined Practice Test
8 pages
Decline of Igloo Ice-Cream in Pakistan Due To Insu
No ratings yet
Decline of Igloo Ice-Cream in Pakistan Due To Insu
12 pages
3aa5fc367415d40d722c37e70fff9bf0
No ratings yet
3aa5fc367415d40d722c37e70fff9bf0
77 pages
Entry Mode Ethiopy
No ratings yet
Entry Mode Ethiopy
36 pages
MGMT 432 02 Final - Exam - Review - Outline - Fall 2024
No ratings yet
MGMT 432 02 Final - Exam - Review - Outline - Fall 2024
12 pages
Intervention Analysis
No ratings yet
Intervention Analysis
37 pages
Case Study Set B
No ratings yet
Case Study Set B
2 pages
Performance of Public Sector
No ratings yet
Performance of Public Sector
20 pages
JPMC Strategy Analytics Analyst Resume
No ratings yet
JPMC Strategy Analytics Analyst Resume
3 pages
Sales Forecasting: Data Science Models
From Everand
Sales Forecasting: Data Science Models
Azhar ul Haque Sario
No ratings yet
How AI will Impact Retail Business
From Everand
How AI will Impact Retail Business
Ramesh Venkatachalam
No ratings yet
Business Intelligence Questions, Analytical & Reporting Hint
From Everand
Business Intelligence Questions, Analytical & Reporting Hint
Dr. Zemelak Goraga
No ratings yet
AI-Powered Growth: 54 Proven Strategies for Small Businesses to Boost Revenue: How AI Can Change Business Outcomes That Increase Revenue
From Everand
AI-Powered Growth: 54 Proven Strategies for Small Businesses to Boost Revenue: How AI Can Change Business Outcomes That Increase Revenue
Rick Spair
No ratings yet

Intern Report

Uploaded by

Intern Report

Uploaded by

BIGMART SALES PREDICTION USING DATA SCIENCE 2023-24

Dept. of CSE Page | 1

Dept. of CSE Page | 2

3. Resource Allocation Inefficiency: In the absence of data-driven sales predictions, small

Dept. of CSE Page | 3

➢ Sunitha cheriyan “Intelligent Sales Prediction Using Machine Learning Techniques.”

➢ Renesa Ray “Sales Prediction Using Machine Learning Algorithms.”

Dept. of CSE Page | 4

➢ Pratik patil “Comparison of Different Machine Learning Algorithms for Multiple

➢ Auddie heichmeir“Forecasting of Walmart Sales using Machine Learning Algorithm.”

➢ Narayana R “Sales Prediction For Big Mart”.

Dept. of CSE Page | 5

1. Data Gathering and Refinement:

2. Unearthing Sales Trends:

3. Precision in Sales Projections:

4. Efficient Resource Deployment:

5. Insightful Customer Behavior Examination:

6. Strategic Competitive Positioning:

Dept. of CSE Page | 6

• Operating system : Windows 7 or above, linux.

Dept. of CSE Page | 7

Each step is explained below in details.

Dept. of CSE Page | 8

Dept. of CSE Page | 9

Dept. of CSE Page | 10

TABLE :- MODEL TESTING

Logistic Regression 0.66 0.50 52 sec 20%

Naïve Bayesian 0.64 0.59 59 sec 20%

SVM (RBF 105 min 30 sec 21%

LDA 0.74 0.52 6 min 51 sec 35%

Dept. of CSE Page | 11

FIG 8.1:Outlet item sales

FIG 8.2:Corelation map

Dept. of CSE Page | 12

FIG 8.3:Impact of Outlet type

FIG 8.4:Impact of Outlet Size

Dept. of CSE Page | 13

FIG 8.5:Impact of Outlet Location

FIG 8.6:Item Visibility

Dept. of CSE Page | 14

FIG 8.7:Impact of Outlet on Item Sales

Dept. of CSE Page | 15

Dept. of CSE Page | 16

Dept. of CSE Page | 17

You might also like