Review-1 Presentation
Machine learning based dynamic pricing for
perishable products
Supervised By: Prof. Sankarsan Sahoo
Group No.: C9
Neejara Dikshita Choudhury :- 2141003023
Shubham Swain :- 2141019401
Joydeep Sutradhar :- 2141019400
Debashree Priyadarshini :- 2141016343
Department of Computer Sc. and Engineering
Faculty of Engineering & Technology (ITER)
Siksha ‘O’ Anusandhan (Deemed to be University)
Bhubaneswar, Odisha
Introduction
Perishable products like fresh produce, dairy, and
meat require efficient inventory and pricing
strategies to minimize waste and maximize
profits. Traditional pricing models, such as fixed
discounts, fail to adapt to real-time demand
fluctuations, leading to significant losses. Studies
[1] indicate that 40% of fresh produce is wasted
due to ineffective sales strategies.
Fig 1.0
Problem Statement
In traditional retail, pricing perishable products is a balance between making the most
profit and reducing unsold stock. Fixed prices or fixed discounts don’t work well
because they don’t consider real-time customer demand, stock levels, or how long a
product will stay fresh. The challenge is to create a smart, automated pricing system
that adjusts based on these factors. This way, businesses can increase profits while
keeping waste to a minimum.
Motivation
• Enhance sales efficiency and maximize profit using TD3 (Twin Delayed Deep
Deterministic Policy Gradient).
• Reduce food wastage through better inventory management [2].
• Implement data-driven pricing strategies for improved decision-making [1].
• Benefit retailers and grocery stores with optimized pricing models [1].
Objectives
This project aims to develop a Machine Learning-based dynamic pricing strategy for
perishable products to optimize sales and minimize wastage. By analyzing factors like
expiration dates, demand fluctuations, and market trends, the system will adjust
prices dynamically to maximize revenue.
Expected Impacts
• Increased Revenue: Optimized pricing ensures higher profitability using the TD3 algorithm.
• Reduced Waste: Minimizes perishable product wastage through dynamic pricing
[2].
• Consumer Benefits: Encourages fair pricing and affordability for customers [1].
• Data-Driven Decision Making: Supports businesses with AI-powered sales
strategies [1].
• Sustainability Contribution: Reduces environmental impact by lowering food waste.
Literature Review
Improvements Over Existing Solution
• Unlike prior studies that assume a two-period lifetime, our model is designed to
handle perishable products with longer and variable shelf lives, providing a more
practical and scalable solution [2].
Work-flow Diagram
Fig 1.1
Key Components/Features & Modules
• Data Collection & Preprocessing
• Data Source: Collected from a retail dataset containing product details, demand factors,
pricing history, and expiration dates.
• Feature Engineering:
• Days_To_Expire = EXPIRATION_DATE - CURRENT_DATE
• Discount_Applied = Original_Price - Discounted_Price
• Data Cleaning:
• Handling missing values using median imputation.
• Removing duplicate entries.
• Feature Transformation:
• Used StandardScaler to normalize the features (a minimal preprocessing sketch follows below).
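A minimal sketch of the preprocessing steps listed above, assuming the retail dataset is available as a CSV file; the file name and column names (EXPIRATION_DATE, Original_Price, Discounted_Price, Demand_Factor, Stock_Level) are illustrative assumptions and may differ from the actual dataset.

```python
# Illustrative preprocessing sketch; file name and column names are assumptions.
import pandas as pd
from sklearn.preprocessing import StandardScaler

df = pd.read_csv("grocery_inventory.csv", parse_dates=["EXPIRATION_DATE"])

# Feature engineering
today = pd.Timestamp.today().normalize()
df["Days_To_Expire"] = (df["EXPIRATION_DATE"] - today).dt.days
df["Discount_Applied"] = df["Original_Price"] - df["Discounted_Price"]

# Data cleaning: median imputation for missing numeric values, drop duplicates
numeric_cols = ["Original_Price", "Discounted_Price", "Demand_Factor",
                "Stock_Level", "Days_To_Expire", "Discount_Applied"]
df[numeric_cols] = df[numeric_cols].fillna(df[numeric_cols].median())
df = df.drop_duplicates()

# Feature transformation: standardize numeric features
scaler = StandardScaler()
df[numeric_cols] = scaler.fit_transform(df[numeric_cols])
```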
Key Components/Features & Modules (Cont.)
• Machine Learning Model Training & Optimization
• Algorithm Used:
• TD3 (Twin Delayed DDPG) Reinforcement Learning for dynamic discounting.
• Training Setup:
• State Space: (Product price, demand factor, stock level, expiration days).
• Action Space: (Discount % applied dynamically).
• Reward = Profit - Overstocking Penalty - Expiration Loss
• Optimization:
• Learning Rate: 0.0003
• Training Episodes: 5000+ for convergence.
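A minimal sketch of the training setup just listed, assuming stable-baselines3 and gymnasium are available; the environment dynamics (demand response, penalty weights, initial state) are simplified placeholders rather than the project's actual simulator.

```python
# Illustrative TD3 setup; the pricing environment below is a toy model.
import numpy as np
import gymnasium as gym
from gymnasium import spaces
from stable_baselines3 import TD3


class PerishablePricingEnv(gym.Env):
    """State: (price, demand factor, stock level, days to expire); action: discount fraction."""

    def __init__(self):
        super().__init__()
        self.observation_space = spaces.Box(low=0.0, high=np.inf, shape=(4,), dtype=np.float32)
        self.action_space = spaces.Box(low=0.0, high=0.9, shape=(1,), dtype=np.float32)

    def reset(self, seed=None, options=None):
        super().reset(seed=seed)
        self.state = np.array([100.0, 1.0, 50.0, 10.0], dtype=np.float32)
        return self.state, {}

    def step(self, action):
        price, demand, stock, days_left = self.state
        discount = float(action[0])
        sale_price = price * (1.0 - discount)
        units_sold = min(stock, demand * (1.0 + 5.0 * discount))  # toy demand response
        stock -= units_sold
        days_left -= 1.0
        profit = units_sold * sale_price
        overstock_penalty = 0.5 * stock                     # holding cost on unsold units
        expiration_loss = price * stock if days_left <= 0 else 0.0
        reward = profit - overstock_penalty - expiration_loss
        self.state = np.array([price, demand, stock, days_left], dtype=np.float32)
        terminated = bool(days_left <= 0 or stock <= 0)
        return self.state, float(reward), terminated, False, {}


env = PerishablePricingEnv()
model = TD3("MlpPolicy", env, learning_rate=3e-4, verbose=0)  # learning rate 0.0003
model.learn(total_timesteps=50_000)  # roughly corresponds to several thousand episodes
```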
Visualizations and Insight
Fig 1.2 Fig 1.3
Algorithms and Methods Used
• TD3 (Twin Delayed Deep Deterministic Policy Gradient)
Best Fit for continuous action spaces like flexible discounts
Key Strengths: Handles noise, avoids overestimation
• Heuristic Methods
Reward method
Probability model to estimate the demand factor (see the sketch below)
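The probability model for the demand factor is only summarized above; the sketch below shows one assumed functional form (not the project's exact heuristic), where purchase probability rises with the discount and falls as the product nears expiry.

```python
# Illustrative demand-factor heuristic; the functional form and constants are assumptions.
import numpy as np

def demand_factor(discount: float, days_to_expire: int, base_demand: float = 1.0) -> float:
    """Expected demand multiplier for a given discount and remaining shelf life."""
    freshness = 1.0 / (1.0 + np.exp(-(days_to_expire - 3)))  # drops toward 0 near expiry
    price_sensitivity = 1.0 + 2.0 * discount                 # deeper discounts lift demand
    return base_demand * freshness * price_sensitivity

print(demand_factor(discount=0.3, days_to_expire=5))  # e.g. ~1.41
```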
Technologies, Frameworks & Tools Used
• Programming Language: Python
Results and Analysis
User Input when DQN is implemented (good fit for discrete pricing actions; handles only
discrete data)
Fig 1.4
Results and Analysis
Visualization using DQN:
● Not efficient due to improper model training
Fig 1.5
Results and Analysis
User Input when A2C is implemented:
Decent Fit for both continuous & discrete actions but less stable than TD3
Fig 1.6
Results and Analysis
Visualization using A2C:
● A2C has high gradient variance and limited exploration.
Fig 1.7
Results and Analysis
User Input when TD3 is implemented: (Best Model Performance)
Fig 1.8
Results and Analysis
Visualization using TD3:
● Reduces overestimation bias with twin critics.
Fig 1.9
Conclusion and Future Work
Key Findings:-
● TD3 handled continuous action spaces (prices) better, giving more stable and realistic
pricing compared to A2C (high variance) and DQN (discrete-only).
● A well-shaped reward (profit vs. penalties) was key to training efficiency and realistic
pricing decisions.
● Flask integration proved effective for testing real-time predictions and making the
solution user-interactive (see the sketch after these findings).
● TD3 required more training time than DQN but produced more accurate and profitable
pricing decisions.
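A hedged sketch of the Flask integration mentioned in the findings above: a single endpoint that loads a saved TD3 policy and returns a suggested discount for a given product state. The model path and JSON field names are illustrative assumptions, not the project's actual interface.

```python
# Illustrative Flask endpoint; model path and request fields are assumptions.
import numpy as np
from flask import Flask, request, jsonify
from stable_baselines3 import TD3

app = Flask(__name__)
model = TD3.load("td3_pricing")  # hypothetical path to the trained policy

@app.route("/predict", methods=["POST"])
def predict():
    data = request.get_json()
    state = np.array([data["price"], data["demand_factor"],
                      data["stock_level"], data["days_to_expire"]], dtype=np.float32)
    action, _ = model.predict(state, deterministic=True)
    return jsonify({"suggested_discount": float(action[0])})

if __name__ == "__main__":
    app.run(debug=True)
```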
Conclusion and Future Work
Future Work:-
References
[2] T. Yavuz and O. Kaya, "Deep reinforcement learning algorithms for dynamic pricing and
inventory management of perishable products," Applied Soft Computing, art. no. 111864, 2024.
https://doi.org/10.1016/j.asoc.2024.111864
[3] J. Shen, Y. Wang, and F. Xiao, "Dynamic Pricing Strategy for Data Product Through Deep
Reinforcement Learning," IEEE Access, vol. 12, pp. 194829-194838, 2024.
https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10810405
● Web Resources
○ Kaggle. (n.d.). Grocery Inventory and Sales Dataset www.kaggle.com