Reinventing Grocery Shopping With Reinforcement Learning
All content following this page was uploaded by Prof Yamarthi Narasimha Rao on 18 January 2025.
explored various technologies and methodologies, one of which is the utilization of agent-based systems powered by reinforcement learning (RL) algorithms.

II. OBJECTIVES

This work introduces a dynamic and adaptable grocery shopping recommendation system built on Q-learning, a reinforcement learning algorithm, combined with sentiment analysis. Its main aim is to apply sentiment analysis to user-generated reviews and thereby understand how grocery products are perceived. One objective is to develop and optimize the recommendation model from user input over successive Q-learning iterations. Another is to adapt the system to changing user choices, habits, and sentiment patterns. The work also aims to create an easy-to-use interface that integrates with online grocery purchasing systems, improving the user experience by making sentiment-driven product suggestions easier to navigate. A further objective is to shorten the recommendation system's learning curve so that accurate and relevant grocery product recommendations are available from the start. Improving suggestion accuracy is intended to increase customer satisfaction and engagement on grocery shopping platforms: the system offers suggestions that match consumers' tastes, making grocery shopping more pleasant and personalized. Evaluating the influence of the Q-learning-driven, sentiment-based recommendation system on user satisfaction and engagement is a central goal. Additional objectives are to help businesses understand grocery customer attitudes and preferences, and to help retailers use sentiment-driven analytics for product selection, promotion, and inventory management. Finally, the work examines whether sentiment-driven suggestions can optimize the grocery supply chain by affecting inventory, waste, and sustainability.

III. RELATED WORKS

In [1], researchers developed an agent-based grocery shopping system that automates shopping while considering consumer preferences. They employed a network of agents to acquire grocery store data, compared it against consumer preferences, and adapted over time based on user input. Social grocery shopping, where people share prices and quantities to obtain the greatest savings and most convenient shopping schedules, was established in [2]. This technique empowers customers to make better buying choices and save money. Their strategy relies on agents representing individual clients, which simplifies deployment and builds confidence by revealing which agents can supply trustworthy information. The authors ensure realism and relevance by using shopping lists based on the U.S. Consumer Price Index. The proposed solution addressed a frequent problem in conventional grocery shopping: the absence of price comparison tools and the hassle of visiting multiple locations to find the best discounts. To democratize the shopping experience and foster fairer relationships between consumers and shops, imagine an online platform where users can exchange pricing information and receive recommendations for the stores with the lowest overall costs for their preferred goods. However, effective implementation and incentivization tactics are needed to get consumers to actively engage and provide correct pricing data. The research evaluates the multi-agent shopping system's resilience to faults and manipulation, such as price misreporting or store pricing adjustments. Researchers tested the system's capacity to withstand obstacles and optimize client cost reductions using simulations with random and systematic errors. By assessing the approach's practicality and dependability, this study informs discussions on practical implementation and the possible advantages for customers in real-world grocery shopping situations. The experiments also showed that the proposed multi-agent shopping system can save money and remain resilient to pricing inaccuracies, which had not previously been examined.

Researchers developed an agent-based grocery shopping system to automate the shopping process by matching store data to consumer preferences [3]. Their solution uses user, information management, and store server role agents that cooperate to fulfill user objectives. The proposed system buys the best-suited products depending on the user's desire to save time and effort. It meets the grocery shopping system's functional criteria and supports the five phases of the customer purchasing behavior model. However, this study does not specifically discuss RL for adjusting to user preferences over time, which distinguishes it from later work.

Researchers introduced an agent-based shopping system to help supermarket consumers at home [4]. The lightweight agent implementation TEEMA (TRLabs Execution Environment for Mobile Agents) provides basic agent communication, migration, and location services using a microkernel approach. Name, storage, security, and database services may be added to TEEMA to improve functionality. The proposed system lets users send agents with shopping lists to chosen supermarkets, where they retrieve restricted pricing lists. The system protects user data during agent travel via a residential gateway. The system displays the search results and sends an agent to a supermarket via the residential gateway to get comprehensive price lists if the user visits. User location, home gateway, mobile terminal, and participating supermarkets are the logical components of the distributed system architecture. The authors noted that their agent-based system automates inventory management, helps supermarket shoppers choose based on price and availability, and provides real-time special-offer updates. They also addressed integration with legacy database and server technologies, and they acknowledge the problems and limits, notably in privacy protection, user authentication, and system resilience to faults or manipulation. Comparing the agent-based approach to supermarket shopping with others reveals its benefits and flaws in maximizing the customer shopping experience.

E-commerce has grown dramatically in recent decades due to advances in information and telecommunications technologies and changing lifestyles. Horizontal collaboration among enterprises may reduce transportation costs, improve service quality, reduce environmental effects, mitigate risk, and increase market share, according to researchers [5]. Recently, e-grocery, especially fresh produce, has become the most cost-effective and time-efficient delivery method. Food safety, storage temperature, and perishability are logistical concerns for e-groceries. The authors examined how cooperation-based initiatives affected service quality in Pamplona supermarkets
[6]. The process began with a rigorous Pamplona survey to model e-grocery demand. Consumers demand longer shelf life, but merchants prefer to send shorter-lived commodities first to prevent food waste. Second, using the survey data, the authors created an agent-based simulation model for cooperative and non-cooperative scenarios. The simulation framework generates and solves a Vehicle Routing Problem using a biased randomization approach. In conclusion, horizontal collaboration in e-grocery delivery reduces lead times and improves consumer satisfaction. Table I lists the e-grocery contributions.

TABLE I. CONTRIBUTIONS TO E-GROCERY, HORIZONTAL COOPERATION, AND AGENT-BASED SIMULATION [6]

[1] Method: An agent-based grocery shopping system automates shopping by gathering information from multiple stores, comparing it with user preferences, and adapting over time through feedback.
    Demerits: As the number of users and transactions increases, managing a large network of agents and ensuring efficient communication between them may become challenging. This complexity could hinder the system's ability to scale effectively to accommodate a growing user base and handle increasing transaction volumes.

[2] Method: The proposed method entails customers exchanging information on item prices and quantities to optimize savings and convenience, facilitated by agents representing customers.
    Demerits: The system's reliance on gathering and analyzing user data to tailor recommendations raises significant privacy and security concerns. Collecting sensitive information about users' preferences, purchasing habits, and location data may pose risks if not adequately protected.

[3] Method: The proposed method involves an agent-based grocery shopping system that consists of three role agents: a user agent, an information management agent, and a store server agent, which cooperate to purchase groceries according to user preferences.
    Demerits: Addressing algorithmic bias requires ongoing monitoring, evaluation, and mitigation strategies to ensure equitable treatment for all users. Without careful design and oversight, the system may inadvertently favor certain demographics or product categories over others, leading to disparities in recommendation accuracy and fairness.

[4] Method: The method involves employing TEEMA, an agent platform, to facilitate agent-based shopping, enabling users to send agents with shopping lists to select supermarkets, retrieve price lists, and receive real-time updates on special offers, ensuring privacy and compatibility with legacy software.
    Demerits: While the system aims to automate and optimize the grocery shopping process, it may limit user control and transparency over decision-making. Users may feel disenfranchised if they perceive the system as making decisions on their behalf without sufficient input or explanation. Providing users with greater control over preferences, recommendations, and decision-making processes, as well as enhancing transparency in how the system operates, is essential to foster trust and user acceptance.

[5] Method: An agent-based simulation model to evaluate the effect of horizontal cooperation on lead times and customer satisfaction in e-grocery distribution.
    Demerits: The effectiveness of the system is contingent on various external factors, including the availability and accuracy of data from grocery stores, fluctuations in market conditions, and the reliability of communication networks. Any disruptions or inaccuracies in these external factors could compromise the system's performance and user experience. Mitigating dependencies may require redundancies, fallback mechanisms, and continuous monitoring to ensure robustness and resilience.

[6] Method: The proposed method is an agent-based micromodel for simulating spatial choice in grocery shopping behavior based on individual populations.

An agent-based supermarket shopping system tackles persistent issues in conventional grocery buying. With traditional techniques, consumers must search several retailers for desired products at competitive prices [7]. These strategies may also fail to meet customer preferences, resulting in inferior selections. Given these inefficiencies, the proposed solution uses modern technology and algorithms to transform grocery shopping. The e-commerce and consumer behavior literature emphasizes the value of individualized suggestions for customer satisfaction and engagement. Researchers such as [8] have studied customized recommendation systems, which adjust product recommendations to customer preferences. Traditional recommendation fails to account for price changes and item availability in real time. The proposed approach bridges this gap by incorporating RL algorithms that allow agents to learn and adjust their decision-making processes in changing settings, giving users more customized and relevant suggestions. RL algorithms can optimize decision-making in dynamic contexts, as shown by Sutton and Barto's pioneering work. Modeling the grocery shopping problem as a Markov Decision Process (MDP) [9] and using Q-learning allows agents to make intelligent store-selection and item-purchasing choices. This method optimizes grocery shopping by exploiting RL algorithms' capacity to learn from experience and modify decision-making rules [10]. Unlike the static recommendation systems used on standard e-commerce platforms, the proposed solution uses RL algorithms. The system uses RL to adapt to changing customer preferences, market circumstances, and store offers in real time, making shopping more customized and efficient [11]. The proposed solution also follows e-commerce automation and optimization trends, which aim to use cutting-edge technology to solve long-standing problems and improve customer satisfaction [12]. In conclusion, the agent-based grocery purchasing system combines e-commerce, customized recommendation systems, and reinforcement learning. The system uses powerful algorithms and real-time data to improve grocery shopping by increasing convenience, customization, and efficiency. Through iterative improvement and adaptation, it could make grocery shopping smoother, more personalized, and more delightful for customers globally [13].
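The MDP framing cited above can be written out explicitly. The notation below is our own illustrative sketch (neither [9] nor this paper states these formulas in this form): the state pairs the agent's current store with a binary vector of already-purchased items, and the standard Q-learning and value-iteration updates then apply.

```latex
% State: current store c and binary purchased-items vector b
s = (c, \mathbf{b}), \qquad c \in \{1, \dots, K\}, \qquad \mathbf{b} \in \{0, 1\}^{n}

% Q-learning update after taking action a in state s, observing reward r and next state s'
Q(s, a) \leftarrow Q(s, a) + \alpha \left[ r + \gamma \max_{a'} Q(s', a') - Q(s, a) \right]

% Bellman optimality update used by value iteration
V(s) \leftarrow \max_{a} \sum_{s'} P(s' \mid s, a) \left[ R(s, a, s') + \gamma V(s') \right]
```

Here the actions a range over the stores the agent may travel to, and the reward r bundles the purchase bonuses, item prices, and travel distances described in Section IV.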
IV. PROPOSED METHODOLOGY

The complete process of the grocery shopping system is shown in Fig. 1.

A. Methodology

The code initializes the problem parameters in Data Setup. This includes MRPs, stores, goods, etc. A dictionary stores the MRPs, with prices for each item. The state space defined in the state-space definition phase includes all agent states. Each state has two parts: the store number and the item purchase status. Whether an item has been purchased is represented by a binary vector. Transition probabilities define the chance of changing states after an action, and rewards are the immediate benefits or costs of moving from one state to another. Transition probabilities and rewards for each action (choosing a store) from one state to another are assigned in the next phase, taking into account store availability, pricing, and distances.

B. Q-Learning Algorithm

Q-learning, a model-free reinforcement learning algorithm, learns optimal policies. For each state-action pair, the Q-value represents the predicted future benefit of taking that action in that state. The Q-learning function iterates across multi-step episodes. At each step, the agent chooses an action based on an epsilon-greedy strategy (balancing exploration and exploitation), updates Q-values based on the reward and transition, and accumulates rewards. After Q-learning, the code evaluates the resulting policy against a random policy: it simulates purchasing situations and compares the rewards earned by both. Collecting rewards from Q-learning episodes and evaluating the learned policy against the random policy are done in this module, which visualizes policy learning and performance. Each state's best action (store) indicates the learned optimal policy. Reward plots during learning and testing are also shown to assess policy performance and learning. Utility functions implement the Q-learning algorithm and the reward calculation; they compute penalties using item prices, store distances, item availability probabilities, and transition probabilities. The Q-learning algorithm runs for a set number of episodes and steps, and Q-values are updated based on rewards and transitions to improve the learned policy. This component gathers rewards and compares the learned policy against a random policy across many scenarios, measuring learned-policy performance against the baseline (random) policy. This research aims to optimize purchasing selections across several stores using Q-learning to evaluate product availability, pricing, and store distances. Q-learning is a model-free RL algorithm that learns the best action-selection policy for an environment without a dynamics model. For grocery shopping optimization, Q-learning helps the autonomous agent learn the expected cumulative benefit of taking a certain action in a given state. The Q-learning implementation is shown in Fig. 2.

Fig. 2. Implementation using Q-Learning

In Fig. 3, a model-based RL technique called value iteration computes the optimal value function for each environment state, indicating the expected cumulative reward from that state onward. Value iteration calculates the optimal value function for grocery shopping optimization by updating each state's value according to the Bellman equation, which describes the link between a state's value and the values of its neighboring states. The agent initializes the value function arbitrarily and iteratively updates it until convergence, bringing the values closer to their optimal values. After computing the optimal value function, the agent can choose the action with the greatest expected value in each state to maximize cumulative rewards. Value iteration is beneficial for grocery shopping optimization when the shopping environment is well defined, because it applies a systematic approach to finding the best policy under known dynamics.
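The paper does not include its code, but the tabular loop described above (epsilon-greedy store selection, with rewards combining per-item purchase bonuses, item prices, and travel distances) can be sketched on a toy instance. All store counts, prices, distances, and hyperparameters below are invented for illustration and are not the paper's actual values:

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative data: 3 stores, 2 items. prices[s][i] = price of item i at
# store s; dist[s][s2] = travel cost between stores (all numbers made up).
prices = np.array([[4.0, 7.0],
                   [5.0, 6.0],
                   [3.5, 8.0]])
dist = np.array([[0.0, 2.0, 4.0],
                 [2.0, 0.0, 3.0],
                 [4.0, 3.0, 0.0]])
n_stores, n_items = prices.shape

# State = (current store, bitmask of purchased items); action = next store.
n_states = n_stores * (1 << n_items)

def sid(store, mask):
    """Flatten (store, purchased-bitmask) into a single state index."""
    return store * (1 << n_items) + mask

def step(store, mask, action):
    """Travel to `action` store and buy every still-missing item there.
    Reward = fixed bonus per item bought, minus its price and travel cost."""
    reward = -dist[store][action]
    for item in range(n_items):
        if not mask & (1 << item):
            reward += 10.0 - prices[action][item]
            mask |= 1 << item
    return action, mask, reward

# Tabular Q-learning with an epsilon-greedy behaviour policy.
alpha, gamma, eps = 0.1, 0.9, 0.1
Q = np.zeros((n_states, n_stores))
for episode in range(2000):
    store, mask = 0, 0                      # start at store 0, nothing bought
    for _ in range(5):                      # step limit per episode
        s = sid(store, mask)
        if rng.random() < eps:              # explore
            a = int(rng.integers(n_stores))
        else:                               # exploit
            a = int(np.argmax(Q[s]))
        store, mask, r = step(store, mask, a)
        s2 = sid(store, mask)
        # Q-learning update: bootstrap on the next state's best Q-value.
        Q[s, a] += alpha * (r + gamma * np.max(Q[s2]) - Q[s, a])
        if mask == (1 << n_items) - 1:      # all items purchased: episode ends
            break

# Read off the learned policy's first store choice from the start state.
print(int(np.argmax(Q[sid(0, 0)])))
```

The update line is exactly the epsilon-greedy tabular scheme this section describes; a fuller implementation would add the stochastic item-availability and transition probabilities mentioned above rather than this deterministic toy transition.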
Fig. 3. Implementation of Value Iteration

C. Proposed Model

This work proposes a solution to grocery buying problems. An ecosystem resembling a set of food stores was constructed. Each store has its own inventory, availability, and pricing patterns. Distances between each pair of stores in the ecosystem are also defined; real-world or simplified distances may be used for simulation. The agent's state comprises its present store and its purchasing status, which records whether each item on the shopping list has been purchased. The agent may move between stores depending on the situation; a list of stores the agent may visit from the present store is provided. Given a state, the agent acts according to a policy, which reinforcement learning algorithms such as Q-learning or policy iteration can learn. Agents earn positive rewards for finding a needed item in a store; rewards may be fixed or depend on the item's price. If the item is not found at the store, the agent may be penalized to encourage searching elsewhere. The agent may be penalized for traveling between stores, to reduce distance, and for item costs, prompting it to locate cheaper options. The transition model calculates the probability of changing states depending on the agent's action and its result. Historical data or a distribution (e.g., a Bernoulli distribution) may model a store's item availability, and distance and traffic circumstances may affect the likelihood of switching stores. The transition probabilities may also account for merchandise pricing, preferring stores with cheaper costs.

A. Experimental Setup

Multi-core processors with clock speeds of 2.5 GHz or greater are suggested for reinforcement learning algorithms, especially during training. Reinforcement learning tasks are memory-intensive, particularly with big datasets, hence 16 GB of RAM is recommended. NVIDIA GPUs with CUDA support accelerate reinforcement learning model training; at least 4 GB of GPU VRAM is recommended. An SSD of at least 512 GB is suggested for quicker data access, model storage, and retrieval. Downloading datasets, model updates, and cloud-based training or deployment need a reliable, fast internet connection. Many machine learning tools and frameworks work well with Linux-based operating systems such as Ubuntu 18.04 or above. The preferred language for reinforcement learning algorithms is Python 3.x; NumPy, PyTorch, and OpenAI Gym are essential. The deep learning frameworks TensorFlow and PyTorch implement reinforcement learning neural network designs. Jupyter Notebook or Visual Studio Code may be used to code, test, and debug reinforcement learning systems. Unity ML-Agents or bespoke OpenAI Gym environments may simulate grocery shopping.

B. Results

Fig. 4 shows the connection between episode rewards and episode count, revealing the Q-learning algorithm's learning ability; the increasing trend shows its capacity to optimize rewards as it learns. In Fig. 4, the agent interacts with the grocery shopping environment, making choices (actions) based on its present state and getting feedback (rewards) depending on its actions. During each interaction, the agent updates its Q-values using the Q-learning update rule, which incorporates the observed reward and the maximum Q-value of the following state. The agent learns the ideal Q-values via repeated interactions and updates, allowing it to choose actions at various stages to optimize cumulative rewards such as cost savings, time efficiency, and customer satisfaction. Fig. 5 compares Q-learning rewards to random-policy rewards over 100 experiments. Each test starts from a random state, follows the respective policy, and earns rewards until the episode ends or a step limit is met. The x-axis shows the test number and the y-axis the rewards, enabling comparison of the two policies' performance. The graph in Fig. 6 runs value iteration 1000 times to obtain the best policy, then compares the rewards from a random policy with those of the optimal policy from value iteration, summing and displaying them for visual comparison.
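As a companion to the Fig. 6 comparison, a value-iteration sweep on the same kind of toy instance can be sketched as follows. The numbers are again invented, and for brevity the transitions are deterministic rather than the probabilistic model the paper describes:

```python
import numpy as np

rng = np.random.default_rng(1)

# Same illustrative setup as the Q-learning sketch: 3 stores, 2 items.
prices = np.array([[4.0, 7.0], [5.0, 6.0], [3.5, 8.0]])
dist = np.array([[0.0, 2.0, 4.0], [2.0, 0.0, 3.0], [4.0, 3.0, 0.0]])
n_stores, n_items = prices.shape
full = (1 << n_items) - 1
n_states = n_stores * (1 << n_items)

def sid(store, mask):
    return store * (1 << n_items) + mask

def step(store, mask, action):
    """Deterministic toy transition: travel, then buy all missing items."""
    reward = -dist[store][action]
    for item in range(n_items):
        if not mask & (1 << item):
            reward += 10.0 - prices[action][item]
            mask |= 1 << item
    return action, mask, reward

# Value iteration: sweep the Bellman optimality update until convergence.
gamma = 0.9
V = np.zeros(n_states)
for _ in range(1000):
    V_new = np.zeros(n_states)
    for store in range(n_stores):
        for mask in range(full + 1):
            if mask == full:
                continue                    # terminal: everything bought
            best = -np.inf
            for a in range(n_stores):
                s2_store, s2_mask, r = step(store, mask, a)
                best = max(best, r + gamma * V[sid(s2_store, s2_mask)])
            V_new[sid(store, mask)] = best
    if np.max(np.abs(V_new - V)) < 1e-9:    # converged
        V = V_new
        break
    V = V_new

def greedy(store, mask):
    """Act greedily with respect to the computed value function."""
    return max(range(n_stores),
               key=lambda a: step(store, mask, a)[2]
               + gamma * V[sid(*step(store, mask, a)[:2])])

def random_policy(store, mask):
    return int(rng.integers(n_stores))

def run(policy):
    """Roll out one episode from the start state and return its total reward."""
    store, mask, total = 0, 0, 0.0
    for _ in range(5):
        store, mask, r = step(store, mask, policy(store, mask))
        total += r
        if mask == full:
            break
    return total

opt = run(greedy)
rnd = np.mean([run(random_policy) for _ in range(100)])
print(opt >= rnd)
```

On this toy problem the greedy policy derived from the converged value function never does worse than the random baseline, which is the qualitative behavior the Fig. 6 comparison reports.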
Fig. 5. Comparison of Q-Learning with Random Algorithm for Rewards vs. Test No.

Fig. 6. Value Iteration vs. Random Algorithm

VI. CONCLUSION AND FUTURE SCOPE

AI-enabled grocery shopping optimization offers several exciting research and development opportunities. For a truly customized shopping experience, RL algorithms can be refined to provide more personalized recommendations based on diet, health goals, and culture. The present work focused on extending multi-agent system research to model complex customer, merchant, and stakeholder interactions, increasing joint optimization and system efficiency. The work aimed at developing adaptable RL algorithms that adjust shopping strategies to changes in product availability, pricing, and store layout. Sustainability principles in the RL framework promote food waste reduction, eco-friendly product selection, and carbon-efficient transportation. The present work focused on

References

[1] Joo, Kwang Hyoun, Tetsuo Kinoshita, and Norio Shiratori. "Agent-based grocery shopping system based on user's preference." Proceedings Seventh International Conference on Parallel and Distributed Systems: Workshops. IEEE, 2000.
[2] Du, Hongying, and Michael N. Huhns. "A multiagent system approach to grocery shopping." Advances on Practical Applications of Agents and Multiagent Systems: 9th International Conference on Practical Applications of Agents and Multiagent Systems. Berlin, Heidelberg: Springer, 2011.
[3] Joo, Kwang Hyoun, Tetsuo Kinoshita, and Norio Shiratori. "Design and implementation of an agent-based grocery shopping system." IEICE Transactions on Information and Systems 83.11 (2000): 1940-1951.
[4] Benedicenti, L., Xuguang Chen, Xiaoran Cao, and R. Paranjape. "An agent-based shopping system." Canadian Conference on Electrical and Computer Engineering 2004 (IEEE Cat. No. 04CH37513), Niagara Falls, ON, Canada, 2004, pp. 703-705 Vol. 2, doi: 10.1109/CCECE.2004.1345210.
[5] Serrano-Hernandez, Adrian, et al. "Agent-based simulation improves e-grocery deliveries using horizontal cooperation." 2020 Winter Simulation Conference (WSC). IEEE, 2020.
[6] Schenk, Tilman A., Günter Löffler, and Jürgen Rauh. "Agent-based simulation of consumer behavior in grocery shopping on a regional level." Journal of Business Research 60.8 (2007): 894-903.
[7] Chen, Yu-San, and Chang-Franw Lee. "Agent-based simulation of consumer purchasing behaviour in a virtual environment." Proceedings of the 2009 International Conference on Artificial Intelligence (ICAI'09).
[8] Mangioni, Giuseppe, Rosario Sinatra, Vincenzo Nicosia, and Vito Latora. "A multi-agent system for modelling consumer behaviour in supermarkets." Journal of Artificial Societies and Social Simulation.
[9] Kaniovski, Yuri, and Martin Summer. "Agent-based modeling of store choice dynamics." Journal of Artificial Societies and Social Simulation.
[10] Zsolt, Kozma, and Máté Csorba. "Agent-based simulation of consumer behaviour in the retail sector." 7th International Conference on Applied Informatics, 2015.
[11] Piccione, Michele. "Agent-based modeling in consumer economics." Annual Review of Economics, 2016.
[12] Boskovic, Pavle, Srdjan Boskovic, and Aleksandar Ivanovic. "Agent-based model of consumer decision making process in online grocery shopping." 2018 17th International Symposium INFOTEH-JAHORINA (INFOTEH).
[13] Park, Jungsoo, and Hyunju Park. "Agent-based simulation for understanding consumers' shopping behavior in virtual supermarket." 2014 International Conference on Control, Automation and Information Sciences (ICCAIS).