Knowledge-Based Reinforcement Learning For Data Mining
Extended Abstract
Data Mining is the process of extracting patterns from data. Two general avenues of research in the intersecting areas of agents and data mining can be distinguished. The first approach is concerned with mining an agent's observation data in order to extract patterns, categorize environment states, and/or make predictions of future states. In this setting, data is normally available as a batch, and the agent's actions and goals are often independent of the data mining task. The data collection is mainly considered a side effect of the agent's activities. Machine learning techniques applied in such situations fall into the class of supervised learning. In contrast, the second scenario occurs where an agent is actively performing the data mining and is responsible for the data collection itself. For example, a mobile network agent may acquire and process data (where the acquisition may incur a certain cost), or a mobile sensor agent may move in a (perhaps hostile) environment, collecting and processing sensor readings. In these settings, the tasks of the agent and the data mining are highly intertwined and interdependent (or even identical), and supervised learning is not a suitable technique for these cases.

Reinforcement Learning (RL) enables an agent to learn from experience (in the form of reward and punishment for explorative actions) and to adapt to new situations without a teacher. RL is an ideal learning technique for these data mining scenarios, because it fits the agent paradigm of continuous sensing and acting, and the RL agent is able to learn to make decisions on the sampling of the environment which provides the data. Nevertheless, RL still suffers from scalability problems, which have prevented its successful use in many complex real-world domains: the more complex the task, the longer it takes a reinforcement learning algorithm to converge to a good solution.

For many real-world tasks, human expert knowledge is available. For example, human experts have developed heuristics that help them in planning and scheduling resources in their workplace. However, this domain knowledge is often rough and incomplete. When the domain knowledge is used directly by an automated expert system, the solutions are often sub-optimal, due to the incompleteness of the knowledge, the uncertainty of environments, and the possibility of encountering unexpected situations. RL, on the other hand, can overcome the weaknesses of heuristic domain knowledge and produce optimal solutions. In this talk we propose two techniques which represent first steps in the area of knowledge-based RL (KBRL).
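Both techniques build on the standard RL loop. The following is a minimal sketch of tabular Q-learning illustrating learning from reward alone, without a teacher; the environment interface (`reset`, `step`, `actions`) and all parameter values are illustrative assumptions rather than part of our experimental setup.

```python
import random
from collections import defaultdict

def q_learning(env, episodes=500, alpha=0.1, gamma=0.95, epsilon=0.1):
    """Tabular Q-learning. `env` is assumed to expose reset() -> state,
    step(action) -> (next_state, reward, done), and a finite list
    `actions`; this interface is a hypothetical stand-in, not a
    specific library's API."""
    Q = defaultdict(float)  # Q[(state, action)] -> estimated return

    for _ in range(episodes):
        state, done = env.reset(), False
        while not done:
            # Epsilon-greedy exploration: occasionally try a random action.
            if random.random() < epsilon:
                action = random.choice(env.actions)
            else:
                action = max(env.actions, key=lambda a: Q[(state, a)])

            next_state, reward, done = env.step(action)

            # Temporal-difference update towards the reward plus the
            # bootstrapped value of the best next action.
            best_next = max(Q[(next_state, a)] for a in env.actions)
            Q[(state, action)] += alpha * (
                reward + gamma * best_next - Q[(state, action)]
            )
            state = next_state
    return Q
```

The techniques described next modify only the `reward` term in this temporal-difference update.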
The first technique [1] uses high-level STRIPS operator knowledge in reward shaping to focus the search for the optimal policy. Empirical results show that the plan-based reward shaping approach outperforms other RL techniques, including alternative manual and MDP-based reward shaping when the latter is used in its basic form. We showed that MDP-based reward shaping may fail, and our successful experiments with STRIPS-based shaping suggest modifications which can overcome the encountered problems. The STRIPS-based method we propose allows the same domain knowledge to be expressed in a different way, and the domain expert can choose whether to define an MDP or a STRIPS planning task. We also evaluated the robustness of the proposed STRIPS-based technique to errors in the plan knowledge.

In case STRIPS knowledge is not available, we propose a second technique [2] that shapes the reward with hierarchical tile coding: where the Q-function is represented with a fine, low-level tile coding, a V-function with a coarser tile coding can be learned in parallel and used to approximate the potential for ground states. A sketch of the shaping mechanism shared by both techniques is given at the end of this abstract.

In the context of data mining, our KBRL approaches can also be used for any data collection task where the acquisition of data may incur considerable cost. In addition, observing the data collection agent in specific scenarios may lead to new insights into optimal data collection behaviour in the respective domains. In future work, we intend to demonstrate and evaluate our techniques on concrete real-world data mining applications.
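Both techniques instantiate potential-based reward shaping: the agent learns from the shaped reward r + F(s, s') with F(s, s') = γΦ(s') − Φ(s), where Φ is a potential over states. The sketch below shows a plan-based potential in this style; the plan representation and the `abstraction` mapping are simplified, hypothetical stand-ins rather than the exact formulation of [1].

```python
def plan_potential(plan, abstraction):
    """Potential function derived from an abstract (e.g. STRIPS) plan.

    `plan` is assumed to be the list of abstract states on the path
    from the initial state to the goal, and `abstraction` maps a ground
    state to its abstract state; both are illustrative assumptions.
    States further along the plan receive a higher potential, and
    off-plan states fall back to potential 0 -- a simplified stand-in
    for the plan-based potential of [1].
    """
    step_of = {abstract: i for i, abstract in enumerate(plan)}
    return lambda state: float(step_of.get(abstraction(state), 0))


def shaped_reward(reward, state, next_state, phi, gamma=0.95):
    """Potential-based shaping: add F(s, s') = gamma * phi(s') - phi(s)
    to the environment reward, a form of shaping known to leave the
    optimal policy unchanged."""
    return reward + gamma * phi(next_state) - phi(state)
```

In the Q-learning sketch above, `reward` in the update is simply replaced by `shaped_reward(reward, state, next_state, phi)`. When no plan knowledge is available, the same scheme applies with `phi` given by the V-function learned in parallel on the coarser tiling, as in [2].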
References
1. Grzes, M., Kudenko, D.: Plan-Based Reward Shaping for Reinforcement Learning. In: Fourth International IEEE Conference on Intelligent Systems, vol. 2, pp. 22–29 (2008)
2. Grzes, M., Kudenko, D.: Learning Potential for Reward Shaping in Reinforcement Learning with Tile Coding. In: Proceedings of the AAMAS 2008 Workshop on Adaptive and Learning Agents and Multi-Agent Systems (ALAMAS-ALAg 2008), pp. 17–23 (2008)