Practical Deep Reinforcement Learning with Python: Concise Implementation of Algorithms, Simplified Maths, and Effective Use of TensorFlow and PyTorch (English Edition)
By Ivan Gridin
Reinforcement Learning
Q-Learning
Machine Learning
Neural Networks
TensorFlow
Problem-Solving
Exploration
Learning From Experience
Intelligent Machines
Deep Q-Network
PyTorch
Deep Learning
Artificial Intelligence
Stock Trading
About this ebook
This book introduces reinforcement learning from a pragmatic point of view. The book does involve mathematics, but it does not attempt to overburden a reader who is new to the field of reinforcement learning.
The book brings many practical methods to the reader's attention, including Monte Carlo, Deep Q-Learning, Policy Gradient, and Actor-Critic methods. Beyond explaining these techniques in detail, the book provides real implementations of them using the power of TensorFlow and PyTorch, and covers several enticing projects that show the power of reinforcement learning. Everything is concise, up-to-date, and visually explained.
After finishing this book, the reader will have a thorough, intuitive understanding of modern reinforcement learning and its applications, which will greatly aid them in delving deeper into this interesting field.
Book preview
Practical Deep Reinforcement Learning with Python - Ivan Gridin
Part - I
The first part of the book is devoted to classical reinforcement learning methods. It covers the theoretical foundations of reinforcement learning problems and the primary techniques for solving them. One of the central concepts of this part is the Q-Learning method. Described in Chapter 6: Escaping Maze With Q-Learning, it is the cornerstone of most reinforcement learning solutions. The first part of the book can be considered an introduction to reinforcement learning.
CHAPTER 1
Introducing Reinforcement Learning
Reinforcement learning (RL) is one of the most active research areas in machine learning, and many researchers believe that it will take us closer to artificial general intelligence. In the past few years, RL has evolved rapidly and has been applied to complex problems ranging from stock trading to self-driving cars. The main driver of this growth is deep reinforcement learning, the combination of deep learning and reinforcement learning. It is this promising area of machine learning that we will study in this book.
Structure
In this chapter, we will discuss the following topics:
What is reinforcement learning?
Reinforcement learning mechanics
Reinforcement learning vs. supervised learning
Examples of reinforcement learning
Objectives
After completing this chapter, you will have a basic understanding of reinforcement learning and its key definitions. You will also have learned how reinforcement learning works and how it differs from other machine learning approaches.
What is reinforcement learning?
Reinforcement learning is a machine learning technique concerned with how agents should take actions in a surrounding environment depending on their current state. RL helps an agent maximize the cumulative reward collected over a sequence of actions. In RL, agents act in a known or unknown environment and constantly adapt and learn based on collected experience. The feedback from the environment might be positive (known as rewards) or negative (known as punishments). At this point, these definitions may seem abstract and unclear, but we will elaborate on them throughout this chapter.
The following figure represents the key concept of RL:
Figure 1.1: Reinforcement learning
Here, the agent starts in some initial state in some environment. The agent then decides to take some action. The environment reacts to the agent's action, returns some reward for that action, and transfers the agent to another state.
The most commonly used reinforcement learning terms are as follows:
Agent is the decision-maker that defines what action to take.
Examples: Self-driving car, chess player, stock trading robot
Action is a concrete act in the surrounding environment taken by the agent.
Examples: Turn the car left, move a chess pawn one cell forward, sell all assets
Environment is the problem context that the agent interacts with.
Examples: Car track, chess board, stock market
State is the position of the agent in the environment.
Examples: Car coordinates on the track and its speed, arrangement of pieces on the chessboard, prices of assets
Reward is a numerical value returned by the environment as the reaction to the agent's action.
Examples: Reaching the finish line without any accidents, winning the chess game, earning more money
RL is learning what to do, that is, how to map situations to actions, to maximize reward. The agent is not told which actions to take but must discover which actions yield the most reward by trying them. An action may affect not only the immediate reward but also the next situation and, through it, all subsequent rewards. This means that the agent should consider not only the immediate reward but also the reward in the long-term sense.
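To make this loop concrete, the following minimal Python sketch implements the agent-environment interaction from Figure 1.1. The Environment and Agent classes are toy placeholders invented for this illustration; they do not come from any library:

import random

class Environment:
    # A toy environment: the agent should reach position 10.
    def __init__(self):
        self.state = 0

    def step(self, action):
        # The environment reacts to an action by returning a new state,
        # a reward, and a flag telling whether the episode is over.
        self.state += action
        reward = 1.0 if self.state == 10 else -0.1
        return self.state, reward, self.state >= 10

class Agent:
    def act(self, state):
        # A trivial agent that explores by picking a random step size.
        return random.choice([1, 2])

env, agent = Environment(), Agent()
state, total_reward, done = 0, 0.0, False
while not done:
    action = agent.act(state)                # the agent decides on an action
    state, reward, done = env.step(action)   # the environment responds
    total_reward += reward                   # the quantity RL tries to maximize

print(f"Cumulative reward: {total_reward:.1f}")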
Reinforcement learning mechanics
In our lives, we usually try to maximize our rewards, and that does not mean we are always thinking about money or material things. For example, when we read a new book to gain new skills, we understand that it is better to read carefully, without hurrying. The way we read the book is our strategy, and the skills we gain are our reward. When we negotiate with other people, we try to be polite, and the feedback we get is our reward.
The purpose of the reward is to tell the agent how well it has behaved. The main goal of RL is to find a strategy that maximizes the reward after some number of actions. Let's look at a simple example that illustrates the reinforcement learning mechanism.
Consider the following (entirely scientifically factual) scenario. A robot has arrived on our planet. This robot is very good at designing posters but does not know how to negotiate with people. Its target is to get a job and make a lot of money in 5 years. Good plan, why not? Every day, the robot makes a particular decision about how it will act that day. At the end of the day, it checks its bank account and summarizes its standing in the company.
Let's consider the first scenario: on the first working day, the robot decides to steal a computer from the office and sell it. This may seem like a pretty good decision because it increases the robot's balance significantly. But of course, a decision like this can be made only once, and the robot's profits will stop there.
The following figure illustrates the first scenario:
Figure 1.2: First strategy
Now, let's consider the second scenario: every day, the robot works hard and learns new things. In this case, its strategy is long-term. It may be inferior to other strategies in the short term, but it will be significantly more profitable in the long term.
Figure 1.3: Second strategy
Of course, in real life, everything is much more complicated, but this example illustrates the principle that it is sometimes necessary to think several steps ahead. A solution that has a quick effect can be fatal in the long run. Reinforcement learning aims to find long-term strategies that maximize the agent's reward.
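We can check this intuition with a back-of-the-envelope calculation in Python. The daily rewards below are invented purely for illustration; the book does not attach concrete numbers to the robot's scenarios:

# Scenario 1: one big gain from the theft, then no income at all.
steal_once = [50] + [0] * 29

# Scenario 2: small daily gains that grow as the robot learns (assumed 5%/day).
work_daily = [2 * 1.05 ** day for day in range(30)]

print(sum(steal_once))           # 50
print(round(sum(work_daily)))    # ~133: the long-term strategy wins over 30 days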
Here are some essential characteristics of reinforcement learning:
There is no supervisor; the agent only receives a reward signal
Decision making is sequential
The agent's actions determine the subsequent data it receives
The term reinforcement comes from the fact that a reward received by an agent should reinforce its behavior in a positive or negative direction. A local reward indicates the success of the agent's most recent action, not the overall success achieved by the agent so far. Getting a large reward for some action does not mean that you won't face dramatic consequences later because of it. Remember our robot that decided to steal a computer: it could look like a brilliant idea until you think about the next day.
A problem can be considered an RL problem if we can define the following:
Agent: The subject that takes actions.
Environment: The system that receives the agent's actions.
Set of states: The set of states that the agent can observe. This set can be infinite.
Set of actions: The set of actions that the agent can take. This set can be infinite.
Reward: What the agent's primary goal is and how it can be achieved through some reward system.
If all of the above can be defined, you are dealing with a reinforcement learning problem. A minimal sketch of this checklist in code follows.
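As an illustration, here is the checklist written down in Python for a stock-trading robot, one of the examples used throughout this chapter. Every value below is invented for illustration:

# The five ingredients of an RL problem, filled in for stock trading.
problem = {
    "agent": "stock trading robot",                  # the decision-maker
    "environment": "stock market",                   # what reacts to its actions
    "states": "current asset prices and portfolio",  # possibly an infinite set
    "actions": ["buy", "sell", "hold"],              # possibly an infinite set
    "reward": "change in portfolio value per step",  # encodes the primary goal
}

for ingredient, definition in problem.items():
    print(f"{ingredient:12s}: {definition}")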
Reinforcement learning vs. supervised learning
Now that we have an intuitive understanding of reinforcement learning, we can examine how it differs from traditional supervised learning. A good rule of thumb is to treat reinforcement learning as a dynamic model and supervised learning as a static model. Let's elaborate on this.
We can view a supervised learning model as a statistical model that extracts correlations and patterns from data and uses them to make predictions without being explicitly programmed. Generally speaking, supervised learning performs only one action: it takes an input and returns an output. Its primary goal is to provide you with an automatically built function F that maps some input X to some output Y:
Figure 1.4: Supervised learning
Reinforcement learning, in contrast, builds an agent that interacts with an environment and produces a whole sequence of actions:
Figure 1.5: Reinforcement learning
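The difference can be captured in a few lines of Python. Both snippets are illustrative toys, not code from the book:

# Supervised learning: a single, stateless mapping F: X -> Y.
def F(x):
    return 2 * x + 1                 # one input in, one prediction out

y = F(3.0)                           # no feedback, no state, done

# Reinforcement learning: a sequence of decisions, where each action
# changes the state that the next decision will observe.
state, total_reward = 0, 0
for step in range(5):
    action = 1 if state < 3 else 0   # the decision depends on the current state
    state += action                  # the action changes the environment
    total_reward += 1 if state == 3 else 0

print(total_reward)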
Let's summarize all distinctions between reinforcement learning and supervised learning in the following table:
Table 1.1: Reinforcement learning vs. supervised learning
It is important to understand the difference between reinforcement learning and supervised learning, as this knowledge will help you use each method correctly.
Examples of reinforcement learning
In this section, we will see some popular examples of RL problems. In all these problems, we have the following: agent, environment, set of states, set of actions, and the reward.
Stock trading
This type of activity aims at making a profit by buying and selling shares of different companies. Traders tend to buy the stock of a company when it is cheap and sell it when the price is high:
Table 1.2: Stock trading as RL problem
Chess
Chess is one of the oldest games, with many different playing styles and approaches. Chess can also be framed as a reinforcement learning problem:
Table 1.3: Chess as RL problem
Neural Architecture Search (NAS)
RL has been successfully applied to the domain of Neural Architecture Search (NAS). The goal is to get the best performance on a given dataset by selecting the number of layers and their parameters, adding extra connections, or making other changes to the architecture. The reward, in this case, is the performance of the resulting neural network architecture:
Table 1.4: NAS as RL problem
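As a rough illustration of the search loop, here is a random-search baseline over a toy architecture space. A real RL-based NAS controller would learn which architectures to propose; the evaluate function below is a hypothetical stand-in for actually training a network and measuring its validation accuracy:

import random

def evaluate(num_layers, units):
    # Hypothetical reward: in real NAS, this would mean training the
    # candidate network and returning its validation performance.
    return 1.0 - abs(num_layers - 4) * 0.05 - abs(units - 64) / 1000

best_reward, best_arch = float("-inf"), None
for _ in range(50):                           # 50 candidate architectures
    arch = (random.randint(1, 8),             # action: choose number of layers
            random.choice([16, 32, 64, 128])) # action: choose layer width
    reward = evaluate(*arch)                  # reward: architecture performance
    if reward > best_reward:
        best_reward, best_arch = reward, arch

print(best_arch, round(best_reward, 3))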
As you can see, many practical problems can be solved using the reinforcement learning approach.
Conclusion
Reinforcement learning is a machine learning approach that aims to find optimal decision-making strategies. It differs from other machine learning approaches in its emphasis on the agent learning from direct interaction with its environment; it requires neither traditional supervision nor a complete computational model of the environment. Reinforcement learning aims to find a long-term strategy that allows an agent to collect the maximum reward. In the next chapter, we will study the theory of Markov decision processes, which forms the basis of the entire reinforcement learning approach.
Points to remember
A solution that has a quick effect can be fatal in the long run.
RL doesn't assume any supervisor; the agent only receives a reward signal.
RL produces a sequential decision-making strategy.
Reinforcement learning is a dynamic model, and supervised learning is a static model.
Multiple choice questions
Let's consider the popular computer game Tetris, which has relatively simple mechanics. When the player builds one or more complete rows, the completed rows disappear, and the player gains some points. The game's goal is to prevent the blocks from stacking up to the top of the screen and to collect as many points as possible.
Figure 1.6: Tetris
1. Can Tetris be considered an RL problem?
a. Yes
b. No
2. Considering Tetris as an RL problem, what is the agent?
a. Score
b. Player
c. Number of disappeared lines
3. Considering Tetris as an RL problem, what is the state?
a. Score
b. Arrangement of bricks and score
c. Arrangement of bricks, score, and the next element
Answers
1. a
2. b
3. c
Key terms
Agent: The decision-maker that defines what action to take.
Action: A concrete act in the surrounding environment taken by the agent.
Environment: The problem context that the agent interacts with.
State: The position of the agent in the environment.
Reward: A numerical value returned by the environment as the reaction to the agent's action.
CHAPTER 2
Playing Monopoly and Markov Decision Process
In the last chapter, you got a general introduction to reinforcement learning (RL). We saw examples of different problems and highlighted the main characteristics of reinforcement learning. But before we start solving practical problems, we need to describe formally how they can be solved using the RL approach. One of the cornerstones of RL is the Markov decision process (MDP). This concept is the foundation of the whole theory of reinforcement learning. We will dedicate this chapter to explaining what the Markov decision process is with the help of Monopoly game examples, and we'll discuss MDPs in greater detail as we walk through the chapter. Markov chains and Markov decision processes are used extensively in many areas of engineering and statistics, so reading this chapter will be useful for understanding not only the context of reinforcement learning but a much wider range of topics. If you're already familiar with MDPs, you can skim this chapter, focusing on the terminology definitions that will be used later in the book.
Structure
In this chapter, we will discuss the following topics:
What is the best strategy for playing Monopoly?
Markov chain
Markov reward process
Markov decision process
Policy
Monopoly as Markov decision process
Objectives
The primary goal of this chapter is to introduce fundamental concepts of reinforcement learning such as the Markov reward process, the Markov decision process, and the policy. We will look at simple and straightforward examples that reveal what lies at the heart of these concepts. This chapter will give you a clear understanding of the tasks that reinforcement learning deals with.
Choosing the best strategy for playing Monopoly
The formal mathematical explanation of the Markov decision process often confuses readers, although the concept is not as complicated as it might seem. In this chapter, we will explore what a Markov decision process is by playing the popular game of Monopoly.
Let's define a simplified version of the Monopoly game.
We will consider only simplified rules of the game here; this chapter does not require going through the complete rule set.
List of rules
Our custom simplified Monopoly game will follow the given set of rules (a short code sketch of these rules appears after the list):
Two players are playing; for the sake of simplicity, we will consider a game for two players only. We will denote the players by a square and a triangle:
Figure 2.1: Monopoly players
Each player rolls the dice and moves forward a certain number of cells:
Figure 2.2: Player 1 moves four steps forward
Each cell can be purchased for the price indicated on it. When a player lands on a free cell, they have two options:
Buy the cell
Do not buy the cell
Buying a free cell is not obligatory:
Figure 2.3: Cell prices
If a player lands on someone else's cell, then they must pay the other player 20% of the cost of the cell.
Figure 2.4: Player 1 has to pay $2 to Player 2
Each player starts the game with $100.
There are surprise cells on the board. They randomly give one of three results:
Player gets $10 from the bank
Player gives $5 to the bank
Player skips one turn
A player loses when they run out of money.
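To keep the rules unambiguous, here is a short Python sketch encoding them. The constants and function names are my own; the book defines the rules only in prose:

import random

START_CASH = 100                   # each player starts the game with $100
RENT_RATE = 0.20                   # rent is 20% of the cell's purchase price

def roll_dice():
    return random.randint(1, 6)

def rent(cell_price):
    # Landing on an owned cell: pay 20% of its cost to the owner.
    # For example, rent(10) == 2.0, matching the $2 payment in Figure 2.4.
    return RENT_RATE * cell_price

def surprise():
    # A surprise cell randomly gives one of three results.
    return random.choice([("gain", 10), ("pay", 5), ("skip_turn", 0)])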
Let's take a look at the entire board:
Figure 2.5: Monopoly playing board
Now that we have defined the rules, a more interesting question arises: what strategy should we choose for the game? It would seem that there is a reasonable and straightforward strategy: buy everything you can! Indeed, the more cells a player buys, the more rent they will receive when the other player lands on their cells. But everything is not so simple. Let's take a look at the example in Figure 2.6:
Figure 2.6: To buy or not to buy?
Suppose player 1 has only $40 left and has just landed on a cell that costs $40. Should they buy it? If player 1 buys it, the probability of losing on the next move is extremely high, because player 1 will have no money left and may land on a cell that has already been bought by player 2:
Figure 2.7: Player 1 can lose on the next turn if they buy the cell
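A quick simulation shows how risky the purchase is. The board layout below is invented: assume that 3 of the 6 cells reachable with a single die roll are owned by player 2 (the book's figure does not specify the exact layout):

import random

OWNED_STEPS = {2, 4, 6}            # die rolls that land on player 2's cells

def p_losing(trials=100_000):
    # After spending their last $40 on the cell, player 1 has $0 left,
    # so landing on any owned cell (and owing rent) means losing.
    hits = sum(random.randint(1, 6) in OWNED_STEPS for _ in range(trials))
    return hits / trials

print(f"Probability of losing on the next turn: ~{p_losing():.2f}")   # ~0.50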
As we can see, there is no trivially winning strategy in this game. A more advanced approach