Summary of Lecture 2: Locomotion Control: Biologically Inspired Artificial Intelligence (WS03: 410)
Practical work
• Recent matches (Friday November 14th):
[Block diagram: CPG and reflexes driving the actuators, with proprioceptive feedback to both]
Concept of Limit Cycle
• A limit cycle is an oscillatory regime in a dynamical system: an isolated periodic orbit that neighbouring trajectories converge to.

CPG-and-reflex control
Two types of implementations:
1. The CPG produces desired positions (sketched below): the desired angle θ̃ is compared with the measured angle θ, and a feedback (PID) controller turns the error into a motor command u sent to the robot.
2. The CPG is a neural oscillator that directly produces the motor signals (Taga 1994).
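A minimal sketch of the first implementation type, assuming the CPG is abstracted as a simple sine generator and the robot as a first-order toy joint; the PID gains, amplitude, and frequency are illustrative, not values from any model in the lecture:

```python
import numpy as np

# Implementation type 1: the CPG outputs a desired joint angle theta_des and a
# feedback (PID) controller turns the tracking error into a motor command u.
# The "CPG" below is just a sine generator; all numeric values are illustrative.

def cpg_desired_angle(t, amplitude=0.5, frequency=1.0, offset=0.0):
    """Desired joint angle produced by the (abstract) CPG at time t."""
    return offset + amplitude * np.sin(2.0 * np.pi * frequency * t)

class PID:
    def __init__(self, kp, ki, kd, dt):
        self.kp, self.ki, self.kd, self.dt = kp, ki, kd, dt
        self.integral = 0.0
        self.prev_error = 0.0

    def command(self, desired, measured):
        error = desired - measured
        self.integral += error * self.dt
        derivative = (error - self.prev_error) / self.dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative

# Control loop with a crude first-order "robot joint" standing in for the plant.
dt, theta = 0.01, 0.0
pid = PID(kp=20.0, ki=0.5, kd=1.0, dt=dt)
for step in range(1000):
    t = step * dt
    theta_des = cpg_desired_angle(t)
    u = pid.command(theta_des, theta)   # motor command
    theta += u * dt                     # toy plant: angle rate proportional to u
```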
Taga's neuromechanical simulation
[Figures: walking gait of the simulated biped and the corresponding limit cycle]

Interesting aspects:
• Locomotion seen as a limit cycle due to the global entrainment between the neuro-musculo-skeletal system and the environment
• Robustness against (small) variations in the environment (e.g. small slopes)

Cons:
• Hand-tuning of (many) parameters for obtaining satisfactory limit cycles
Inter-oscillator coupling
• Two parameters (a_ij and b_ij) per coupling

Body CPG
• Model: 40 segments
• Assumptions:
  • Lamprey-like system: chain of oscillators
  • Two oscillators per segment
  • Closest neighbour coupling
  • Double symmetry: left-right and per segment
[Figure: EMG recordings in the axial musculature (Delvolvé et al. 1997)]
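The sketch below illustrates the idea of a chain of oscillators with closest-neighbour coupling, using generic phase oscillators in place of the neural oscillators of the lecture; the coupling weight and phase bias stand in, loosely, for the two parameters a_ij and b_ij per coupling, and all numeric values are illustrative.

```python
import numpy as np

# A chain of coupled phase oscillators as a simplified stand-in for the body CPG.
# Each segment is reduced to a single oscillator; each coupling carries a weight
# and a phase bias (illustrative analogues of a_ij and b_ij).

n_segments = 40
omega = 2.0 * np.pi * 1.0 * np.ones(n_segments)   # intrinsic frequencies (1 Hz)
weight = 4.0                                      # coupling strength
phase_bias = 2.0 * np.pi / n_segments             # desired lag per segment

phases = np.random.uniform(0.0, 2.0 * np.pi, n_segments)
dt = 0.001
for _ in range(20000):
    dphases = omega.copy()
    for i in range(n_segments):
        # closest-neighbour coupling only
        for j in (i - 1, i + 1):
            if 0 <= j < n_segments:
                bias = phase_bias if j < i else -phase_bias
                dphases[i] += weight * np.sin(phases[j] - phases[i] - bias)
    phases += dt * dphases

# After convergence, neighbouring phases differ by roughly phase_bias, i.e. a
# travelling wave propagates down the chain (the regime used for swimming).
segment_outputs = np.cos(phases)
```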
Complete CPG
Generation of a standing wave for walking and of a travelling wave for swimming
Real salamander: from walking to swimming
Real salamander: swimming
Salamander applet
Outcomes
• Simple control signals for controlling the speed, direction, and type of gait (E_body_left, E_body_right, E_limb_left, E_limb_right, and τ)
• Cons:
  • Fewer mathematical tools than other methods
  • Not (yet) a clear design methodology; it is recommended to use learning algorithms

Quadruped robot controlled with a CPG-and-reflex based controller
Kimura Lab, National Univ. of Electro-Communications, Tokyo
[Videos: knee-bending reflex; camera control for obstacle detection and avoidance]
Lecture 3: Learning algorithms
Topics:
1. CPG-and-reflex based control of locomotion (end of lecture 2)
2. Evolutionary algorithms
3. Reinforcement learning

Evolutionary algorithms
There exist different types of learning:
• Evolution
• Supervised learning
• Learning by imitation
• Reinforcement learning
• Unsupervised learning
• …

We will start with an overview of evolutionary algorithms.
GA: algorithm
1. Initial population
• The initial population is normally randomly generated
• In some cases, prior knowledge of the problem can be used to introduce some particular solutions (chromosomes) in the population

GA: encoding
Let's assume we would like to find the maximum of a fitness function f(x,y).

GA: selection
• Parents are chosen depending on their fitness: the higher the fitness, the higher the chance to be chosen
• Different schemes are possible (sketched below):
  • Fitness-based selection: probability directly proportional to the fitness
  • Rank-based selection: probability inversely proportional to the rank (i.e. first, second, …)
  • Tournament selection: pick two potential parents and keep the best (repeat until you have enough parents)
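Minimal sketches of the three selection schemes above, assuming the population is given as a list of (chromosome, fitness) pairs with non-negative fitnesses; function names are illustrative:

```python
import random

def fitness_proportional(population, n_parents):
    """Probability of being picked is directly proportional to fitness."""
    chromosomes = [c for c, f in population]
    fitnesses = [f for c, f in population]
    return random.choices(chromosomes, weights=fitnesses, k=n_parents)

def rank_based(population, n_parents):
    """Probability depends on rank only: best gets the largest weight."""
    ranked = sorted(population, key=lambda cf: cf[1], reverse=True)
    weights = [1.0 / (rank + 1) for rank in range(len(ranked))]   # 1, 1/2, 1/3, ...
    return random.choices([c for c, f in ranked], weights=weights, k=n_parents)

def tournament(population, n_parents):
    """Pick two potential parents at random and keep the better one; repeat."""
    parents = []
    while len(parents) < n_parents:
        a, b = random.sample(population, 2)
        parents.append(a[0] if a[1] >= b[1] else b[0])
    return parents
```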
GA: Crossover operator
Crossover operator: a recombination operator that swaps genetic material between two parent chromosomes.
One-point crossover: both parents are cut at the same randomly chosen point and their tails are swapped to produce two children.

GA: Mutation operator
Mutation operator: each allele in a gene has a probability M of being mutated.
Example: 011101 → 001101 (the second allele is flipped)
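A sketch of both operators on bitstring chromosomes; the mutation probability used in the example call is illustrative:

```python
import random

def one_point_crossover(parent1, parent2):
    """Cut both parents at the same random point and swap their tails."""
    point = random.randint(1, len(parent1) - 1)
    child1 = parent1[:point] + parent2[point:]
    child2 = parent2[:point] + parent1[point:]
    return child1, child2

def mutate(chromosome, m=0.01):
    """Flip each bit independently with probability m."""
    return [1 - allele if random.random() < m else allele for allele in chromosome]

p1 = [0, 1, 1, 1, 0, 1]
p2 = [0, 0, 1, 1, 0, 1]
c1, c2 = one_point_crossover(p1, p2)
c1 = mutate(c1, m=0.05)
```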
GA: Ending criterion
Different possibilities:

Typical run: generation 0
[Figure: initial population scattered over the fitness landscape f(x,y), with x and y up to 5.0]
Typical run: convergence
[Figure: population converged around the maximum of the fitness landscape f(x,y)]

Typical run
[Plots over generations: maximum, average, and minimum fitness; genetic diversity (e.g. sum of standard deviations of gene values within the population)]
GA: applications
In robotics, GAs are used either to optimize parameters in a controller, e.g. a sinus-based controller or a finite-state machine (i.e. a set of if-then rules), or, more commonly, to optimize parameters such as synaptic weights in a neural network.

Lecture 2: sinus controller
  θ_i = θ_i0 + A_i sin(ν_i t + φ_i)

GA applications: Karl Sims' evolved creatures
• GA used to evolve both body shape and controller
• Fitness function: speed of locomotion
• Controller: special type of neural network (with some neurons producing sinusoidal signals)
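A minimal sketch of how the sinus controller could be encoded for a GA: each joint contributes four genes (θ_i0, A_i, ν_i, φ_i) and a chromosome is decoded into joint-angle trajectories; the genome layout and the fitness idea in the comment are assumptions, not the encoding used in the lecture:

```python
import math

def decode(chromosome, n_joints):
    """Chromosome = [theta0_1, A_1, nu_1, phi_1, theta0_2, ...] (assumed layout)."""
    params = []
    for i in range(n_joints):
        theta0, amp, nu, phi = chromosome[4 * i: 4 * i + 4]
        params.append((theta0, amp, nu, phi))
    return params

def joint_angles(params, t):
    """theta_i(t) = theta_i0 + A_i * sin(nu_i * t + phi_i)."""
    return [theta0 + amp * math.sin(nu * t + phi) for theta0, amp, nu, phi in params]

# A GA would evaluate a chromosome by running the robot (or its simulation)
# with these joint angles and using, e.g., the distance covered as the fitness.
```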
Sims, K., "Evolving Virtual Creatures," Computer Graphics (SIGGRAPH '94) Annual Conference Proceedings, July 1994, pp. 43-50.
Urzelai, J., Floreano, D., Dorigo, M., and Colombetti, M., "Incremental Robot Shaping," Connection Science, 10, 341-360, 1998.
ES: encoding
[Figure: fitness landscape f(x,y); genes are real-valued]

ES: Mutation operator
At each generation, a gene x is mutated as follows:
  x(t+1) = x(t) + N_0(σ_x(t))
where N_0(σ) is a Gaussian random number with mean 0 and standard deviation σ.
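A sketch of this mutation rule, assuming one standard deviation per gene and omitting the self-adaptation of σ used in many ES variants; the numbers are illustrative:

```python
import random

def es_mutate(genes, sigmas):
    """x(t+1) = x(t) + N_0(sigma_x(t)), applied gene by gene."""
    return [x + random.gauss(0.0, sigma) for x, sigma in zip(genes, sigmas)]

genes = [1.2, -0.4, 3.0]
sigmas = [0.1, 0.1, 0.5]
offspring = es_mutate(genes, sigmas)
```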
GP: example of encoding
Symbolic (as opposed to parametric) fitting of a function:
• Functions: +, -, *, /, …
• Terminals: variables or numeric values
[Figure: example expression trees built from these functions and terminals (subexpressions such as b*b − 2*2*a*c and 2*a)]

GP: example of crossover operator
[Figure: two parent trees exchange randomly chosen subtrees to produce two children]
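A small sketch of the GP encoding and subtree crossover, representing an expression as a nested Python list [function, left, right]; the function set, the protected division, and the example tree for b*b − 4*a*c are illustrative choices, not the exact example from the slide:

```python
import random
import copy

def evaluate(tree, env):
    """Evaluate an expression tree; terminals are variable names or numbers."""
    if not isinstance(tree, list):
        return env.get(tree, tree) if isinstance(tree, str) else tree
    op, left, right = tree
    a, b = evaluate(left, env), evaluate(right, env)
    if op == '+': return a + b
    if op == '-': return a - b
    if op == '*': return a * b
    return a / b if b != 0 else 1.0          # protected division

def all_paths(tree, path=()):
    """Paths of every node, so a crossover point can be chosen at random."""
    paths = [path]
    if isinstance(tree, list):
        paths += all_paths(tree[1], path + (1,))
        paths += all_paths(tree[2], path + (2,))
    return paths

def crossover(parent1, parent2):
    """Replace a random subtree of parent1 by a random subtree of parent2."""
    child = copy.deepcopy(parent1)
    target = random.choice(all_paths(child))
    donated = copy.deepcopy(parent2)
    for idx in random.choice(all_paths(donated)):
        donated = donated[idx]
    if not target:                            # whole tree replaced
        return donated
    node = child
    for idx in target[:-1]:
        node = node[idx]
    node[target[-1]] = donated
    return child

# Example tree for b*b - 4*a*c
expr = ['-', ['*', 'b', 'b'], ['*', 4, ['*', 'a', 'c']]]
value = evaluate(expr, {'a': 1.0, 'b': 3.0, 'c': 2.0})   # 9 - 8 = 1.0
```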
Evolutionary algorithms: summary
Pros:
Cons:

Topics:
1. CPG-and-reflex based control of locomotion (end of lecture 2)
2. Evolutionary algorithms
3. Reinforcement learning

The next slides are adapted from Sutton and Barto's course.
Key Features of RL
❐ Learner is not told which actions to take (i.e. no supervision), but receives a reward every so often
❐ Possibility of delayed reward: sacrifice short-term gains for greater long-term gains
[Diagram: the agent sends actions to the environment and receives states and rewards in return]
❐ Policy: what to do
❐ Reward: what is good
❐ Value: what is good because it predicts reward
❐ Model: what follows what
[Diagram: the agent contains a policy, reward, value, and model of the environment, and exchanges states, actions, and rewards with the environment]

Agent and environment interact at discrete time steps: t = 0, 1, 2, …
  Agent observes state at step t: s_t ∈ S
  produces action at step t: a_t ∈ A(s_t)
  gets resulting reward: r_{t+1} ∈ ℝ
  and resulting next state: s_{t+1}
Resulting sequence: … s_t, a_t, r_{t+1}, s_{t+1}, a_{t+1}, r_{t+2}, s_{t+2}, a_{t+2}, r_{t+3}, s_{t+3}, a_{t+3}, …
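A generic sketch of this interaction loop, with a toy one-dimensional environment and a random policy standing in for the agent; the state space, actions, and reward are invented for illustration:

```python
import random

def random_policy(state, actions):
    """Placeholder agent: pick an action uniformly at random."""
    return random.choice(actions)

def step(state, action):
    """Toy environment: move left/right on a line, reward +1 for reaching 5."""
    next_state = state + (1 if action == 'right' else -1)
    reward = 1.0 if next_state == 5 else 0.0
    done = next_state in (5, -5)
    return next_state, reward, done

state, trajectory = 0, []
for t in range(100):
    action = random_policy(state, ['left', 'right'])
    next_state, reward, done = step(state, action)
    trajectory.append((state, action, reward, next_state))   # (s_t, a_t, r_{t+1}, s_{t+1})
    state = next_state
    if done:
        break
```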
The Agent Learns a Policy
Policy at step t, π_t: a mapping from states to action probabilities
  π_t(s, a) = probability to take action a_t = a when s_t = s

Policy
Stochastic environment: taking action a in state S leads to the next state S' through a stochastic transition.
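A minimal sketch of a stochastic policy as a table of action probabilities per state; the states, actions, and probabilities are made up for illustration:

```python
import random

# pi_t(s, a): for each state, a probability distribution over actions.
policy = {
    'standing': {'walk': 0.7, 'swim': 0.3},
    'in_water': {'walk': 0.1, 'swim': 0.9},
}

def sample_action(state):
    actions, probs = zip(*policy[state].items())
    return random.choices(actions, weights=probs, k=1)[0]

action = sample_action('in_water')
```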
Returns
Suppose the sequence of rewards after step t is: r_{t+1}, r_{t+2}, r_{t+3}, …
What do we want to maximize?

Episodic tasks: interaction breaks naturally into episodes, e.g., plays of a game, trips through a maze.
  R_t = r_{t+1} + r_{t+2} + … + r_T
where T is a final time step at which a terminal state is reached, ending an episode.

Returns for Continuing Tasks
Continuing tasks: interaction does not have natural episodes, so future rewards are discounted:
  R_t = r_{t+1} + γ r_{t+2} + γ² r_{t+3} + … = Σ_{k=0..∞} γ^k r_{t+k+1}
where γ, 0 ≤ γ ≤ 1, is the discount rate.
  shortsighted 0 ← γ → 1 farsighted
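The two return definitions, computed on a short reward list; the reward values and the truncation of the infinite sum to a finite list are for illustration only:

```python
def episodic_return(rewards):
    """R_t = r_{t+1} + r_{t+2} + ... + r_T for an episode's reward list."""
    return sum(rewards)

def discounted_return(rewards, gamma=0.9):
    """R_t = sum_k gamma^k * r_{t+k+1}; gamma near 0 is shortsighted, near 1 farsighted."""
    return sum((gamma ** k) * r for k, r in enumerate(rewards))

rewards = [0.0, 0.0, 1.0, 0.5]
print(episodic_return(rewards))          # 1.5
print(discounted_return(rewards, 0.9))   # 0.81*1.0 + 0.729*0.5 = 1.1745
```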
Reinforcement learning algorithms
End of lecture 3