CS391R Presentation Dexterous

This document summarizes a presentation on learning dexterous in-hand manipulation. It discusses (1) the motivation for focusing on dexterity, (2) the difficulties of dexterous manipulation with a 24-degree-of-freedom robot hand, (3) related work on dexterous and in-hand manipulation using various strategies, and (4) the proposed approach: training a policy with reinforcement learning in a simulated environment with domain randomization and then transferring it to a physical robot.

Learning Dexterous In-Hand Manipulation

Presenter: Franke Tang

09-27-2022

CS391R: Robot Learning (Fall 2022) 1


Motivation
Why focus on dexterity?
● Human hands are able to solve a huge array of tasks
● The world is built around human hands
● OpenAI’s goal is to build a general-purpose robot
● Start by interacting with the environment and utilizing dexterous manipulation

CS391R: Robot Learning (Fall 2022) 2


Difficulties with Dexterous Manipulation
OpenAI uses the Shadow Dexterous Hand
- 24 degrees of freedom (a typical robot arm has 7)
- 20 actuators

Real-world hardware
- Hard to simulate accurately

CS391R: Robot Learning (Fall 2022) 3


Related Works
Dexterous Manipulation
● An active area of research for decades, with many strategies:
● rolling, sliding, finger gaiting, pushing, etc.
● However, these approaches required planning and exact models of the hand and object

Dexterous In-Hand Manipulation


● Promising in-hand manipulation results in simulation, but they do not transfer to real-world robots
● Training on physical robots is slow, and learning is limited by the smaller amount of training data

Simulation to Real Transfer


● Domain adaptation methods
● Domain randomization, which makes policies more adaptive
● Adversarial training, which yields more robust policies and can help with transfer
CS391R: Robot Learning (Fall 2022) 4
Proposed Approach
Goal: Using a humanoid hand, reorient a block to a desired orientation

Utilize Reinforcement Learning to train a policy


- Requires a lot of data
- Infeasible with physical robots (very hard to scale)

Instead, train the policy in simulation first, then transfer it to the physical robot (sketched below)
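
As a rough illustration, here is a minimal, runnable Python sketch of that workflow. SimulatedHand and Policy are hypothetical stubs for illustration only; the actual system trains an LSTM policy with PPO across thousands of CPU cores.

```python
# A minimal, runnable sketch of the train-in-simulation-then-transfer idea.
# SimulatedHand and Policy are hypothetical stubs, not the paper's system.
class SimulatedHand:
    """Stands in for the MuJoCo-based simulated hand environment."""
    def reset(self):
        return [0.0]                     # toy observation
    def step(self, action):
        return [action], 1.0, True       # obs, reward, episode done

class Policy:
    def act(self, obs):
        return 0.5                       # toy action
    def update(self, rollout):
        pass                             # stand-in for a PPO gradient step

env, policy = SimulatedHand(), Policy()
for episode in range(3):                 # data collection is cheap in simulation
    obs, done, rollout = env.reset(), False, []
    while not done:
        action = policy.act(obs)
        obs, reward, done = env.step(action)
        rollout.append((obs, action, reward))
    policy.update(rollout)
# The trained policy is then run zero-shot on the physical hand.
```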

CS391R: Robot Learning (Fall 2022) 5


Simulated vs Real
The right image is the simulated environment using the MuJoCo physics engine.

The left image is the physical-world environment. The hand is in the center of the “cage”. The red circles are the RGB cameras used for position and object pose estimation.

The simulated environment is modeled after the physical environment to mimic the experiences the robot should have.

CS391R: Robot Learning (Fall 2022) 6


Reality Gap
A simulated model can never be an exact replica of the physical system.
In general, policies trained in simulation perform poorly in the real world; this is known as the Reality Gap.

Domain Randomization:
- randomize certain aspects of the simulated environment
- train policies over a wide range of environments
- avoid overfitting to one specific environment
- Key things randomized (sketched below):
  - physics randomizations
  - visual randomizations (Unity-rendered images)
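
A minimal, runnable Python sketch of per-episode physics randomization is below; the parameter names and ranges are illustrative assumptions, not the paper's actual values.

```python
import random

# Sample a fresh set of physics parameters for each training episode, so the
# policy never trains against a single fixed environment.
def sample_randomized_physics():
    return {
        "object_mass_scale":   random.uniform(0.5, 1.5),   # scale nominal mass
        "friction_scale":      random.uniform(0.7, 1.3),   # scale surface friction
        "actuator_gain_scale": random.uniform(0.75, 1.5),  # scale motor strength
        "action_delay_steps":  random.randint(0, 3),       # simulate control latency
        "obs_noise_std":       random.uniform(0.0, 0.02),  # observation noise level
    }

for episode in range(3):
    params = sample_randomized_physics()
    print(f"episode {episode}: {params}")
    # env.reset(physics=params)  # hypothetical simulator hook
```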

CS391R: Robot Learning (Fall 2022) 7


Experimental Setup

CS391R: Robot Learning (Fall 2022) 8


The control policy
Policy represented as RNN with memory
● LSTM
● Trained with PPO

Actions and Rewards


● r_t = d_t - d_{t+1}, where d_t is the rotation angle between the current and goal orientations
● +5 when the goal is achieved
● -20 for dropping the object (see the reward sketch below)
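
For concreteness, a minimal Python sketch of this reward is below; the success-threshold value is an illustrative assumption, not the paper's.

```python
# Reward shaped by the decrease in rotation distance to the goal, plus a
# bonus for reaching the goal and a penalty for dropping the object.
def reward(d_t, d_t_plus_1, dropped, success_threshold=0.1):
    r = d_t - d_t_plus_1            # positive when rotating toward the goal
    if d_t_plus_1 < success_threshold:
        r += 5.0                    # goal orientation reached
    if dropped:
        r -= 20.0                   # object fell out of the hand
    return r

print(reward(d_t=0.8, d_t_plus_1=0.6, dropped=False))    # 0.2 (progress)
print(reward(d_t=0.15, d_t_plus_1=0.05, dropped=False))  # 5.1 (goal reached)
print(reward(d_t=0.3, d_t_plus_1=0.3, dropped=True))     # -20.0 (drop)
```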

Internal framework: Rapid


● Used in OpenAI Five
● Train policy and vision models
● Policy is trained on states (not images)
● 8 GPUs were used; the worker pool contained 384 machines, each with 16 CPU cores (6,144 CPU cores in total)
CS391R: Robot Learning (Fall 2022) 9
Vision Model
3 RGB cameras at different angles
- predicts the object's position and orientation
- the pose estimator's predictions are fed to the policy (a sketch follows below)

Training
- trained until 1 million states had been processed
- used 2 GPUs for rendering and 1 GPU for training
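
For concreteness, below is a minimal PyTorch sketch of a pose estimator in this spirit: three RGB views in, position plus orientation quaternion out. The architecture, layer sizes, and shared-backbone design are illustrative assumptions, not the paper's actual network.

```python
import torch
import torch.nn as nn

class PoseEstimator(nn.Module):
    def __init__(self):
        super().__init__()
        # One convolutional backbone shared across all three camera views.
        self.backbone = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=5, stride=2), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=3, stride=2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        self.head = nn.Linear(64 * 3, 7)  # fuse 3 views -> (x, y, z, qw, qx, qy, qz)

    def forward(self, views):  # views: (batch, 3 cameras, 3 channels, H, W)
        feats = [self.backbone(views[:, i]) for i in range(3)]
        out = self.head(torch.cat(feats, dim=1))
        pos, quat = out[:, :3], out[:, 3:]
        # Normalize so the orientation output is a valid unit quaternion.
        return pos, quat / quat.norm(dim=1, keepdim=True)

pos, quat = PoseEstimator()(torch.rand(2, 3, 3, 64, 64))
print(pos.shape, quat.shape)  # torch.Size([2, 3]) torch.Size([2, 4])
```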

CS391R: Robot Learning (Fall 2022) 10


Qualitative Results
The policy naturally exhibits many grasps that humans use, and it discovered many strategies for dexterous in-hand manipulation on its own.

For precision grasps, the policy tends to use the little finger more than the index and middle fingers, likely because the Shadow Hand's little finger has an extra degree of freedom.

CS391R: Robot Learning (Fall 2022) 11


Quantitative Results
Domain randomization is key to the policy performing well

CS391R: Robot Learning (Fall 2022) 12


Quantitative Results cont.
Using memory achieves better performance in randomized simulations

CS391R: Robot Learning (Fall 2022) 13


Limitations
Transfer from simulation to the physical robot is still limited.

Failures still occur:


- Most common: dropping the object when rotating the wrist pitch joint down
- Dropping the object near the beginning of the trial
- Getting stuck because the edge of the object gets caught in a screw hole

Hardware is very expensive.

Multiple GPUs and machines are required, and the Shadow Hand itself is extremely costly.

CS391R: Robot Learning (Fall 2022) 14


Future Work / Extended Readings
Solving Rubik's Cube with a Robot Hand. OpenAI et al. (2019)
- Same people, added on the challenge of solving a Rubik’s Cube
- Utilizes Automatic Domain Randomization

A System for General In-Hand Object Re-Orientation. Tao Chen et al. (2021)
- Best Paper Award for Conference on Robot Learning 2021
- 2000 distinct objects, and pick up objects with hand facing downwards

Dota 2 with Large Scale Deep Reinforcement Learning. OpenAI et al. (2019)
- Learn more about Rapid
- See how a complex game was mastered by an AI

CS391R: Robot Learning (Fall 2022) 15


Summary
This paper trains a robot to manipulate objects with a five-fingered robotic hand. Due to the complexity of the hardware, prior works were unable to both train a policy for such a hand and deploy it in the physical world. This paper utilized reinforcement learning in simulation to train the policy. Domain randomization allowed the authors to generalize the policy and transfer it to the physical world. Using Rapid as the internal training framework for both the policy and the vision model, combining the two allowed the robot to physically manipulate objects with relative success.

CS391R: Robot Learning (Fall 2022) 16
