CS391R Presentation Dexterous
CS391R Presentation Dexterous
09-27-2022
Real-world Hardware
- Hard to simulate
Instead let try to simulate it first, then later transfer it to the physical robot
Domain Randomization:
- randomize certain aspects of a simulated environment
- train policies over wide range of environment
- avoids overfitting on one specific environment
- Key things randomized:
- physics randomizations
- visual randomizations (Unity rendered images)
Training
- trained policy until 1 million states
- Use 2 GPUs for rendering, 1 GPU for training
For precision grasps, the policy tends to use the little finger more than the index and middle
finger. Likely due to more DoF.
A System for General In-Hand Object Re-Orientation. Tao Chen et. al (2021)
- Best Paper Award for Conference on Robot Learning 2021
- 2000 distinct objects, and pick up objects with hand facing downwards
Dota 2 with Large Scale Deep Reinforcement Learning. OpenAI et. al (2019)
- Learn more about Rapid
- See how a complex game was broken by an AI