Poster 2
Poster 2
Method
(1) Sample and filter rationales with powerful LLMs ❖ Takeaway from inference results:
✔ ✗
❖ In the future, we want to:
➢ Train a bigger stronger world model
( perplexity = 10 ) ( perplexity = 1000 ) ➢ Test on more reasoning and planning tasks
➢ Conduct more analysis on the effect of world model
<BOT>We multiply the rate <BOT>We divide the amount
per minute by the number of of money by 60 and multiple
minutes she worked<EOT> by the number of minute she
worked<EOT>
Just kidding, the QR code is actually our Overleaf draft :)