OpenAssistant Roadmap
Timeline: ASAP, Q1 2023, Q2 2023, …
○ Can use pseudo-data (e.g. from a QA dataset) until we have the real data
○ Training an "instruction detector" would let us, e.g., filter Twitter for good data
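
A minimal sketch of the "instruction detector" idea, trained on pseudo-data. The roadmap does not say which model to use, so the naive Bayes bag-of-words classifier here is an assumption chosen for brevity. The class name `InstructionDetector`, the helper `filter_instructions`, and all example sentences are hypothetical; the positive examples stand in for questions taken from a QA dataset.

```python
import math
from collections import Counter


def tokenize(text):
    # Lowercase whitespace tokenization; a real system would use a proper tokenizer.
    return text.lower().split()


class InstructionDetector:
    """Hypothetical detector: multinomial naive Bayes with add-one smoothing."""

    def __init__(self):
        self.word_counts = {True: Counter(), False: Counter()}
        self.doc_counts = {True: 0, False: 0}
        self.vocab = set()

    def fit(self, texts, labels):
        for text, label in zip(texts, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.doc_counts[label] += 1
            self.vocab.update(tokens)
        return self

    def log_score(self, text, label):
        # log P(label) + sum of log P(token | label); unseen tokens are skipped.
        total_docs = sum(self.doc_counts.values())
        score = math.log(self.doc_counts[label] / total_docs)
        total_words = sum(self.word_counts[label].values())
        denom = total_words + len(self.vocab)
        for tok in tokenize(text):
            if tok in self.vocab:
                score += math.log((self.word_counts[label][tok] + 1) / denom)
        return score

    def is_instruction(self, text):
        return self.log_score(text, True) > self.log_score(text, False)


def filter_instructions(detector, candidates):
    # Keep only the texts the detector classifies as instructions.
    return [text for text in candidates if detector.is_instruction(text)]


# Pseudo-data: positives mimic QA-dataset questions, negatives mimic ordinary posts.
pseudo_instructions = [
    "how do i sort a list in python",
    "explain what a neural network is",
    "write a poem about the sea",
    "what is the capital of france",
    "summarize this article for me",
]
ordinary_posts = [
    "just had a great coffee this morning",
    "the weather is so nice today",
    "my team won the game last night",
    "i love this new album",
    "traffic was terrible on the way home",
]

detector = InstructionDetector().fit(
    pseudo_instructions + ordinary_posts,
    [True] * len(pseudo_instructions) + [False] * len(ordinary_posts),
)

if __name__ == "__main__":
    stream = ["what is a list in python", "my team had a terrible night"]
    print(filter_instructions(detector, stream))
```

Once real data is available, the pseudo-data lists can be swapped for it without changing the filtering step. A production detector would more likely be a fine-tuned language model than a bag-of-words classifier.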
2) Training a Reward Model & RLHF