OpenAssistant Roadmap

The document provides a vision and roadmap for OpenAssistant, an open-source conversational chat-based assistant. OpenAssistant aims to be personalized, integrate with third-party systems, and retrieve information dynamically through search engines. The roadmap outlines releasing a minimum viable prototype in January 2023, then expanding capabilities like retrieval augmentation and rapid personalization in subsequent quarters. Getting to the MVP involves collecting human demonstrations, fine-tuning models on the data, and reinforcement learning with a reward model trained on additional human feedback. Main efforts include data collection, instruction dataset gathering, and model training experiments.


OpenAssistant

Vision & Roadmap


OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and can retrieve information dynamically to do so.

It can be extended and personalized easily and is developed as free, open-source software.
Your Conversational Assistant
State-of-the-art chat assistant that can be personalized to your needs.

Retrieval via Search Engines
External, upgradeable knowledge: no need for billions of parameters.

OpenAssistant unifies all knowledge work in one place.

Interfacing w/ external systems
Usage of APIs and third-party applications, described via language & demonstrations.

A building block for developers
Integrate OpenAssistant into your application.

● Uses modern deep learning
● Runs on consumer hardware
● Trains on human feedback
● Free and open
Our Vision

We want OpenAssistant to be the single, unifying platform that all other systems use to interface with humans.
Our Roadmap

ASAP: Minimum Viable Prototype
● Data Collection Pipeline
● RL on Human Feedback
● Assistant v1 usable
● Out January 2023!

Q1 2023: Growing Up
● Retrieval Augmentation
● Rapid Personalization
● Using External Tools

Q2 2023: Growing Out
● Third-Party Extensions
● Device Control
● Multi-Modality

…: How did we get here?
● What do you need?

Getting to MVP
We follow InstructGPT's three-step recipe.
[Figure: InstructGPT training pipeline. Source: InstructGPT]
1) Supervised Fine-Tuning on Human Demonstrations

● We need to collect (human) demonstrations of assistant interactions
  ○ Read our Data Structures Overview to see how
  ○ We estimate about 50k* demonstrations
    (* InstructGPT has 13k, 33k, and 31k samples for the three steps, respectively)
● Fine-tune a base model on the collected data
  ○ Candidates: GPT-J, CodeGen (surprisingly promising), FlanT5, GPT-JT
  ○ Can use pseudo-data (e.g. from a QA dataset) before we have the real data
● Additionally, collect instruction datasets
  ○ Quora, StackOverflow, appropriate subreddits, …
  ○ Training an "instruction detector" would allow us to e.g. filter Twitter for good data
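The pseudo-data idea above can be sketched in a few lines: wrap existing (question, answer) pairs as assistant demonstrations and serialize them for fine-tuning. This is an illustrative sketch only; the `Demonstration` class and the `<prompter>`/`<assistant>` markers are assumptions for the example, not the project's actual data structures (see the Data Structures Overview for those).

```python
# Sketch: turning an existing QA dataset into pseudo-demonstrations for
# supervised fine-tuning. The sample format and marker tokens below are
# illustrative assumptions, not the project's real schema.

from dataclasses import dataclass

@dataclass
class Demonstration:
    prompt: str    # what the user asked
    response: str  # what the assistant should reply

def qa_to_demonstrations(qa_pairs):
    """Wrap raw (question, answer) pairs as assistant demonstrations."""
    demos = []
    for question, answer in qa_pairs:
        question, answer = question.strip(), answer.strip()
        if not question or not answer:
            continue  # skip empty or malformed pairs
        demos.append(Demonstration(prompt=question, response=answer))
    return demos

def to_training_text(demo):
    """Serialize one demonstration into a single fine-tuning string."""
    return f"<prompter>{demo.prompt}</prompter><assistant>{demo.response}</assistant>"

qa_pairs = [
    ("What is the capital of France?", "Paris is the capital of France."),
    ("", "orphan answer"),  # malformed: filtered out
]
demos = qa_to_demonstrations(qa_pairs)
texts = [to_training_text(d) for d in demos]
```

The same filtering step is where an "instruction detector" could later plug in, scoring scraped candidates before they enter the training set.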
2) Training a Reward Model & RLHF

● We need to collect rankings of interactions
  ○ Again, read our Data Structures Overview to see how
● Reward model training could also use Active Learning
  ○ Keeps humans in the loop
  ○ Drastically decreases the amount of data needed
● Reinforcement Learning against the reward model
  ○ Follow InstructGPT and use PPO
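Training the reward model on collected rankings typically means a pairwise objective: for each comparison, the chosen response should score above the rejected one. A minimal numeric sketch of that InstructGPT-style loss, -log(sigmoid(r_chosen - r_rejected)), in plain Python (the rewards here are stand-in floats, not outputs of the project's actual model):

```python
# Sketch of the pairwise ranking loss used to train a reward model on
# human preference data. Real training would backpropagate this loss
# through a neural reward model; plain floats stand in for its outputs.

import math

def pairwise_loss(reward_chosen, reward_rejected):
    """-log(sigmoid(r_chosen - r_rejected)): small when the chosen
    response is scored clearly above the rejected one."""
    margin = reward_chosen - reward_rejected
    return -math.log(1.0 / (1.0 + math.exp(-margin)))

def batch_loss(pairs):
    """Average pairwise loss over a batch of (chosen, rejected) rewards."""
    return sum(pairwise_loss(c, r) for c, r in pairs) / len(pairs)

good = pairwise_loss(2.0, -2.0)  # correct ordering, wide margin: small loss
bad = pairwise_loss(-2.0, 2.0)   # inverted ordering: large loss
```

Active learning fits naturally here: the pairs where the current model's margin is near zero are exactly the ones most worth sending to human rankers.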
Main Efforts

● Data Collection Code → Backend, website, and Discord bot to collect data
● Instruction Dataset Gathering → Scraping & cleaning web data
● Gamification → Leaderboards & more, to make data collection more fun
● Model Training → Experiments on pseudo- and real data
● Infrastructure → Collection, training, and inference
● Data Collection → This is the bulk of the work
● Data Augmentation → Making more data from little data
● Privacy and Safety → Protecting sensitive data
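The Privacy and Safety effort will need, among other things, scrubbing of obvious personal data from collected text. A minimal regex-based sketch (the patterns, placeholder tags, and policy here are illustrative assumptions, not the project's actual pipeline, which would need far more robust detection and human review):

```python
# Sketch: scrubbing obvious PII from collected text before it enters a
# dataset. These regexes catch only clear-cut emails and phone-like
# numbers; a real privacy pipeline would go well beyond this.

import re

EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")
PHONE_RE = re.compile(r"\+?\d[\d\s()-]{7,}\d")

def scrub(text):
    """Replace obvious emails and phone numbers with placeholder tags."""
    text = EMAIL_RE.sub("<EMAIL>", text)
    text = PHONE_RE.sub("<PHONE>", text)
    return text

cleaned = scrub("Mail me at jane.doe@example.com or call +1 555 123 4567.")
```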


Principles
● We put the human in the center
● We need to get the MVP out fast, while we still have momentum
● We pull in one direction
● We are pragmatic
● We aim for models that can (or could, with some effort) be run on consumer hardware
● We rapidly validate our ML experiments on a small scale before going to a supercluster
Where to go from here?

● Read the Data Structures Documentation
● Come to our repository and grab an issue
● Join the LAION Discord and the YK Discord
