OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
-
Updated
Mar 9, 2023 - Python
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
A curated list of reinforcement learning with human feedback resources (continually updated)
Implementation of Reinforcement Learning from Human Feedback (RLHF)
Code accompanying the paper Pretraining Language Models with Human Preferences
Library of Environments, Human Actor UIs and Agent implementation for Human In the Loop Learning & Reinforcement Learning
A Practical Guide to Developing a Reliable FAQ Chatbot with Reinforcement Learning and Human Feedback using GPT-2 on AWS
Zero-Shot Reward Models with the trlx library
Create your own ChatGPT with Python
Add a description, image, and links to the rlhf topic page so that developers can more easily learn about it.
To associate your repository with the rlhf topic, visit your repo's landing page and select "manage topics."