rlhf

Here are 10 public repositories matching this topic...

LAION-AI / Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

python machine-learning ai nextjs discord-bot assistant language-model chatgpt rlhf

Updated Mar 9, 2023
Python

voidful / TextRL

Star

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)

nlp reinforcement-learning pytorch nlg language-model gpt-2 gpt-3 controlled-nlg chatgpt rlhf

Updated Mar 5, 2023
Python

opendilab / awesome-RLHF

Star

A curated list of reinforcement learning with human feedback resources (continually updated)

reinforcement-learning deep-learning deep-reinforcement-learning human-feedback rlhf

Updated Feb 28, 2023

xrsrke / instructGOOSE

Star

Implementation of Reinforcement Learning from Human Feedback (RLHF)

reinforcement-learning chatgpt human-feedback rlhf instructgpt

Updated Feb 12, 2023
Jupyter Notebook

tomekkorbak / pretraining-with-human-feedback

Star

Code accompanying the paper Pretraining Language Models with Human Preferences

reinforcement-learning gpt language-models ai-safety ai-alignment pretraining decision-transformers rlhf

Updated Mar 1, 2023
Python

csmile-1006 / PreferenceTransformer

Star

Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)

robotics rl rlhf

Updated Mar 8, 2023
Python

cogment / cogment-verse

Star

Library of Environments, Human Actor UIs and Agent implementation for Human In the Loop Learning & Reinforcement Learning

reinforcement-learning human-in-the-loop-learning cogment rlhf

Updated Mar 9, 2023
Python

arunprsh / ChatGPT-Decoded-GPT2-FAQ-Bot-RLHF-PPO

Star

A Practical Guide to Developing a Reliable FAQ Chatbot with Reinforcement Learning and Human Feedback using GPT-2 on AWS

aws reinforcement-learning chatbot transformers question-answering sagemaker gpt-2 gpt2 rlhf

Updated Feb 11, 2023
Jupyter Notebook

vicgalle / zero-shot-reward-models

Sponsor

Star

Zero-Shot Reward Models with the trlx library

reinforcement-learning zero-shot llm rlhf reward-models trlx

Updated Mar 3, 2023
Python

AmirMotefaker / Create-your-own-ChatGPT

Star

Create your own ChatGPT with Python

python machine-learning ai ml artificial-intelligence llm chatgpt chatgpt-api chatgpt3 rlhf large-language-model

Updated Feb 26, 2023
Jupyter Notebook

Improve this page

Add a description, image, and links to the rlhf topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the rlhf topic, visit your repo's landing page and select "manage topics."

Learn more

Feb	MAR	Apr
	09
2022	2023	2024