2 results found Sort:

59
555
mit
10
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
Created 2021-03-18
70 commits to main branch, last one 10 months ago
20
118
other
9
Code accompanying our papers on the "Generative Distributional Control" framework
Created 2021-03-05
12 commits to master branch, last one 2 years ago