2 results found Sort:

60
543
mit
11
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
Created 2021-03-18
70 commits to main branch, last one 6 months ago
21
117
other
10
Code accompanying our papers on the "Generative Distributional Control" framework
Created 2021-03-05
12 commits to master branch, last one about a year ago