2 results found Sort:
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
Created
2021-03-18
70 commits to main branch, last one 6 months ago
Code accompanying our papers on the "Generative Distributional Control" framework
Created
2021-03-05
12 commits to master branch, last one about a year ago