14 results found

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Created 2022-12-09
128 commits to main branch, last one 11 months ago
216 · 3.6k · apache-2.0 · 62
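
Most of the RLHF projects in these results share the same core ingredient: a reward model trained on pairwise human preferences, which the policy is then optimized against (typically with PPO). Below is a minimal PyTorch sketch of that preference-modeling step; the model, tensor names, and shapes are illustrative assumptions, not code from any repository listed here.

```python
# Minimal sketch of reward-model training on pairwise human preferences,
# the step shared by most RLHF pipelines in these results. All names and
# shapes are illustrative assumptions, not taken from any listed repository.
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyRewardModel(nn.Module):
    """Scores an (already encoded) response with a single scalar reward."""
    def __init__(self, hidden_dim: int = 64):
        super().__init__()
        self.scorer = nn.Sequential(
            nn.Linear(hidden_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, 1),
        )

    def forward(self, response_embedding: torch.Tensor) -> torch.Tensor:
        return self.scorer(response_embedding).squeeze(-1)  # (batch,)

def preference_loss(reward_chosen: torch.Tensor, reward_rejected: torch.Tensor) -> torch.Tensor:
    # Bradley-Terry style objective: the human-preferred response should
    # receive a higher reward than the rejected one.
    return -F.logsigmoid(reward_chosen - reward_rejected).mean()

# Toy training step on random "embeddings" standing in for encoded responses.
model = TinyRewardModel()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
chosen, rejected = torch.randn(8, 64), torch.randn(8, 64)
optimizer.zero_grad()
loss = preference_loss(model(chosen), model(rejected))
loss.backward()
optimizer.step()
```
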
A curated list of reinforcement learning with human feedback resources (continually updated)
Created 2023-02-13
70 commits to main branch, last one 16 days ago
Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.
This repository has been archived.
Created 2022-06-21
88 commits to main branch, last one 10 months ago
Let's build better datasets, together!
Created 2024-03-11
139 commits to main branch, last one a day ago
18 · 179 · mit · 8
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
Created 2023-11-23
37 commits to main branch, last one 8 months ago
23 · 172 · unknown · 3
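
The paper above fine-tunes a diffusion model on human preferences without training a separate reward model. Methods in this family build on a DPO-style pairwise objective; the sketch below shows that generic objective under assumed names (policy_logp_*, ref_logp_*, beta), and is not the paper's exact formulation for diffusion models.

```python
# Generic DPO-style pairwise preference loss: fine-tune a policy directly on
# preference pairs without a separate reward model. Simplified illustration
# with assumed names; not the exact objective of the paper above.
import torch
import torch.nn.functional as F

def dpo_loss(
    policy_logp_chosen: torch.Tensor,    # log-prob of the preferred sample under the policy
    policy_logp_rejected: torch.Tensor,  # log-prob of the rejected sample under the policy
    ref_logp_chosen: torch.Tensor,       # same quantities under a frozen reference model
    ref_logp_rejected: torch.Tensor,
    beta: float = 0.1,                   # strength of the implicit KL regularizer
) -> torch.Tensor:
    # Log-ratio of policy vs. reference for each side of the preference pair.
    chosen_ratio = policy_logp_chosen - ref_logp_chosen
    rejected_ratio = policy_logp_rejected - ref_logp_rejected
    # Push the preferred sample's ratio above the rejected one's.
    return -F.logsigmoid(beta * (chosen_ratio - rejected_ratio)).mean()

# Toy usage with random log-probabilities standing in for real model outputs.
lp_c, lp_r = torch.randn(4), torch.randn(4)
ref_c, ref_r = torch.randn(4), torch.randn(4)
print(dpo_loss(lp_c, lp_r, ref_c, ref_r))
```
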
The ParroT framework enhances and regulates translation abilities during chat, built on open-source LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human-written translation and evaluation data.
Created 2023-03-22
177 commits to master branch, last one about a year ago
Implementation of Reinforcement Learning from Human Feedback (RLHF)
Created 2022-12-28
76 commits to main branch, last one about a year ago
25 · 137 · apache-2.0 · 6
Product analytics for AI Assistants
Created 2022-01-19
939 commits to main branch, last one 7 months ago
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
Created 2023-06-14
3 commits to main branch, last one about a year ago
The Prism Alignment Project
Created 2024-03-06
12 commits to main branch, last one 8 months ago
Dataset Viber is your chill repo for data collection, annotation and vibe checks.
Created 2024-08-07
181 commits to main branch, last one 3 months ago
[ECCV 2024] Towards Reliable Advertising Image Generation Using Human Feedback
Created 2024-07-04
42 commits to main branch, last one about a month ago
[ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"
Created 2024-05-19
6 commits to main branch, last one 5 months ago
Code for the paper "Aligning LLM Agents by Learning Latent Preference from User Edits".
Created 2024-04-25
15 commits to main branch, last one 28 days ago