Search Results - RepositoryStats

1 result found

119

1.4k

License: apache-2.0

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Created 2023-05-15

111 commits to main branch, last one 9 months ago