4 results found
[ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
Created
2024-10-09
17 commits to main branch, last one about a month ago
Dataset collection and preprocessing framework for NLP extreme multitask learning
nlp
glue
scaling
bigbench
crossfit
benchmark
extreme-mtl
huggingface
meta-learning
discriminative
preprocessings
reward-modeling
curated-datasets
dataset-collection
instruction-tuning
multi-task-learning
text-classification
natural-language-inference
extreme-multi-task-learning
multi-task-learning-scaling
Created
2022-12-06
220 commits to main branch, last one 2 months ago
Efficient LLM inference on Slurm clusters using vLLM.
Created
2024-03-06
474 commits to main branch, last one 15 days ago
Official implementation of the ICLR 2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives
Created
2024-09-18
45 commits to main branch, last one 2 days ago