4 results found Sort:

223
2.9k
other
131
A Doctor for your data
Created 2023-05-02
33 commits to master branch, last one 2 months ago
A curated, but incomplete, list of data-centric AI resources.
Created 2023-03-07
69 commits to main branch, last one 8 months ago
Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
Created 2023-05-25
64 commits to main branch, last one about a year ago
A list of data-efficient and data-centric LLM (Large Language Model) papers. Our Survey Paper: Towards Efficient LLM Post Training: A Data-centric Perspective
Created 2025-02-19
4 commits to main branch, last one about a month ago