4 results found
A Doctor for your data
Created: 2023-05-02
33 commits to master branch, last one 2 months ago
A curated, but incomplete, list of data-centric AI resources.
Created: 2023-03-07
69 commits to main branch, last one 8 months ago
PyTorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
Created: 2023-05-25
64 commits to main branch, last one about a year ago
A list of data-efficient and data-centric LLM (Large Language Model) papers. Our Survey Paper: Towards Efficient LLM Post Training: A Data-centric Perspective
Created: 2025-02-19
4 commits to main branch, last one about a month ago