5 results found
GUNDAM is a data management system that prioritizes data using language models.
Created
2023-06-05
54 commits to main branch, last one 11 months ago
Graphical tool for data manipulation written in C++/Qt.
Created
2019-01-19
545 commits to master branch, last one 10 months ago
DSIR: a large-scale data selection framework for language model training.
Created
2023-01-30
67 commits to main branch, last one 4 months ago
⏳ Provides filtering, sanitizing, and conversion of Golang data.
Created
2018-09-26
158 commits to master branch, last one 3 months ago
Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".
Created
2024-02-27
20 commits to main branch, last one 29 days ago