11 results found

58
537
apache-2.0
11
[EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models
Created 2021-12-14
92 commits to main branch, last one about a year ago
Use the universal VDF format for vector datasets to easily export and import data from all vector databases
Created 2023-05-04
486 commits to main branch, last one 15 days ago
PyTorch-like dataloaders in JAX.
Created 2023-01-12
127 commits to main branch, last one 3 months ago
Retrieves Parquet files from Hugging Face, then identifies and quantifies junk data, duplication, contamination, and biased content in a dataset using pandas
Created 2023-06-16
12 commits to main branch, last one 11 months ago
Translate large datasets into any language with the Google translation API and multithreaded processing, no key required!
Created 2023-10-27
81 commits to main branch, last one 2 months ago
Automate fashion image captioning using BLIP-2. Automatically generates descriptions of clothes on shopping websites, which can help customers without fashion knowledge better understand the features ...
Created 2023-05-23
16 commits to master branch, last one 10 months ago
NLP model that predicts the subreddit from the title of a post
Created 2022-09-14
18 commits to main branch, last one about a year ago
huggingface-go: speeds up downloading of Hugging Face models and datasets
Created 2023-10-21
15 commits to main branch, last one 2 months ago
🫁 AeroPath: An airway segmentation benchmark dataset with challenging pathology
Created 2023-10-03
135 commits to main branch, last one 5 days ago