13 results found Sort:

58
550
apache-2.0
12
[EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models
Created 2021-12-14
92 commits to main branch, last one 2 years ago
Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, backup, re-embed (using any model) or access your vector data fro...
Created 2023-05-04
498 commits to main branch, last one about a month ago
Pytorch-like dataloaders in JAX.
Created 2023-01-12
143 commits to main branch, last one 2 months ago
Translate large dataset to any language with google translation api and multithreads processing, no key required!
Created 2023-10-27
134 commits to main branch, last one 2 months ago
中文医学多模态大模型 Large Chinese Language-and-Vision Assistant for BioMedicine
Created 2024-05-08
19 commits to master branch, last one 7 months ago
Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in dataset using pandas
Created 2023-06-16
12 commits to main branch, last one about a year ago
Automate Fashion Image Captioning using BLIP-2. Automatic generating descriptions of clothes on shopping websites, which can help customers without fashion knowledge to better understand the features ...
Created 2023-05-23
16 commits to master branch, last one about a year ago
huggingface-go : 加速下载 huggingface 的模型和数据集
Created 2023-10-21
15 commits to main branch, last one 7 months ago
:hugs: AeroPath: An airway segmentation benchmark dataset with challenging pathology
Created 2023-10-03
135 commits to main branch, last one 6 months ago
NLP model that predicts subreddit based on the title of a post
Created 2022-09-14
18 commits to main branch, last one about a year ago
0
25
apache-2.0
4
A collection of Italian benchmarks for LLM evaluation
Created 2024-09-19
34 commits to main branch, last one 7 days ago