Statistics for topic datasets
RepositoryStats tracks 584,797 Github repositories, of these 356 are tagged with the datasets topic. The most common primary language for repositories using this topic is Python (150). Other languages include: Jupyter Notebook (39)
Stargazers over time for topic datasets
Most starred repositories for topic datasets (view more)
Trending repositories for topic datasets (view more)
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Techniques for deep learning with satellite & aerial imagery
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorD...
csghub-server is the backend server for CSGHub which helps user to manage datasets, modes, and also run Model Inference, Finetune and Application Spaces.
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio app...
Securely share and store AI/ML projects as OCI artifacts in your container registry.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
csghub-server is the backend server for CSGHub which helps user to manage datasets, modes, and also run Model Inference, Finetune and Application Spaces.
Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorD...
[ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback
csghub-server is the backend server for CSGHub which helps user to manage datasets, modes, and also run Model Inference, Finetune and Application Spaces.
Securely share and store AI/ML projects as OCI artifacts in your container registry.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Securely share and store AI/ML projects as OCI artifacts in your container registry.
csghub-server is the backend server for CSGHub which helps user to manage datasets, modes, and also run Model Inference, Finetune and Application Spaces.
Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorD...
[ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback
CSGHub is an open-source large model platform just like on-premise version of Hugging Face. You can easily manage models and datasets, deploy model applications and setup model finetune or inference j...
👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and...
Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorD...
csghub-server is the backend server for CSGHub which helps user to manage datasets, modes, and also run Model Inference, Finetune and Application Spaces.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
CSGHub is an open-source large model platform just like on-premise version of Hugging Face. You can easily manage models and datasets, deploy model applications and setup model finetune or inference j...
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Securely share and store AI/ML projects as OCI artifacts in your container registry.
Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorD...
Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.