Trending repositories for topic datasets
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Techniques for deep learning with satellite & aerial imagery
AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.
A list of awesome papers and resources on recommender systems built on large language models (LLMs).
Langtrace 🔍 is an open-source, OpenTelemetry-based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorD...
csghub-server is the backend server for CSGHub, which helps users manage datasets and models and run Model Inference, Finetune, and Application Spaces.
FL Chart is a highly customizable Flutter chart library that supports Line Chart, Bar Chart, Pie Chart, Scatter Chart, and Radar Chart.
CSGHub is an open-source large model platform, much like an on-premise version of Hugging Face. You can easily manage models and datasets, deploy model applications, and set up model finetune or inference j...
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio app...
OSINT cheat sheet: a list of OSINT tools, wikis, datasets, articles, books, and OSINT tips
[TMLR] A curated list of language modeling research for code and related datasets.
Securely share and store AI/ML projects as OCI artifacts in your container registry.
Therapeutics Commons (TDC-2): Multimodal Foundation for Therapeutic Science
A curated list of amazingly awesome Cybersecurity datasets
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning
You can find links to data acquisition websites.
Awesome Chinese LLM: A curated list of Chinese large language models, collecting Chinese LLM datasets and model resources
A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities
This repository is a collection of public 3D LiDAR datasets
Download and preprocess popular sequential recommendation datasets
Croissant is a high-level format for machine learning datasets that brings together four rich layers.
Datasets for deep learning with satellite & aerial imagery
[CVPR 2023] The official implementation of the CVPR 2023 paper "Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes"
🌳 A curated list of ground-truth forest datasets for the machine learning and forestry community.
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop....
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
[ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback
This repository is a collection of existing KGQA datasets in the format of the 🤗 Hugging Face datasets library (https://github.com/huggingface/datasets), aiming to provide easy-to-use access to them.
A collection of some awesome public object detection and recognition datasets.
WildlifeDatasets: An open-source toolkit for animal re-identification
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
A large collection of system log datasets for AI-driven log analytics [ISSRE'23]
Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and Benchmarks)
A curated list of publicly available datasets for machine learning research in manufacturing
Official implementation of "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM"
👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and...
This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"
A curated list of Place Recognition methods, datasets, and various algorithms for LiDAR
WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support ...
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
A repository of datasets paired with rich documentation, data essays, and teaching resources
🎉🎨 Papers, Code, Datasets for Neuroscience and Cognitive Science
An open source multi-tool for exploring and publishing data
A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL
Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.
🦄 Unitxt: a python library for getting data fired up and set for training and evaluation
📊 Adana - 1-click analytical dashboard for OSINT researchers
Multilingual Large Language Models Evaluation Benchmark
"Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases" by Jiarui Li and Ye Yuan and Zehua Zhang
Resources about solar power systems for data science