Trending repositories for topic datasets
CSGHub is an opensource large model assets platform just like on-premise huggingface which helps to manage datasets, model files, codes and more. CSGHub是一个开源、可信的大模型资产管理平台,可帮助用户治理LLM和LLM应用生命周期中涉及到的资产(数...
CSGHub Server is the backend server for CSGHub which helps user to manage datasets, model files, codes and more. CSGHub Server是开源大模型资产管理平台CSGHub的服务端部分的开源项目,提供基于REST API的模型和数据集等大模型资产管理功能。欢迎关注反馈和Star⭐️
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Techniques for deep learning with satellite & aerial imagery
A curated list of language modeling researches for code and related datasets.
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
A list of awesome papers and resources of recommender system on large language model (LLM).
A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop....
A list of publicly available datasets with real-time data maintained by the team at bytewax.io
Datasets for deep learning with satellite & aerial imagery
FL Chart is a highly customizable Flutter chart library that supports Line Chart, Bar Chart, Pie Chart, Scatter Chart, and Radar Chart.
CSGHub Server is the backend server for CSGHub which helps user to manage datasets, model files, codes and more. CSGHub Server是开源大模型资产管理平台CSGHub的服务端部分的开源项目,提供基于REST API的模型和数据集等大模型资产管理功能。欢迎关注反馈和Star⭐️
Multilingual Large Language Models Evaluation Benchmark
A curated list of Place Recognition methods, datasets, and various algorithms for LiDAR
WildlifeDatasets: An open-source toolkit for animal re-identification
A curated list of language modeling researches for code and related datasets.
A list of datasets, tools, papers and code related to Deepfakes.
CSGHub is an opensource large model assets platform just like on-premise huggingface which helps to manage datasets, model files, codes and more. CSGHub是一个开源、可信的大模型资产管理平台,可帮助用户治理LLM和LLM应用生命周期中涉及到的资产(数...
Final Year Malware Detection Project with PPT, Research Paper, code and Synopsis. Malware detection project by Machine Learning ALgorithms.
A curated list of peer-reviewed papers on theoretical and practical aspects of drivers' attention used for paper "Attention for Vision-Based Assistive and Automated Driving: A Review of Algorithms and...
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio app...
A list of awesome papers and resources of recommender system on large language model (LLM).
A list of publicly available datasets with real-time data maintained by the team at bytewax.io
Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorD...
The largest public catalogue for Arabic NLP and speech datasets. There are +500 datasets annotated with more than 25 attributes.
Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.
Datasets for deep learning with satellite & aerial imagery
An open source multi-tool for exploring and publishing data
Datasets & Analyses for Formula 1 World Championship
An open source multi-tool for exploring and publishing data
CSGHub is an opensource large model assets platform just like on-premise huggingface which helps to manage datasets, model files, codes and more. CSGHub是一个开源、可信的大模型资产管理平台,可帮助用户治理LLM和LLM应用生命周期中涉及到的资产(数...
Label Studio is a multi-type data labeling and annotation tool with standardized output format
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
CSGHub Server is the backend server for CSGHub which helps user to manage datasets, model files, codes and more. CSGHub Server是开源大模型资产管理平台CSGHub的服务端部分的开源项目,提供基于REST API的模型和数据集等大模型资产管理功能。欢迎关注反馈和Star⭐️
A curated list of language modeling researches for code and related datasets.
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Techniques for deep learning with satellite & aerial imagery
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
A list of awesome papers and resources of recommender system on large language model (LLM).
A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop....
FL Chart is a highly customizable Flutter chart library that supports Line Chart, Bar Chart, Pie Chart, Scatter Chart, and Radar Chart.
CSGHub Server is the backend server for CSGHub which helps user to manage datasets, model files, codes and more. CSGHub Server是开源大模型资产管理平台CSGHub的服务端部分的开源项目,提供基于REST API的模型和数据集等大模型资产管理功能。欢迎关注反馈和Star⭐️
A comprehensive survey of datasets for research in host-based and/or network-based intrusion detection, with a focus on enterprise networks
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
A list of datasets, tools, papers and code related to Deepfakes.
A curated list of Place Recognition methods, datasets, and various algorithms for LiDAR
Multilingual Large Language Models Evaluation Benchmark
A Natural Portuguese Language Benchmark (Napolab) for the evaluation of language models.
CSGHub is an opensource large model assets platform just like on-premise huggingface which helps to manage datasets, model files, codes and more. CSGHub是一个开源、可信的大模型资产管理平台,可帮助用户治理LLM和LLM应用生命周期中涉及到的资产(数...
Final Year Malware Detection Project with PPT, Research Paper, code and Synopsis. Malware detection project by Machine Learning ALgorithms.
A curated list of language modeling researches for code and related datasets.
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.
A repository for surgical action triplet dataset. Data are videos of laparoscopic cholecystectomy that have been annotated with <instrument, verb, target> labels for every surgical fine-grained activi...
AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio app...
Data Engineering Pilipinas is a community for data engineers, data analysts, data scientists, developers, AI / ML engineers, and users of closed and open source data tools and methods / techniques in ...
A curated list of peer-reviewed papers on theoretical and practical aspects of drivers' attention used for paper "Attention for Vision-Based Assistive and Automated Driving: A Review of Algorithms and...
CSGHub is an opensource large model assets platform just like on-premise huggingface which helps to manage datasets, model files, codes and more. CSGHub是一个开源、可信的大模型资产管理平台,可帮助用户治理LLM和LLM应用生命周期中涉及到的资产(数...
Label Studio is a multi-type data labeling and annotation tool with standardized output format
CSGHub Server is the backend server for CSGHub which helps user to manage datasets, model files, codes and more. CSGHub Server是开源大模型资产管理平台CSGHub的服务端部分的开源项目,提供基于REST API的模型和数据集等大模型资产管理功能。欢迎关注反馈和Star⭐️
An open source multi-tool for exploring and publishing data
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
A curated list of language modeling researches for code and related datasets.
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Techniques for deep learning with satellite & aerial imagery
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
A list of awesome papers and resources of recommender system on large language model (LLM).
A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop....
FL Chart is a highly customizable Flutter chart library that supports Line Chart, Bar Chart, Pie Chart, Scatter Chart, and Radar Chart.
CSGHub Server is the backend server for CSGHub which helps user to manage datasets, model files, codes and more. CSGHub Server是开源大模型资产管理平台CSGHub的服务端部分的开源项目,提供基于REST API的模型和数据集等大模型资产管理功能。欢迎关注反馈和Star⭐️
A comprehensive survey of datasets for research in host-based and/or network-based intrusion detection, with a focus on enterprise networks
A curated list of Place Recognition methods, datasets, and various algorithms for LiDAR
Multiple datasets for ARC (Abstraction and Reasoning Corpus)
Official implementation of "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM"
CSGHub is an opensource large model assets platform just like on-premise huggingface which helps to manage datasets, model files, codes and more. CSGHub是一个开源、可信的大模型资产管理平台,可帮助用户治理LLM和LLM应用生命周期中涉及到的资产(数...
A Natural Portuguese Language Benchmark (Napolab) for the evaluation of language models.
Multilingual Large Language Models Evaluation Benchmark
A curated list of language modeling researches for code and related datasets.
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorD...
Resources about solar power systems for data science
WildlifeDatasets: An open-source toolkit for animal re-identification
A list of datasets, tools, papers and code related to Deepfakes.
CSGHub is an opensource large model assets platform just like on-premise huggingface which helps to manage datasets, model files, codes and more. CSGHub是一个开源、可信的大模型资产管理平台,可帮助用户治理LLM和LLM应用生命周期中涉及到的资产(数...
A curated list of language modeling researches for code and related datasets.
Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorD...
CSGHub Server is the backend server for CSGHub which helps user to manage datasets, model files, codes and more. CSGHub Server是开源大模型资产管理平台CSGHub的服务端部分的开源项目,提供基于REST API的模型和数据集等大模型资产管理功能。欢迎关注反馈和Star⭐️
This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"
A curated list of Place Recognition methods, datasets, and various algorithms for LiDAR
Dataset and package for working with protein-protein interactions in 3D
WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support ...
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
Official implementation of "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM"
🎉🎨 Papers, Code, Datasets for Neuroscience and Cognition Science
A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
CSGHub is an opensource large model assets platform just like on-premise huggingface which helps to manage datasets, model files, codes and more. CSGHub是一个开源、可信的大模型资产管理平台,可帮助用户治理LLM和LLM应用生命周期中涉及到的资产(数...
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Techniques for deep learning with satellite & aerial imagery
A curated list of language modeling researches for code and related datasets.
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop....
An open source multi-tool for exploring and publishing data
A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL
A list of awesome papers and resources of recommender system on large language model (LLM).
ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support ...
FL Chart is a highly customizable Flutter chart library that supports Line Chart, Bar Chart, Pie Chart, Scatter Chart, and Radar Chart.
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Tools for easing the handoff between AI/ML and App/SRE teams.
Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorD...
A curated list of language modeling researches for code and related datasets.
ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support ...
Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.
CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included...
Multilingual Large Language Models Evaluation Benchmark
Croissant is a high-level format for machine learning datasets that brings together four rich layers.
A curated list of Place Recognition methods, datasets, and various algorithms for LiDAR
Resources about solar power systems for data science
"Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases" by Jiarui Li and Ye Yuan and Zehua Zhang
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
The code used to create and update the Open Australian Legal Corpus, the first and only multijurisdictional open corpus of Australian legislative and judicial documents.
Repository for organizing datasets and papers used in Open LLM.