Statistics for topic data
RepositoryStats tracks 623,870 Github repositories, of these 1,046 are tagged with the data topic. The most common primary language for repositories using this topic is Python (299). Other languages include: TypeScript (96), JavaScript (88), Jupyter Notebook (79), R (44), Go (39), Java (36), HTML (35), Rust (31), C# (20)
Stargazers over time for topic data
Most starred repositories for topic data (view more)
Trending repositories for topic data (view more)
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
🐵 Preswald is a framework for building and deploying interactive data apps, internal tools, and dashboards with Python. With one command, you can launch, share, and deploy locally or in the cloud, tu...
Extract, Transform, Index Data. CocoIndex is the world's first open-source engine that supports both custom transformation logic and incremental updates specialized for data indexing.
Machine Learning Roadmap for 2025. Step-by-step guide to become a Data Scientist. Covers the best free learning resources from Python basics to Deep Learning and MLOps.
A list of public EMG datasets and their papers, with a focus on raw EMG signals.
Extract, Transform, Index Data. CocoIndex is the world's first open-source engine that supports both custom transformation logic and incremental updates specialized for data indexing.
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Machine Learning Roadmap for 2025. Step-by-step guide to become a Data Scientist. Covers the best free learning resources from Python basics to Deep Learning and MLOps.
How to disable Firefox Telemetry and Data Collection
🔥 This repository contains complete application examples, including websites and other projects, developed using Firecrawl.
Xpert AI is an AI agents and data analysis platform for enterprises to make business decisions.
Extract, Transform, Index Data. CocoIndex is the world's first open-source engine that supports both custom transformation logic and incremental updates specialized for data indexing.
Machine Learning Roadmap for 2025. Step-by-step guide to become a Data Scientist. Covers the best free learning resources from Python basics to Deep Learning and MLOps.
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data :bar_chart:
🕷️ An undetectable, powerful, flexible, high-performance Python library that makes Web Scraping easy again!
🔥 This repository contains complete application examples, including websites and other projects, developed using Firecrawl.
A unified tool to generate fine-tuning datasets for LLMs, including questions, answers, and dialogues. ✨🤖📚💬
Machine Learning Roadmap for 2025. Step-by-step guide to become a Data Scientist. Covers the best free learning resources from Python basics to Deep Learning and MLOps.
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
🕷️ An undetectable, powerful, flexible, high-performance Python library that makes Web Scraping easy again!
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
This is a repo with links to everything you'd ever want to learn about data engineering
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
🤖 Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.
2025 AI/ML internship & new graduate job list updated daily
RobustMQ is a next-generation, high-performance, cloud-native, converged message queue that is compatible with multiple mainstream message queuing protocols and has complete Serveless capabilities.