Statistics for topic data
RepositoryStats tracks 584,797 Github repositories, of these 975 are tagged with the data topic. The most common primary language for repositories using this topic is Python (272). Other languages include: TypeScript (91), JavaScript (86), Jupyter Notebook (74), R (39), Go (38), Java (34), HTML (32), Rust (30), C++ (19)
Stargazers over time for topic data
Most starred repositories for topic data (view more)
Trending repositories for topic data (view more)
This is a repo with links to everything you'd ever want to learn about data engineering
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
LakeSail's computation framework with a mission to unify stream processing, batch processing, and compute-intensive (AI) workloads.
This is a repo with links to everything you'd ever want to learn about data engineering
Ylem is an open-source platform for real-time data streaming orchestration
A project providing a Graphic Walker Pane for use with HoloViz Panel.
LakeSail's computation framework with a mission to unify stream processing, batch processing, and compute-intensive (AI) workloads.
Open source project for data preparation of LLM application builders
This is a repo with links to everything you'd ever want to learn about data engineering
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
🤖 Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.
A project providing a Graphic Walker Pane for use with HoloViz Panel.
This is a repo with links to everything you'd ever want to learn about data engineering
Ylem is an open-source platform for real-time data streaming orchestration
LakeSail's computation framework with a mission to unify stream processing, batch processing, and compute-intensive (AI) workloads.
Browser-only utils for sharing/synchronizing data using "animated" QR codes
A project providing a Graphic Walker Pane for use with HoloViz Panel.
This is a repo with links to everything you'd ever want to learn about data engineering
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Ylem is an open-source platform for real-time data streaming orchestration
2025 AI/ML internship & new graduate job list updated daily
This is a repo with links to everything you'd ever want to learn about data engineering
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
This is a repo with links to everything you'd ever want to learn about data engineering
A configuration as code language with rich validation and tooling.
🤖 Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.
LLM based data scientist, AI native data application. AI-driven infinite thinking redefines BI.
2025 AI/ML internship & new graduate job list updated daily
Chat with your data, modify it, visualize it, create and test machine learning models all in plain English. DataHorse makes data analysis and data science conversational using LLMs.