Trending repositories for topic deep-learning
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
500 AI Machine learning Deep learning Computer vision NLP Projects with code
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Tensors and Dynamic neural networks in Python with strong GPU acceleration
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
Effortless data labeling with AI support from Segment Anything and other awesome models.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
[🔥updating ...] AI 自动量化交易机器人(完全本地部署) AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
The world's 1st free and open source palm recognition SDK for Windows and Linux (Palm detection, ROI extraction, Template extraction, Template mathcing)
Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch
Contains Solutions to Deep Learning Specailization - Coursera
Implementation of GotenNet, new SOTA 3d equivariant transformer, in Pytorch
Easy and Clear Pipeline With LLM-Automation to Preprocess Medical Image for Everybody
Colab Notebooks covering deep learning tools for biomolecular structure prediction and design
Official implement for "PGN: The RNN’s New Successor is Effective for Long-Range Time Series Forecasting"(NeurIPS 2024) in PyTorch.
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
The source code of IEEE TPAMI 2025 "Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation".
A summary of open-source deep learning-based infrared and visible image fusion and some vision algorithms. 红外与可见光图像融合的开源代码
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website...
算法工程师、机器学习工程师、软件工程师、数据科学家-实践与面试指南 | Interview guide for MLE, SDE, DS
🎇👌Ezy-Parking is a complete parking management system that applies smart solutions for short time rental of empty spaces.
Realtime Sign Language Detection: Deep learning model for accurate, real-time recognition of sign language gestures using Python and TensorFlow.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
500 AI Machine learning Deep learning Computer vision NLP Projects with code
Contains Solutions and Notes for the Machine Learning Specialization By Stanford University and Deeplearning.ai - Coursera (2022) by Prof. Andrew NG
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Tensors and Dynamic neural networks in Python with strong GPU acceleration
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
[🔥updating ...] AI 自动量化交易机器人(完全本地部署) AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Implementation of GotenNet, new SOTA 3d equivariant transformer, in Pytorch
Easy and Clear Pipeline With LLM-Automation to Preprocess Medical Image for Everybody
Colab Notebooks covering deep learning tools for biomolecular structure prediction and design
Contains Solutions to Deep Learning Specailization - Coursera
Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch
The source code of IEEE TPAMI 2025 "Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation".
The world's 1st free and open source palm recognition SDK for Windows and Linux (Palm detection, ROI extraction, Template extraction, Template mathcing)
The source code of IEEE TPAMI 2025 "Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation".
Official implement for "PGN: The RNN’s New Successor is Effective for Long-Range Time Series Forecasting"(NeurIPS 2024) in PyTorch.
Become skilled in Artificial Intelligence, Machine Learning, Generative AI, Deep Learning, Data Science, Natural Language Processing, Reinforcement Learning and more with this complete 0 to 100 reposi...
train and use graph-based ML models of potential energy surfaces
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
Food calorie estimations Using Deep Learning And Computer Vision
Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
The world's 1st free and open source palm recognition SDK for Windows and Linux (Palm detection, ROI extraction, Template extraction, Template mathcing)
Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch
Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch
Implementation of GotenNet, new SOTA 3d equivariant transformer, in Pytorch
A comprehensive template for aligning large language models (LLMs) using Reinforcement Learning from Human Feedback (RLHF), transfer learning, and more. Build your own customizable LLM alignment solut...
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
Tensors and Dynamic neural networks in Python with strong GPU acceleration
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Visualizer for neural network, deep learning and machine learning models
Implementation of papers in 100 lines of code.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
500 AI Machine learning Deep learning Computer vision NLP Projects with code
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Deep reinforcement learning without experience replay, target networks, or batch updates.
Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.
Material for lectures on Diffusion models at IE university
train and use graph-based ML models of potential energy surfaces
Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch
Implementation of GotenNet, new SOTA 3d equivariant transformer, in Pytorch
Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch
[NeurIPS 24] MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks
The official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing".
simple but efficient kernel regression and anomaly detection algorithms
🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents. https://oasis.camel-ai.org
The source code of IEEE TPAMI 2025 "Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation".
[Neurips 2024] A benchmark suite for autoregressive neural emulation of PDEs. (≥46 PDEs in 1D, 2D, 3D; Differentiable Physics; Unrolled Training; Rollout Metrics)
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
[ICLR 2024] Official implementation of "TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting"
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
Implementation of Alphafold 3 from Google Deepmind in Pytorch
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
Easiest and laziest way for building multi-agent LLMs applications.
MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient CPU-based inference in a local offline on Windows.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
An Open Source Machine Learning Framework for Everyone
500 AI Machine learning Deep learning Computer vision NLP Projects with code
Streamlit — A faster way to build and share data apps.
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
SDG is a specialized framework designed to generate high-quality structured tabular data.
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
Face recognition SDK Android with 3D passive liveness detection (Face Detection, Face Landmarks, Face Recognition, Face Liveness, Face Pose, Face Expression, Face attributes)
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
The official implementation of Self-Play Preference Optimization (SPPO)
A curated list of data science & AI guided projects to start building your portfolio
【三年面试五年模拟】AI算法工程师面试秘籍。涵盖AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、SLAM、具身智能、元宇宙、AGI等AI行业面试笔试经验与干货知识。
⚡️SwanLab: your ML experiment notebook. 你的AI实验笔记本,日志记录与可视化AI训练全流程。
An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models
ONNX-compatible Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Foundational model for human-like, expressive TTS