Trending repositories for topic deep-learning
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Implementation of papers in 100 lines of code.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
[🔥updating ...] AI 自动量化交易机器人(完全本地部署) AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Visualizer for neural network, deep learning and machine learning models
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
The world's 1st free and open source palm recognition SDK for Windows and Linux (Palm detection, ROI extraction, Template extraction, Template mathcing)
Official PyTorch implementation of the WACV 2025 paper "Composed Image Retrieval for Training-FREE DOMain Conversion".
Implementation of papers in 100 lines of code.
VerifAI initiative to build open-source easy-to-deploy generative question-answering engine that can reference and verify answers for correctness (using posteriori model)
ML & DL roadmap with curated resources like videos, articles, research-papers, competitions, projects etc.
A curated list of awesome leaderboard-oriented resources for foundation models
A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks
[CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
[Neurips 2024] A benchmark suite for autoregressive neural emulation of PDEs. (≥46 PDEs in 1D, 2D, 3D; Differentiable Physics; Unrolled Training; Rollout Metrics)
A curated list of datasets, codes, and papers related to scene change detection.
[ECCV2024] "Raindrop Clarity: A Dual-Focused Dataset for Day and Night Raindrop Removal", https://arxiv.org/abs/2407.16957
Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence
DeepMol: A Machine and Deep Learning Framework for Computational Chemistry
Deep reinforcement learning without experience replay, target networks, or batch updates.
Prodigy and ScheduleFree, together at last.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Implementation of papers in 100 lines of code.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
Tensors and Dynamic neural networks in Python with strong GPU acceleration
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Visualizer for neural network, deep learning and machine learning models
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
[🔥updating ...] AI 自动量化交易机器人(完全本地部署) AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant
An Open Source Machine Learning Framework for Everyone
The world's 1st free and open source palm recognition SDK for Windows and Linux (Palm detection, ROI extraction, Template extraction, Template mathcing)
Implementation of papers in 100 lines of code.
A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks
Official implement for "PGN: The RNN’s New Successor is Effective for Long-Range Time Series Forecasting"(NeurIPS 2024) in PyTorch.
Official PyTorch implementation of the WACV 2025 paper "Composed Image Retrieval for Training-FREE DOMain Conversion".
A curated list of awesome leaderboard-oriented resources for foundation models
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).
Deep reinforcement learning without experience replay, target networks, or batch updates.
VerifAI initiative to build open-source easy-to-deploy generative question-answering engine that can reference and verify answers for correctness (using posteriori model)
[NeurIPS2024] Multiview Scene Graph (topologically representing a scene from unposed images by interconnected place and object nodes)
Prodigy and ScheduleFree, together at last.
[CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning
LHU-Net: A Light Hybrid U-Net for Cost-efficient, High-performance Volumetric Medical Image Segmentation
🧼🔎 A holistic self-supervised data cleaning strategy to detect irrelevant samples, near duplicates and label errors (NeurIPS'24).
🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents. https://oasis.camel-ai.org
Deep reinforcement learning without experience replay, target networks, or batch updates.
The official implementation of the paper titled "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing".
The world's 1st free and open source palm recognition SDK for Windows and Linux (Palm detection, ROI extraction, Template extraction, Template mathcing)
ML & DL roadmap with curated resources like videos, articles, research-papers, competitions, projects etc.
Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Visualizer for neural network, deep learning and machine learning models
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
simple but efficient kernel regression and anomaly detection algorithms
Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.
Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch
[NeurIPS 24] MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website...
Prodigy and ScheduleFree, together at last.
[NeurIPS2024] Multiview Scene Graph (topologically representing a scene from unposed images by interconnected place and object nodes)
[Neurips 2024] A benchmark suite for autoregressive neural emulation of PDEs. (≥46 PDEs in 1D, 2D, 3D; Differentiable Physics; Unrolled Training; Rollout Metrics)
ML models + benchmark for tabular data classification and regression
Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
ML & DL roadmap with curated resources like videos, articles, research-papers, competitions, projects etc.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
[ICLR 2024] Official implementation of "TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting"
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
Implementation of Alphafold 3 from Google Deepmind in Pytorch
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
An Open Source Machine Learning Framework for Everyone
Streamlit — A faster way to build and share data apps.
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
500 AI Machine learning Deep learning Computer vision NLP Projects with code
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
SDG is a specialized framework designed to generate high-quality structured tabular data.
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
Face recognition SDK Android with 3D passive liveness detection (Face Detection, Face Landmarks, Face Recognition, Face Liveness, Face Pose, Face Expression, Face attributes)
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
【三年面试五年模拟】AI算法工程师面试秘籍。涵盖AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、SLAM、具身智能、元宇宙、AGI等AI行业面试笔试经验与干货知识。
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
The official implementation of Self-Play Preference Optimization (SPPO)
A curated list of data science & AI guided projects to start building your portfolio
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models
ONNX-compatible Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
A JavaScript library like PyTorch, with GPU acceleration.
Foundational model for human-like, expressive TTS