Trending repositories for topic deep-learning
Implementation of a transformer for reinforcement learning using `x-transformers`
Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to...
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App: [MNN-LLM-Android](./apps/Android/MnnLlmChat/README....
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
500 AI, machine learning, deep learning, computer vision, and NLP projects with code
An Open Source Machine Learning Framework for Everyone
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
A framework for fine-tuning retrieval-augmented generation (RAG) systems.
This repository contains AI Bootcamp material that consists of a workflow for LLMs
Golang AI applications have incredible potential, with features like exceptional speed, easy debugging, concurrency, and excellent libraries for ML, deep learning, and reinforcement learning.
Build computer vision models in a fraction of the time and with less data.
Flare Guard is an AI-powered system designed to instantly detect and report fires and smoke.
This model uses pre-trained CNN architectures (VGG16 and ResNet50) via transfer learning to classify waste or garbage material into 7 classes for recycling.
Learn AI and LLMs from scratch using free resources
WhiteFox: White-Box Compiler Fuzzing Empowered by Large Language Models (OOPSLA 2024)
Build a Large Language Model (From Scratch) book and Finetuned Models
A Deep-learning Driven Predictor of Compound Synthesis Accessibility
Tutorials on computer vision with PyTorch and FiftyOne
Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."
(CVPR 2025) Adversarial Diffusion Compression for Real-World Image Super-Resolution [PyTorch]
Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows
[🔥updating ...] AI automated quantitative trading bot (fully local deployment). AI-powered Quantitative Investment Research Platform. 📃 Online docs: https://ufund-me.github.io/Qbot ✨ qbot-mini: https://github.com/Charmve/iQuant
CVPR 2025 DarkIR: Robust Low-Light Image Restoration - State of the art low light deblurring [Official PyTorch Implementation]
A benchmark for spaced repetition schedulers/algorithms
Your Cheat Sheet for Machine Learning Interview – Questions and Answers.
LLM, RL, DPO, Distillation, Alignment. Initiated by the author of the book 📘 "Large Model Algorithms" (《大模型算法》).
Introduction to PyTorch, covering tensor initialization, operations, indexing, and reshaping.
This Deepfake Detection and Prevention project leverages advanced AI techniques to identify manipulated images with 95% accuracy.
Burn is a next generation Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
A Python toolkit for debiasing neural networks in image classification tasks
Open-source machine learning framework for Java. Designed to be fast and lightweight.
A collection of graph foundation models including papers, codes, and datasets.
Attention Kernels for Symmetric Power Transformers
A configurable engine for analysing multi-lingual and multi-modal content.
Datasets to protect Earth's forests and biodiversity
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
The easiest and laziest way to build multi-agent LLM applications.
[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents. https://oasis.camel-ai.org
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
MixTeX: multimodal LaTeX, Chinese-English (ZhEn), and table OCR. It performs efficient CPU-based inference locally and offline on Windows.
Synthetic data curation for post-training and structured data extraction
Unofficial implementation of Titans, SOTA memory for transformers, in PyTorch
PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from Meta AI
Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.
Medical SAM 2: Segment Medical Images As Video Via Segment Anything Model 2
AI Productivity Tool - Free and open source; improves user productivity and protects privacy and data security. Provides efficient and convenient AI solutions, built-in local exclusive ChatGPT, Phi, DeepSee...
Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper
[ICLR 2025 Spotlight] Official implementation of "Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts"
Streamlit — A faster way to build and share data apps.
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
The official implementation of Self-Play Preference Optimization (SPPO)
This repository offers a collection of recent time series research papers, covering forecasting, anomaly detection, and related topics, with links to code and resources.
Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation
(TPAMI 2025) Invertible Diffusion Models for Compressed Sensing [PyTorch]
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in PyTorch
Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence
Deploy agents, models, RAG, pipelines and more - without learning MLOps.
Comprehensive benchmarking of protein-ligand structure prediction methods. (ICML 2024 AI4Science)