Trending repositories for topic deep-learning
Tensors and Dynamic neural networks in Python with strong GPU acceleration
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
pix2tex: Using a ViT to convert images of equations into LaTeX code.
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
🔬 A curated list of awesome LLMs & deep learning strategies & tools in financial market.
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
Library for Jacobian descent with PyTorch. It enables optimization of neural networks with multiple losses (e.g. multi-task learning).
Code for our paper "VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters".
RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)
We release `LOBFrame', a novel, open-source code base which presents a renewed way to process large-scale Limit Order Book (LOB) data.
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
Rank images using TrueSkill by comparing them against each other in the browser. 🖼📊
Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch
Scraping Wikipedia by combining LangChain's agents and tools with OpenAI's LLMs and function calling
This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalization...
Swap faces in images and videos. Create face embeddings. Enhance face image quality. Deploy as a web api.
Hands-on examples for motion estimation and correction in MRI
Repository containing the code for the paper "Safe Model-Based Reinforcement Learning using Robust Control Barrier Functions". Specifically, an implementation of SAC + Robust Control Barrier Functions...
⚡️SwanLab: your ML experiment notebook. 你的AI实验笔记本,跟踪与可视化你的机器学习全流程
Dlib compiled binary (.whl) for Python 3.7-3.12 and Windows x64
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, sa...
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
An Open Source Machine Learning Framework for Everyone
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Code for our paper "VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters".
CTNet: A Convolutional Transformer Network for EEG-Based Motor Imagery Classification
Library for Jacobian descent with PyTorch. It enables optimization of neural networks with multiple losses (e.g. multi-task learning).
吴恩达《ChatGPT Prompt Engineering for Developers》课程中英版
Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
⚡️SwanLab: your ML experiment notebook. 你的AI实验笔记本,跟踪与可视化你的机器学习全流程
Rank images using TrueSkill by comparing them against each other in the browser. 🖼📊
Implementation of a Light Recurrent Unit in Pytorch
A lightweight efficient audio codec in 30MB with 30~170x compression ratio. Supports 16kHz mono speech audio.
Versatile computational pipeline for processing protein structure data for deep learning applications.
Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate
Based on tensorrt v8.0+, deploy detect, pose, segment, tracking of YOLOv8 with C++ and python api.
RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)
Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
Code for our paper "VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters".
Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch
Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts.
Code for "Training-free Graph Neural Networks and the Power of Labels as Features" (TMLR 2024)
Self contained pytorch implementation of a sinkhorn based router, for mixture of experts or otherwise
Rank images using TrueSkill by comparing them against each other in the browser. 🖼📊
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
Tensors and Dynamic neural networks in Python with strong GPU acceleration
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
An Open Source Machine Learning Framework for Everyone
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.
Self contained pytorch implementation of a sinkhorn based router, for mixture of experts or otherwise
CTNet: A Convolutional Transformer Network for EEG-Based Motor Imagery Classification
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
Implementation of a Light Recurrent Unit in Pytorch
Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch
MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient CPU-based inference in a local offline on Windows.
Rank images using TrueSkill by comparing them against each other in the browser. 🖼📊
Self-Supervised Scalable Deep Compressed Sensing (IJCV 2024) [PyTorch]
Library for Jacobian descent with PyTorch. It enables optimization of neural networks with multiple losses (e.g. multi-task learning).
[NeurIPS 2022] Official Code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
Medical SAM 2: Segment Medical Images As Video Via Segment Anything Model 2
A lightweight efficient audio codec in 30MB with 30~170x compression ratio. Supports 16kHz mono speech audio.
Developed TactileNet, the first deep-learning model designed for surface roughness recognition using EEG data. This project leverages CNNs to classify surface textures encountered through a robotic de...
吴恩达《ChatGPT Prompt Engineering for Developers》课程中英版
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
[ICLR 2024] Official implementation of "TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting"
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
An Open Source Machine Learning Framework for Everyone
Streamlit — A faster way to build and share data apps.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
pix2tex: Using a ViT to convert images of equations into LaTeX code.
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
Web UI for AutoGen (A Framework Multi-Agent LLM Applications)
From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
The official implementation of Self-Play Preference Optimization (SPPO)
Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.
The Programmable Cypher-based Neuro-Symbolic AGI that lets you program its behavior using Graph-based Prompt Programming: for people who want AI to behave as expected
A Jax-based library for designing and training transformer models from scratch.
Hello Data Enthusiast! I will be updating my 100-day Journey here along with detailed Code Files Starting from Essential Libraries to Advanced Machine Learning and Deep Learning Algorithm Theory with ...