Trending repositories for topic pytorch
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
A high-throughput and memory-efficient inference and serving engine for LLMs
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a ...
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
SGLang is a fast serving framework for large language models and vision language models.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder
Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.
A Comprehensive Framework for Building End-to-End Recommendation Systems with State-of-the-Art Models
Speech to Phoneme, Bandwidth Extension and Speaker Verification using the Vibravox dataset.
[IROS 2024] Representing 3D sparse map points and lines for camera relocalization
[ECCV24] Keypoint Promptable Re-Identification: SOTA ReID method robust to occlusions and multi-person ambiguity
Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x faster on consumer devices.
[ECCV 2024] OneRestore: A Universal Restoration Framework for Composite Degradation
depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.
Fight Detection From Surveillance Cameras by fine-tuning a PyTorch Pretrained Model
PyTorch implementation of Logic Tensor Networks, a Neural-Symbolic framework.
PyTorch implementation of "From Sparse to Soft Mixtures of Experts"
PyTorch at the Edge: Deploying Over 964 TIMM Models on Android with TorchScript and Flutter.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
A high-throughput and memory-efficient inference and serving engine for LLMs
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
SGLang is a fast serving framework for large language models and vision language models.
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a ...
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
GUI for a Vocal Remover that uses Deep Neural Networks.
:mag: AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your d...
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.
[IROS 2024] Representing 3D sparse map points and lines for camera relocalization
TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder
A Comprehensive Framework for Building End-to-End Recommendation Systems with State-of-the-Art Models
Speech to Phoneme, Bandwidth Extension and Speaker Verification using the Vibravox dataset.
Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x faster on consumer devices.
This project focuses on implementing CNN model based on the EEGNet architecture with Pytorch library for classifying motor imagery tasks using EEG data.
PyTorch native quantization and sparsity for training and inference
Developing a UNet3D model for accurate MRI skull stripping using the Calgary Campinas 359 dataset, enhancing neuroimaging preprocessing workflows.
PyTorch implementation of Logic Tensor Networks, a Neural-Symbolic framework.
A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition
A Reverse Engineering Assistant leveraging Retrieval-Augmented Generation (RAG) and the LLaMA-3.1-8B-Instant Large Language Model (LLM). This tool is designed to revolutionize reverse engineering task...
龙良曲pytorch学习代码及一些模型的复现,包括Unet、Vision Transformer、Swim Transformer、ConvNext、YOLOv3、MAE、Diffusion model等
Attribute (or cite) statements generated by LLMs back to in-context information.
Interactive Character Control with Auto-Regressive Motion Diffusion Models
TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
A high-throughput and memory-efficient inference and serving engine for LLMs
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
:mag: AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your d...
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
SGLang is a fast serving framework for large language models and vision language models.
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
pix2tex: Using a ViT to convert images of equations into LaTeX code.
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Visualizer for neural network, deep learning and machine learning models
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
The AdEMAMix Optimizer: Better, Faster, Older.
CTNet: A Convolutional Transformer Network for EEG-Based Motor Imagery Classification
Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch
Library for Jacobian descent with PyTorch. It enables optimization of neural networks with multiple losses (e.g. multi-task learning).
TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder
TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor contents easier.
Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.
[ECCV 2024] UMERegRobust - Universal Manifold Embedding Compatible Features for Robust Point Cloud Registration
This project focuses on implementing CNN model based on the EEGNet architecture with Pytorch library for classifying motor imagery tasks using EEG data.
Volumetric structures such as voxels and SDFs implemented in pytorch
Decomposing and Editing Predictions by Modeling Model Computation
DrugHIVE: Structure-based drug design with a deep hierarchical generative model
A Comprehensive Framework for Building End-to-End Recommendation Systems with State-of-the-Art Models
A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition
Attribute (or cite) statements generated by LLMs back to in-context information.
Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x faster on consumer devices.
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
SGLang is a fast serving framework for large language models and vision language models.
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation
This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models
A 4-hour coding workshop to understand how LLMs are implemented and used
[ICLR 2024] Controlling Vision-Language Models for Universal Image Restoration. 5th place in the NTIRE 2024 Restore Any Image Model in the Wild Challenge.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
A high-throughput and memory-efficient inference and serving engine for LLMs
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a ...
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
pix2tex: Using a ViT to convert images of equations into LaTeX code.
GUI for a Vocal Remover that uses Deep Neural Networks.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
🎉 Modern CUDA Learn Notes with PyTorch: fp32, fp16, bf16, fp8/int8, flash_attn, sgemm, sgemv, warp/block reduce, dot, elementwise, softmax, layernorm, rmsnorm.
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
SGLang is a fast serving framework for large language models and vision language models.
PyTorch native quantization and sparsity for training and inference
From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
Stable Diffusion implemented from scratch in PyTorch
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Tools for easing the handoff between AI/ML and App/SRE teams.
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
A collection of sample programs, notebooks, and tools which highlight the power of the MAX Platform
Tutorial for Porting PyTorch Transformer Models to Candle (Rust)
[CVPR 2024 Highlight] Official repository for paper "SIFU: Side-view Conditioned Implicit Function for Real-world Usable Clothed Human Reconstruction"
An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models
A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.