Trending repositories for topic pytorch
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
A high-throughput and memory-efficient inference and serving engine for LLMs
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a ...
BioNeMo Framework: For building and adapting AI models in drug discovery at scale
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. W...
SGLang is a fast serving framework for large language models and vision language models.
GUI for a Vocal Remover that uses Deep Neural Networks.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...
BioNeMo Framework: For building and adapting AI models in drug discovery at scale
This is a warehouse for MobileNetV4-Pytorch-model, can be used to train your image-datasets for vision tasks.
PyTorch implementation of the Differential-Transformer architecture for sequence modeling, specifically tailored as a decoder-only model similar to large language models (LLMs). The architecture incor...
[ECCV 2024] Monocular Occupancy Prediction for Scalable Indoor Scenes
:fire: [CVPR 2024] Color Shift Estimation-and-Correction for Image Enhancement
🎹 Moodify - an emotion-based music recommendation system that uses AI/ML models to analyze text, speech, and facial expressions, providing personalized music recommendations across web and mobile pla...
Model Predictive Path Integral (MPPI) with approximate dynamics implemented in pytorch
adam implements a collection of algorithms for calculating rigid-body dynamics in Jax, CasADi, PyTorch, and Numpy.
在图像获取和传输过程中,往往伴随着各种形式的损坏,降低了图像质量和对图像信息的准确解释,一些老照片因为保存不当也会变得存在污渍或者破损缺失。图像修复技术主要用来修复日常生活中被噪声污染或者人为破坏的破损图像,也可应用于替换图像中的小区域或者瑕疵。目前,图像修复工作仍然由经验丰富的图像修复师来完成,让图像修复借助深度学习算法实现自动化日趋成为该领域的发展方向。本课题基于深度学习算法和图像处理技术,设...
A gravitational lensing simulator for the machine learning era.
Semi-Supervised Domain Adaptation with Source Label Adaptation, accepted to CVPR 2023
Implementation of Complex Valued Neural Networks in Pytorch 🧠
[ECCV 2024] UMERegRobust - Universal Manifold Embedding Compatible Features for Robust Point Cloud Registration
Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
A high-throughput and memory-efficient inference and serving engine for LLMs
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a ...
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. W...
SGLang is a fast serving framework for large language models and vision language models.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
GUI for a Vocal Remover that uses Deep Neural Networks.
BioNeMo Framework: For building and adapting AI models in drug discovery at scale
pix2tex: Using a ViT to convert images of equations into LaTeX code.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
BioNeMo Framework: For building and adapting AI models in drug discovery at scale
PyTorch implementation of the Differential-Transformer architecture for sequence modeling, specifically tailored as a decoder-only model similar to large language models (LLMs). The architecture incor...
an inference lib for image/video restoration with VapourSynth support
:fire: [CVPR 2024] Color Shift Estimation-and-Correction for Image Enhancement
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction
🎹 Moodify - an emotion-based music recommendation system that uses AI/ML models to analyze text, speech, and facial expressions, providing personalized music recommendations across web and mobile pla...
YOLOv8-3D is a LowCode, Simple 2D and 3D Bounding Box Object Detection and Tracking , Python 3.10
This is a warehouse for MobileNetV4-Pytorch-model, can be used to train your image-datasets for vision tasks.
Deep learning training framework for image super resolution and restoration.
在图像获取和传输过程中,往往伴随着各种形式的损坏,降低了图像质量和对图像信息的准确解释,一些老照片因为保存不当也会变得存在污渍或者破损缺失。图像修复技术主要用来修复日常生活中被噪声污染或者人为破坏的破损图像,也可应用于替换图像中的小区域或者瑕疵。目前,图像修复工作仍然由经验丰富的图像修复师来完成,让图像修复借助深度学习算法实现自动化日趋成为该领域的发展方向。本课题基于深度学习算法和图像处理技术,设...
The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
Semi-Supervised Domain Adaptation with Source Label Adaptation, accepted to CVPR 2023
Repository containing the code for the paper "Safe Model-Based Reinforcement Learning using Robust Control Barrier Functions". Specifically, an implementation of SAC + Robust Control Barrier Functions...
无人机动态覆盖控制;1. 实现了一个无人机点覆盖环境;2. 给出了无人机连通保持规则;3. 给出了基于MARL的控制算法
Proximal Policy Optimization (PPO) algorithm using PyTorch to train an agent for a rocket landing task in a custom environment
A utility to inspect, validate, sign and verify machine learning model files.
Efficient CUDA kernels for training convolutional neural networks with PyTorch.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a ...
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
A high-throughput and memory-efficient inference and serving engine for LLMs
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. W...
SGLang is a fast serving framework for large language models and vision language models.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
deep learning for image processing including classification and object-detection etc.
GUI for a Vocal Remover that uses Deep Neural Networks.
BioNeMo Framework: For building and adapting AI models in drug discovery at scale
Proximal Policy Optimization (PPO) algorithm using PyTorch to train an agent for a rocket landing task in a custom environment
PyTorch implementation of the Differential-Transformer architecture for sequence modeling, specifically tailored as a decoder-only model similar to large language models (LLMs). The architecture incor...
The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
ML Nexus is an open-source collection of machine learning projects, covering topics like neural networks, computer vision, and NLP. Whether you're a beginner or expert, contribute, collaborate, and gr...
:fire: [CVPR 2024] Color Shift Estimation-and-Correction for Image Enhancement
Best practices & guides on how to write distributed pytorch training code
IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual Reinforcement Learning
A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition
[RAL 2024] D2S: Representing sparse descriptors and 3D coordinates for camera relocalization
[IROS 2024] Representing 3D sparse map points and lines for camera relocalization
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction
[ECCV 2024] UMERegRobust - Universal Manifold Embedding Compatible Features for Robust Point Cloud Registration
Securely share and store AI/ML projects as OCI artifacts in your container registry.
SGLang is a fast serving framework for large language models and vision language models.
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
A 4-hour coding workshop to understand how LLMs are implemented and used
This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models
LightMirrors is a lightweight mirror server with caching capabilities that currently supports DockerHub, K8S, PyPI, PyTorch, and NPM.
From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
[ECCV 2024] InstructIR: High-Quality Image Restoration Following Human Instructions https://huggingface.co/spaces/marcosv/InstructIR
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
A high-throughput and memory-efficient inference and serving engine for LLMs
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a ...
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
GUI for a Vocal Remover that uses Deep Neural Networks.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. W...
SGLang is a fast serving framework for large language models and vision language models.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
📚Modern CUDA Learn Notes with PyTorch: Tensor/CUDA Cores, 📖150+ CUDA Kernels with PyTorch bindings, 📖HGEMM/SGEMM (95%~99% cuBLAS performance), 📖100+ LLM/CUDA Blogs.
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
SGLang is a fast serving framework for large language models and vision language models.
From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
Securely share and store AI/ML projects as OCI artifacts in your container registry.
LLM (Large Language Model) FineTuning
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks
PyTorch native quantization and sparsity for training and inference
A collection of sample programs, notebooks, and tools which highlight the power of the MAX Platform
[CVPR 2024 Highlight] Official repository for paper "SIFU: Side-view Conditioned Implicit Function for Real-world Usable Clothed Human Reconstruction"
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.
An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models
Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"
A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.