Trending repositories for topic pytorch
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
A high-throughput and memory-efficient inference and serving engine for LLMs
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Large Concept Models: Language modeling in a sentence representation space
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
Effortless data labeling with AI support from Segment Anything and other awesome models.
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. W...
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Official repository of "TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models".
A framework for creating rich, 3D, Minecraft-like single and multi-agent environments for AI research based on Minetest
The official code for "Olympus: A Universal Task Router for Computer Vision Tasks"
🎹 Moodify - an emotion-based music recommendation system that uses AI/ML models to analyze text, speech, and facial expressions, providing personalized music recommendations across web and mobile pla...
From scratch implementation of a vision language model in pure PyTorch
InspireMusic: A Unified Framework for Music, Song, Audio Generation.
[NeurIPS 2024] Neural Localizer Fields for Continuous 3D Human Pose and Shape Estimation
Low-effort scraping of Python's pickle format in Rust. It is to complete pickle parsing what BeautifulSoup was to complete HTML parsing.
This repository contains Dockerfiles, scripts, yaml files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow and PyTorch that have been optimized for Intel platforms. Scali...
TradeAI: Empowering Algorithmic Trading with Deep Learning for Cryptocurrency Data. Explore the potential of deep learning in cryptocurrency trading through our full-stack algorithmic trading system, ...
PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm
Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in PyTorch.
Simple and clean implementation of Conditional Variational AutoEncoder (CVAE) using PyTorch
A highly optimized LLM inference acceleration engine for Llama and its variants.
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a ...
Visualizer for neural network, deep learning and machine learning models
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
SGLang is a fast serving framework for large language models and vision language models.
A repository for MobileNetV4-Pytorch-model that can be used to train on your image datasets for vision tasks.
The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
This project implements an unsupervised defect detection algorithm based on VAE-CycleGAN image reconstruction. The algorithm combines the strengths of the variational autoencoder (VAE) and CycleGAN to detect defects and anomalies in images without labeled data.
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and fully reproducible.
FAPLM: A Drop-in Efficient PyTorch Implementation of Protein Language Models
Implementation of papers in 100 lines of code.
Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.
RaSPDAM - an efficient machine learning algorithm based on U-Net to detect radio single-pulse signals.
A deep reinforcement learning framework for generating formulaic alpha factors for quantitative investment, powered by GFlowNet, implemented in Python and PyTorch.
🔥[NIPS 2024, Official Code] for paper "Rethinking No-reference Image Exposure Assessment from Holism to Pixel: Models, Datasets and Benchmarks". Official Weights and Demos provided. The first pixel-level exposure assessment dataset, algorithm, and b...
Prodigy and ScheduleFree, together at last.
This repository is the official code for ResEmoteNet. The project is written in Python using PyTorch on a MacBook Pro (M2 Pro, 10-core CPU and 16-core GPU).
BioNeMo Framework: For building and adapting AI models in drug discovery at scale
This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
A 4-hour coding workshop to understand how LLMs are implemented and used
LightMirrors is a lightweight mirror server with caching capabilities that currently supports DockerHub, K8S, PyPI, PyTorch, and NPM.
From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
An open source DevOps tool for packaging and versioning AI/ML models, datasets, code, and configuration into an OCI artifact.
[ECCV 2024] InstructIR: High-Quality Image Restoration Following Human Instructions https://huggingface.co/spaces/marcosv/InstructIR
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" (TMLR 2024)
GUI for a Vocal Remover that uses Deep Neural Networks.
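AIGC-interview/CV-interview/LLMs-interview: a repository collecting interview questions and answers, along with new ideas, new questions, new resources, and new projects from work and research.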
A collection of sample programs, notebooks, and tools which highlight the power of the MAX Platform
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
InternEvo is an open-source lightweight training framework that aims to support model pre-training without the need for extensive dependencies.
Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"
⚡️SwanLab: your ML experiment notebook. Log and visualize the entire AI training workflow.
An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models
A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.
infinifi plays gentle lofi music in the background indefinitely
ONNX-compatible Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data