Trending repositories for topic reinforcement-learning
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
Massively parallel rigidbody physics simulation on accelerator hardware.
Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.
A curated list of reinforcement learning with human feedback resources (continually updated)
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement lea...
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
A framework for creating rich, 3D, Minecraft-like single and multi-agent environments for AI research based on Minetest
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
"Sequential Dexterity: Chaining Dexterous Policies for Long-Horizon Manipulation" code repository
PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
[RSS 2024]: Expressive Whole-Body Control for Humanoid Robots
[NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback
SAPIEN Manipulation Skill Framework, a open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.
Reinforcement Learning-based exploration algorithm for a fleet of UAVs in an unknown environment.
This repository contains the source code for our paper: "NaviSTAR: Socially Aware Robot Navigation with Hybrid Spatio-Temporal Graph Transformer and Preference Learning". For more details, please refe...
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
Deep reinforcement learning without experience replay, target networks, or batch updates.
UMI on Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers
A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
Autonomous UAV Navigation without Collision using Visual Information in Airsim
Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).
Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Omniverse Isaac Gym and Isaac Lab
A goal-driven autonomous exploration through deep reinforcement learning (ICRA 2022) system that combines reactive and planned robot navigation in unknown environments
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
Implementation of papers in 100 lines of code.
A curated list of reinforcement learning with human feedback resources (continually updated)
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement lea...
A framework for creating rich, 3D, Minecraft-like single and multi-agent environments for AI research based on Minetest
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Deep reinforcement learning without experience replay, target networks, or batch updates.
[RSS 2024]: Expressive Whole-Body Control for Humanoid Robots
[ICLR 2022] Accelerated Policy Learning with Parallel Differentiable Simulation
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
Reinforcement Learning-based exploration algorithm for a fleet of UAVs in an unknown environment.
The Flatland Framework is a multi-purpose environment to tackle problems around resilient resource allocation under uncertainty. It is designed to be a flexible and method agnostic to solve a wide ran...
This repository contains the source code for our paper: "NaviSTAR: Socially Aware Robot Navigation with Hybrid Spatio-Temporal Graph Transformer and Preference Learning". For more details, please refe...
UMI on Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers
The Julia implementation of the generalised hierarchical Gaussian filter
Reinforcement Learning with Model Predictive Control
The Source code for paper "Optimal Energy System Scheduling Combining Mixed-Integer Programming and Deep Reinforcement Learning". Safe reinforcement learning, energy management
PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm
Implementation of papers in 100 lines of code.
A driving simulator built specifically for reinforcement learning/imitation learning. The scenario generator is based on highway-env, and 3-D animation is based on Carla
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
Implementation of papers in 100 lines of code.
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Deep reinforcement learning without experience replay, target networks, or batch updates.
Simplifying reinforcement learning for complex game environments
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement lea...
Deep reinforcement learning without experience replay, target networks, or batch updates.
A deep reinforcement learning framework for generating formulaic alpha factors for quantitative investment, powered by GFlowNet, implemented in Python&PyTorch.
Implementation of papers in 100 lines of code.
A framework for creating rich, 3D, Minecraft-like single and multi-agent environments for AI research based on Minetest
JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
Multi-Agent Deep Reinforcement Learning (MA-DRL) Routing Simulator for satellite networks
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
UMI on Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
AAAI 2024 Papers: Explore a comprehensive collection of innovative research papers presented at one of the premier artificial intelligence conferences. Seamlessly integrate code implementations for be...
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)
🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL
UMI on Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various rewa...
Awesome LLM Papers and repos on very comprehensive topics.
Deep reinforcement learning without experience replay, target networks, or batch updates.
This Python-based simulation platform can realistically model various components of the UAV network, including the network layer, MAC layer and physical layer, as well as the UAV mobility model, energ...
Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"
Official Implementation of the ICLR 2024 spotlight paper: Universal Humanoid Motion Representations for Physics-Based Control
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers.
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement lea...
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Official Implementation for "In-Context Reinforcement Learning for Variable Action Spaces"
[CVPR 2024] Official code for EgoGen: An Egocentric Synthetic Data Generator
UMI on Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers
Simplifying reinforcement learning for complex game environments
[NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning them into sequences of latent world model states. Latent state se...
[NeurIPS 2024] ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution
[RSS 2024]: Expressive Whole-Body Control for Humanoid Robots
A framework for creating rich, 3D, Minecraft-like single and multi-agent environments for AI research based on Minetest
Implementation of Soft Actor Critic and some of its improvements in Pytorch
Top paper collection for stock price prediction, quantitative trading. Covering top conferences and journals like KDD, WWW, CIKM, AAAI, IJCAI, ACL, EMNLP.
Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024
Tactics2D: A Reinforcement Learning Environment Library with Generative Scenarios for Driving Decision-making
🚗 This repository offers a ready-to-use training and evaluation environment for conducting various experiments using Deep Reinforcement Learning (DRL) in the CARLA simulator with the help of Stable B...
Framework for integrating ROS and Gazebo with gymnasium, streamlining the development and training of RL algorithms in realistic robot simulations.