Statistics for topic reinforcement-learning
RepositoryStats tracks 584,796 GitHub repositories, of which 1,313 are tagged with the reinforcement-learning topic. The most common primary language for repositories using this topic is Python (890). Other languages include Jupyter Notebook (176), C++ (47), C# (15), and HTML (11).
Stargazers over time for topic reinforcement-learning
Most starred repositories for topic reinforcement-learning
Trending repositories for topic reinforcement-learning
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
ICLR 2024: SafeDreamer: Safe Reinforcement Learning with World Models
A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
An all-weather, day-and-night, collision avoidance simulator that can be implemented as a digital twin for the autonomous COLREG-compliant navigation of maritime vessels.
Repository containing the code for the paper "Safe Model-Based Reinforcement Learning using Robust Control Barrier Functions". Specifically, an implementation of SAC + Robust Control Barrier Functions...
Proximal Policy Optimization (PPO) algorithm using PyTorch to train an agent for a rocket landing task in a custom environment
HFTFramework utilized for research on "A reinforcement learning approach to improve the performance of the Avellaneda-Stoikov market-making algorithm"
Official Implementation for the paper "R-AIF: Solving Sparse-Reward Robotic Tasks from Pixels with Active Inference and World Models"
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
"Hung-yi Lee Deep Learning Tutorial" (recommended by Professor Hung-yi Lee 👍, known as the "Apple Book" 🍎). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases
An extensive library of AI resources including books, courses, papers, guides, articles, tutorials, notebooks, AI field advancements and more.
Official Implementation for "In-Context Reinforcement Learning for Variable Action Spaces"