Trending repositories for topic reinforcement-learning

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

3,244 (+93)

apache-2.0

labmlai/annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...

57,297 (+79)

mit

ray-project/ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

34,556 (+49)

apache-2.0

hijkzzz/Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

5,726 (+40)

apache-2.0

d2l-ai/d2l-en

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

24,265 (+36)

Farama-Foundation/Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

7,662 (+35)

mit

XinJingHao/DRL-Pytorch

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

1,662 (+27)

MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

4,137 (+26)

DLR-RM/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

9,362 (+26)

mit

Developer-Y/cs-video-courses

List of Computer Science courses with video lectures.

67,572 (+24)

vwxyzjn/cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

5,928 (+19)

AI4Finance-Foundation/FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

14,438 (+19)

mit

google/brax

Massively parallel rigidbody physics simulation on accelerator hardware.

2,423 (+18)

apache-2.0

datawhalechina/easy-rl

强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/

9,764 (+17)

bulletphysics/bullet3

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

12,805 (+15)

opendilab/awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

3,567 (+14)

apache-2.0

TJU-DRL-LAB/AI-Optimizer

The next generation deep reinforcement learning tookit

4,865 (+14)

Unity-Technologies/ml-agents

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement lea...

17,357 (+13)

datawhalechina/leedl-tutorial

《李宏毅深度学习教程》（李宏毅老师推荐👍，苹果书🍎），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases

14,077 (+13)

pytorch/rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

2,427 (+10)

mit

Last 3 days (relative gain)

mikelma/craftium

A framework for creating rich, 3D, Minecraft-like single and multi-agent environments for AI research based on Minetest

50 (+9%)

ml-jku/LRAM

A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks

27 (+4%)

mit

sequential-dexterity/SeqDex

"Sequential Dexterity: Chaining Dexterous Policies for Long-Horizon Manipulation" code repository

140 (+4%)

apache-2.0

ammarhydr/SAC-Lagrangian

PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm

33 (+3%)

OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

3,244 (+3%)

apache-2.0

chengxuxin/expressive-humanoid

[RSS 2024]: Expressive Whole-Body Control for Humanoid Robots

225 (+3%)

zjunlp/TRICE

[NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback

41 (+3%)

mit

haosulab/ManiSkill

SAPIEN Manipulation Skill Framework, a open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.

1,010 (+2%)

apache-2.0

gbattocletti-riccardoUrb/rl-exploration-for-uavs

Reinforcement Learning-based exploration algorithm for a fleet of UAVs in an unknown environment.

52 (+2%)

SMARTlab-Purdue/SAN-NaviSTAR

This repository contains the source code for our paper: "NaviSTAR: Socially Aware Robot Navigation with Hybrid Spatio-Temporal Graph Transformer and Preference Learning". For more details, please refe...

55 (+2%)

mit

dunnolab/xland-minigrid-datasets

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

59 (+2%)

apache-2.0

XinJingHao/DRL-Pytorch

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

1,662 (+2%)

mohmdelsayed/streaming-drl

Deep reinforcement learning without experience replay, target networks, or batch updates.

186 (+2%)

real-stanford/umi-on-legs

UMI on Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers

239 (+1%)

mit

michaelnny/alpha_zero

A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games

89 (+1%)

mit

sunghoonhong/AirsimDRL

Autonomous UAV Navigation without Collision using Visual Information in Airsim

197 (+1%)

opendilab/GenerativeRL

Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).

101 (+1%)

apache-2.0

Toni-SM/skrl

Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Omniverse Isaac Gym and Isaac Lab

594 (+0.8%)

mit

Talendar/flappy-bird-gym

An OpenAI Gym environment for the Flappy Bird game

121 (+0.8%)

mit

reiniscimurs/GDAE

A goal-driven autonomous exploration through deep reinforcement learning (ICRA 2022) system that combines reactive and planned robot navigation in unknown environments

133 (+0.8%)

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

3,244 (+240)

apache-2.0

labmlai/annotated_deep_learning_paper_implementations

57,297 (+171)

mit

ray-project/ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

34,556 (+124)

apache-2.0

hijkzzz/Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

5,726 (+103)

apache-2.0

Farama-Foundation/Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

7,662 (+71)

mit

d2l-ai/d2l-en

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

24,265 (+66)

MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

4,137 (+59)

AI4Finance-Foundation/FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

14,438 (+52)

mit

datawhalechina/easy-rl

强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/

9,764 (+48)

PufferAI/PufferLib

Simplifying reinforcement learning for complex game environments

1,406 (+47)

mit

Developer-Y/cs-video-courses

List of Computer Science courses with video lectures.

67,572 (+47)

DLR-RM/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

9,362 (+44)

mit

datawhalechina/leedl-tutorial

《李宏毅深度学习教程》（李宏毅老师推荐👍，苹果书🍎），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases

14,077 (+43)

XinJingHao/DRL-Pytorch

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

1,662 (+43)

MaximeVandegar/Papers-in-100-Lines-of-Code

Implementation of papers in 100 lines of code.

1,345 (+40)

mit

opendilab/awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

3,567 (+34)

apache-2.0

ddbourgin/numpy-ml

Machine learning, in numpy

15,807 (+34)

gpl-3.0

vwxyzjn/cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

5,928 (+33)

TJU-DRL-LAB/AI-Optimizer

The next generation deep reinforcement learning tookit

4,865 (+26)

Unity-Technologies/ml-agents

17,357 (+23)

Last week (relative gain)

mikelma/craftium

A framework for creating rich, 3D, Minecraft-like single and multi-agent environments for AI research based on Minetest

50 (+11%)

ml-jku/LRAM

A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks

27 (+8%)

mit

OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

3,244 (+8%)

apache-2.0

mohmdelsayed/streaming-drl

Deep reinforcement learning without experience replay, target networks, or batch updates.

186 (+6%)

chengxuxin/expressive-humanoid

[RSS 2024]: Expressive Whole-Body Control for Humanoid Robots

225 (+5%)

NVlabs/DiffRL

[ICLR 2022] Accelerated Policy Learning with Parallel Differentiable Simulation

287 (+5%)

TokyoRobotics/torobo_mujoco

Torobo models and example scripts in MuJoCo

43 (+5%)

bsd-3-clause

Max-We/Tetris-Gymnasium

A fully configurable Gymnasium compatible Tetris environment

25 (+4%)

openpsi-project/ReaLHF

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

153 (+4%)

apache-2.0

gbattocletti-riccardoUrb/rl-exploration-for-uavs

Reinforcement Learning-based exploration algorithm for a fleet of UAVs in an unknown environment.

52 (+4%)

flatland-association/flatland-rl

The Flatland Framework is a multi-purpose environment to tackle problems around resilient resource allocation under uncertainty. It is designed to be a flexible and method agnostic to solve a wide ran...

27 (+4%)

mit

SMARTlab-Purdue/SAN-NaviSTAR

55 (+4%)

mit

real-stanford/umi-on-legs

UMI on Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers

239 (+3%)

mit

PufferAI/PufferLib

Simplifying reinforcement learning for complex game environments

1,406 (+3%)

mit

ComputationalPsychiatry/HierarchicalGaussianFiltering.jl

The Julia implementation of the generalised hierarchical Gaussian filter

30 (+3%)

FilippoAiraldi/mpc-reinforcement-learning

Reinforcement Learning with Model Predictive Control

315 (+3%)

mit

ShengrenHou/Optimal-Energy-System-Scheduling-Combining-Mixed-Integer-Programming-and-Deep-Reinforcement-Learning

The Source code for paper "Optimal Energy System Scheduling Combining Mixed-Integer Programming and Deep Reinforcement Learning". Safe reinforcement learning, energy management

99 (+3%)

mit

ammarhydr/SAC-Lagrangian

PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm

33 (+3%)

MaximeVandegar/Papers-in-100-Lines-of-Code

Implementation of papers in 100 lines of code.

1,345 (+3%)

mit

FangjianLi/Driving_simulator_for_RL_IL

A driving simulator built specifically for reinforcement learning/imitation learning. The scenario generator is based on highway-env, and 3-D animation is based on Carla

34 (+3%)

Last month (new repositories)

no newly created repositories trending in the last month

Last month (absolute gain)

labmlai/annotated_deep_learning_paper_implementations

57,297 (+910)

mit

MaximeVandegar/Papers-in-100-Lines-of-Code

Implementation of papers in 100 lines of code.

1,345 (+573)

mit

OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

3,244 (+544)

apache-2.0

hijkzzz/Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

5,726 (+502)

apache-2.0

ray-project/ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

34,556 (+482)

apache-2.0

AI4Finance-Foundation/FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

14,438 (+384)

mit

rl-tools/rl-tools

The Fastest Deep Reinforcement Learning Library

686 (+341)

mit

ddbourgin/numpy-ml

Machine learning, in numpy

15,807 (+309)

gpl-3.0

Developer-Y/cs-video-courses

List of Computer Science courses with video lectures.

67,572 (+308)

XinJingHao/DRL-Pytorch

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

1,662 (+278)

Farama-Foundation/Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

7,662 (+262)

mit

MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

4,137 (+245)

d2l-ai/d2l-en

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

24,265 (+244)

datawhalechina/leedl-tutorial

《李宏毅深度学习教程》（李宏毅老师推荐👍，苹果书🍎），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases

14,077 (+229)

datawhalechina/easy-rl

强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/

9,764 (+216)

vwxyzjn/cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

5,928 (+211)

DLR-RM/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

9,362 (+199)

mit

mohmdelsayed/streaming-drl

Deep reinforcement learning without experience replay, target networks, or batch updates.

186 (+176)

PufferAI/PufferLib

Simplifying reinforcement learning for complex game environments

1,406 (+135)

mit

Unity-Technologies/ml-agents

17,357 (+133)

Last month (relative gain)

mohmdelsayed/streaming-drl

Deep reinforcement learning without experience replay, target networks, or batch updates.

186 (+1,760%)

TokyoRobotics/torobo_mujoco

Torobo models and example scripts in MuJoCo

43 (+153%)

bsd-3-clause

nshen7/alpha-gfn

A deep reinforcement learning framework for generating formulaic alpha factors for quantitative investment, powered by GFlowNet, implemented in Python&PyTorch.

26 (+136%)

rl-tools/rl-tools

The Fastest Deep Reinforcement Learning Library

686 (+99%)

mit

MaximeVandegar/Papers-in-100-Lines-of-Code

Implementation of papers in 100 lines of code.

1,345 (+74%)

mit

mikelma/craftium

A framework for creating rich, 3D, Minecraft-like single and multi-agent environments for AI research based on Minetest

50 (+43%)

jolle-ag/qdx

Quantum error correction code AI-discovery with Jax

27 (+42%)

mit

priest-yang/Deep-Tracking-Control

DTC: Deep Tracking Control

25 (+32%)

rl-language/rlc

Bringing reinforcement learning to every day programmers

48 (+30%)

apache-2.0

dunnolab/xland-minigrid

JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️

255 (+25%)

apache-2.0

openpsi-project/ReaLHF

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

153 (+24%)

apache-2.0

SatCom-TELMA/MA-DRL_Routing_Simulator

Multi-Agent Deep Reinforcement Learning (MA-DRL) Routing Simulator for satellite networks

47 (+24%)

yihedeng9/rlhf-summary-notes

A brief and partial summary of RLHF algorithms.

84 (+24%)

OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

3,244 (+20%)

apache-2.0

XinJingHao/DRL-Pytorch

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

1,662 (+20%)

nomihq/nomi

Nomi enable people to use computer more simply.

72 (+20%)

mit

austin-starks/Deep-RL-Stocks

Reinforcement Learning for Stock Market Prediction

73 (+20%)

Max-We/Tetris-Gymnasium

A fully configurable Gymnasium compatible Tetris environment

25 (+19%)

real-stanford/umi-on-legs

UMI on Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers

239 (+18%)

mit

nissymori/JAX-CORL

Clean single-file implementation of offline RL algorithms in JAX

113 (+18%)

mit

Last 12-months (new repositories)

hijkzzz/Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

5,726

apache-2.0

alessiodm/drl-zh

Deep Reinforcement Learning: Zero to Hero!

2,032

mit

eloialonso/diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

1,642

mit

AgibotTech/agibot_x1_train

The reinforcement learning training code for AgiBot X1.

1,194

metauto-ai/GPTSwarm

🐝 GPTSwarm: LLM agents as (Optimizable) Graphs

707

mit

MarkFzp/humanplus

[CoRL 2024] HumanPlus: Humanoid Shadowing and Imitation from Humans

607

DmitryRyumin/AAAI-2024-Papers

AAAI 2024 Papers: Explore a comprehensive collection of innovative research papers presented at one of the premier artificial intelligence conferences. Seamlessly integrate code implementations for be...

466

mit

simpler-env/SimplerEnv

Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)

375

mit

EdanToledo/Stoix

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

252

apache-2.0

real-stanford/umi-on-legs

UMI on Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers

239

mit

chengxuxin/expressive-humanoid

[RSS 2024]: Expressive Whole-Body Control for Humanoid Robots

225

MichaelTMatthews/Craftax

(Crafter + NetHack) in JAX. ICML 2024 Spotlight.

221

mit

mihirp1998/VADER

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various rewa...

221

Teddy-Liao/walk-these-ways-go2

Deploy walk-these-ways project on Unitree Go2

219

mit

shure-dev/Awesome-LLM-Papers-Comprehensive-Topics

Awesome LLM Papers and repos on very comprehensive topics.

195

mohmdelsayed/streaming-drl

Deep reinforcement learning without experience replay, target networks, or batch updates.

186

Zihao-Howard-Zhou/Simulation-Platform-for-UAV-network

This Python-based simulation platform can realistically model various components of the UAV network, including the network layer, MAC layer and physical layer, as well as the UAV mobility model, energ...

167

mit

ucd-dare/CarDreamer

World Model based Autonomous Driving Platform in CARLA :car:

164

nicklashansen/puppeteer

Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"

158

mit

ZhengyiLuo/PULSE

Official Implementation of the ICLR 2024 spotlight paper: Universal Humanoid Motion Representations for Physics-Based Control

153

Last 12-months (absolute gain)

labmlai/annotated_deep_learning_paper_implementations

57,297 (+16,660)

mit

hijkzzz/Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

5,726 (+5,592)

apache-2.0

datawhalechina/leedl-tutorial

《李宏毅深度学习教程》（李宏毅老师推荐👍，苹果书🍎），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases

14,077 (+5,530)

ray-project/ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

34,556 (+5,416)

apache-2.0

Developer-Y/cs-video-courses

List of Computer Science courses with video lectures.

67,572 (+4,975)

AI4Finance-Foundation/FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

14,438 (+4,497)

mit

d2l-ai/d2l-en

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

24,265 (+4,008)

Farama-Foundation/Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

7,662 (+3,112)

mit

owainlewis/awesome-artificial-intelligence

A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers.

11,078 (+2,701)

OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

3,244 (+2,664)

apache-2.0

MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

4,137 (+2,557)

eugeneyan/applied-ml

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

27,432 (+2,357)

mit

DLR-RM/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

9,362 (+2,268)

mit

datawhalechina/easy-rl

强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/

9,764 (+2,241)

vwxyzjn/cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

5,928 (+2,118)

alessiodm/drl-zh

Deep Reinforcement Learning: Zero to Hero!

2,032 (+2,031)

mit

wandb/wandb

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

9,284 (+1,674)

mit

eloialonso/diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

1,642 (+1,641)

mit

Unity-Technologies/ml-agents

17,357 (+1,563)

xlang-ai/OSWorld

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

1,471 (+1,470)

apache-2.0

Last 12-months (relative gain)

hijkzzz/Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

5,726 (+4,173%)

apache-2.0

MarkFzp/humanplus

[CoRL 2024] HumanPlus: Humanoid Shadowing and Imitation from Humans

607 (+2,790%)

ucd-dare/CarDreamer

World Model based Autonomous Driving Platform in CARLA :car:

164 (+2,633%)

corl-team/headless-ad

Official Implementation for "In-Context Reinforcement Learning for Variable Action Spaces"

78 (+1,850%)

ligengen/EgoGen

[CVPR 2024] Official code for EgoGen: An Egocentric Synthetic Data Generator

74 (+1,750%)

apache-2.0

real-stanford/umi-on-legs

UMI on Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers

239 (+1,738%)

mit

austin-starks/Deep-RL-Stocks

Reinforcement Learning for Stock Market Prediction

73 (+1,725%)

PufferAI/PufferLib

Simplifying reinforcement learning for complex game environments

1,406 (+1,703%)

mit

mazpie/genrl

[NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning them into sequences of latent world model states. Latent state se...

63 (+1,475%)

mit

ai4co/reevo

[NeurIPS 2024] ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution

138 (+1,433%)

mit

chengxuxin/expressive-humanoid

[RSS 2024]: Expressive Whole-Body Control for Humanoid Robots

225 (+1,400%)

rl-tools/rl-tools

The Fastest Deep Reinforcement Learning Library

686 (+1,219%)

mit

mikelma/craftium

A framework for creating rich, 3D, Minecraft-like single and multi-agent environments for AI research based on Minetest

50 (+1,150%)

lucidrains/SAC-pytorch

Implementation of Soft Actor Critic and some of its improvements in Pytorch

46 (+1,050%)

mit

Waterkin/stock-top-papers

Top paper collection for stock price prediction, quantitative trading. Covering top conferences and journals like KDD, WWW, CIKM, AAAI, IJCAI, ACL, EMNLP.

218 (+1,047%)

apache-2.0

chandar-lab/Recall2Imagine

Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024

57 (+1,040%)

mit

yihedeng9/rlhf-summary-notes

A brief and partial summary of RLHF algorithms.

84 (+950%)

WoodOxen/tactics2d

Tactics2D: A Reinforcement Learning Environment Library with Generative Scenarios for Driving Decision-making

175 (+821%)

gpl-3.0

alberto-mate/CARLA-SB3-RL-Training-Environment

🚗 This repository offers a ready-to-use training and evaluation environment for conducting various experiments using Deep Reinforcement Learning (DRL) in the CARLA simulator with the help of Stable B...

64 (+814%)

rickstaa/ros-gazebo-gym

Framework for integrating ROS and Gazebo with gymnasium, streamlining the development and training of RL algorithms in realistic robot simulations.

35 (+775%)

mit