43 results found Sort:

1.9k
9.5k
other
79
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
Created 2020-07-03
1,194 commits to master branch, last one 13 days ago
The most simple, flexible, and comprehensive OpenAI Gym trading environment (Approved by OpenAI Gym)
Created 2019-09-16
42 commits to master branch, last one about a year ago
172
1.4k
unknown
9
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
Created 2021-11-14
104 commits to main branch, last one 29 days ago
An introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.
Created 2016-05-08
206 commits to master branch, last one about a year ago
Stock Trading Bot using Deep Q-Learning
Created 2018-08-13
31 commits to master branch, last one 4 years ago
Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog
Created 2016-12-09
89 commits to master branch, last one about a year ago
104
521
unknown
40
Arnold - DOOM Agent
Created 2017-11-14
3 commits to master branch, last one 6 years ago
126
485
other
50
DEEp Reinforcement learning framework
Created 2016-01-21
489 commits to master branch, last one 6 months ago
A framework where a deep Q-Learning Reinforcement Learning agent tries to choose the correct traffic light phase at an intersection to maximize traffic efficiency.
Created 2019-03-17
51 commits to master branch, last one 3 years ago
Implementing Reinforcement Learning, namely Q-learning and Sarsa algorithms, for global path planning of mobile robot in unknown environment with obstacles. Comparison analysis of Q-learning and Sarsa
Created 2018-04-26
196 commits to master branch, last one 3 years ago
playing idealized trading games with deep reinforcement learning
Created 2018-02-25
31 commits to master branch, last one 3 years ago
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Created 2020-10-02
23 commits to master branch, last one 3 years ago
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind
Created 2023-09-20
113 commits to main branch, last one 5 days ago
52
343
mit
17
Play Google Chrome's T-rex game with TensorFlow
Created 2017-03-04
70 commits to master branch, last one 5 years ago
The purpose of this repository is to make prototypes as case study in the context of proof of concept(PoC) and research and development(R&D) that I have written in my website. The main research topics...
Created 2017-09-15
1,863 commits to master branch, last one 11 months ago
Free Resources For Data Science created by Shubham Kumar
Created 2019-02-10
93 commits to master branch, last one 4 years ago
26
271
mit
13
A framework for multi-agent reinforcement learning.
Created 2020-10-06
145 commits to master branch, last one 2 years ago
A deep reinforcement learning bot that plays tetris
Created 2019-07-23
13 commits to master branch, last one 2 months ago
51
205
unknown
29
Forex trading simulator environment for OpenAI Gym, observations contain the order status, performance and timeseries loaded from a CSV file containing rates and indicators. Work In Progress
Created 2017-02-21
830 commits to master branch, last one about a year ago
Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"
Created 2022-06-05
17 commits to main branch, last one about a year ago
Using tabular and deep reinforcement learning methods to infer optimal market making strategies
Created 2022-05-20
109 commits to main branch, last one about a year ago
Implementation of the Llama architecture with RLHF + Q-learning
Created 2023-11-23
19 commits to main branch, last one 11 months ago
13
116
apache-2.0
4
Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.
Created 2021-03-21
24 commits to main branch, last one 3 years ago
(Experimental, a lot of bugs) Automatic fingering generator for piano scores, determining optimal fingering using Model-Based Reinforcement Learning, written in the Julia language.
This repository has been archived (exclude archived)
Created 2023-04-21
20 commits to main branch, last one about a year ago