21 results found Sort:
- Filter by Primary Language:
- Python (8)
- C++ (4)
- Jupyter Notebook (4)
- Julia (2)
- HTML (1)
- Makefile (1)
- +
VIP cheatsheets for Stanford's CS 221 Artificial Intelligence
Created
2019-05-24
41 commits to master branch, last one 5 years ago
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
ppo
drqn
trpo
sarsa
double-dqn
openai-gym
q-learning
dueling-dqn
monte-carlo
deep-q-network
policy-gradient
policy-gradients
reinforcement-learning
deep-learning-algorithms
deep-recurrent-q-network
markov-decision-processes
deep-reinforcement-learning
hindsight-experience-replay
deep-deterministic-policy-gradient
asynchronous-advantage-actor-critic
Created
2018-06-11
44 commits to master branch, last one 4 years ago
MDPs and POMDPs in Julia - An interface for defining, solving, and simulating fully and partially observable Markov decision processes on discrete and continuous spaces.
Created
2015-06-23
887 commits to master branch, last one 2 months ago
A C++ framework for MDPs and POMDPs with Python bindings
Created
2013-11-16
1,750 commits to master branch, last one about a year ago
Curso de Álgebra Lineal
Created
2018-12-03
113 commits to master branch, last one 2 years ago
Extensible Combinatorial Optimization Learning Environments
Created
2019-10-18
1,510 commits to master branch, last one 2 years ago
A JuMP extension for Stochastic Dual Dynamic Programming
Created
2017-04-10
697 commits to master branch, last one 27 days ago
A framework to build and solve POMDP problems. Documentation: https://h2r.github.io/pomdp-py/
Created
2019-09-22
369 commits to main branch, last one 8 months ago
An Automata Learning Library Written in Python
Created
2021-03-18
930 commits to master branch, last one 27 days ago
A research platform to develop automated security policies using quantitative methods, e.g., optimal control, computational game theory, reinforcement learning, optimization, evolutionary methods, and...
Created
2020-09-07
3,898 commits to master branch, last one 17 days ago
🌲 Stanford CS 228 - Probabilistic Graphical Models
Created
2018-12-26
50 commits to master branch, last one 4 months ago
A QoE-Oriented Computation Offloading Algorithm based on Deep Reinforcement Learning for Mobile Edge Computing
dqn
mdp
mec
d3qn
ddqn
qeco
dueling-ddqn
lstm-networks
deep-q-network
edge-computing
qoe-measurements
network-performance
resource-management
network-optimization
double-deep-q-network
mobile-edge-computing
computation-offloading
performance-evaluation
markov-decision-processes
deep-reinforcement-learning
Created
2023-07-31
153 commits to main branch, last one a day ago
WrightEagle Base Code for RoboCup Soccer Simulation 2D
Created
2015-09-09
103 commits to master branch, last one 2 years ago
Framework for the simulation and estimation of some finite-horizon discrete choice dynamic programming models.
Created
2016-04-25
3,177 commits to main branch, last one about a year ago
Online algorithms for solving large-scale dynamic vehicle routing problems with stochastic requests
Created
2022-02-07
89 commits to main branch, last one 2 years ago
Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a p...
Created
2019-10-10
29 commits to master branch, last one 9 months ago
Implementation of Tsallis Actor Critic method
Created
2018-11-21
346 commits to master branch, last one 2 months ago
AWS Last Mile Route Sequence Optimization
Created
2022-04-26
44 commits to main branch, last one 6 months ago
Reinforcement Learning Short Course
offline-rl
q-learning
ridesharing
deep-q-network
model-based-rl
policy-gradient
value-iteration
policy-iteration
fitted-q-iteration
dynamic-programming
monte-carlo-methods
policy-based-method
off-policy-evaluation
reinforcement-learning
markov-decision-processes
order-dispatch-recommendation
temporal-differencing-learning
Created
2023-02-07
90 commits to main branch, last one 14 days ago
My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow
Created
2018-05-10
710 commits to master branch, last one 6 years ago
High Performance Map Matching with Markov Decision Processes (MDPs) and Hidden Markov Models (HMMs).
Created
2021-10-12
125 commits to main branch, last one 21 days ago