Game-Sky

NeurIPS 2018

model-based

Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion. paper
Data-efficient model-based reinforcement learning with deep probabilistic dynamics models. paper
Iterative Value-Aware Model Learning. paper
Data center cooling using model-predictive control. paper

model free

Breaking the Curse of Horizon: Infinite-Horizon Off-Policy Estimation. paper
Representation Balancing MDPs for Off-Policy Policy Evaluation. paper
Simple random search provides a competitive approach to reinforcement learning. paper

value function approximation

Non-delusional Q-learning and value iteration. paper

policy gradient

Actor-Critic Policy Optimization in Partially Observable Multiagent Environments. paper
Learning Temporal Point Processes via Reinforcement Learning. paper
Memory Augmented Policy Optimization for Program Synthesis and Semantic Parsing. paper code

Hierarchical

Flexible Neural Representation for Physics Prediction. paper
Data-Efficient Hierarchical Reinforcement Learning. paper
Learning Abstract Options. paper

Integrating learning and planning

Dual Policy Iteration. paper
Differentiable MPC for End-to-end Planning and Control. paper
Learning Plannable Representations with Causal InfoGAN. paper

Imitation rl

Multi-Agent Generative Adversarial Imitation Learning. paper

Inverse reinforcement learning

An Event-Based Framework for Task Specification and Control. paper

exploration

Context-Dependent Upper-Confidence Bounds for Directed Exploration. paper
Playing hard exploration games by watching YouTube. paper

CV related

Unsupervised Video Object Segmentation for Deep Reinforcement Learning. paper
Hybrid Retrieval-Generation Reinforced Agent for Medical Image Report Generation. paper
Visual Reinforcement Learning with Imagined Goals. paper
Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding. paper

meta-learning

Meta-Reinforcement Learning of Structured Exploration Strategies. paper
Evolved Policy Gradients. paper
Neural Arithmetic Logic Units. paper
Meta-Learning MCMC Proposals. paper
Probabilistic Model-Agnostic Meta-Learning. paper
Meta-Gradient Reinforcement Learning. paper

Generative Models

Deep Generative Models with Learnable Knowledge Constraints. paper
Are GANs Created Equal? A Large-Scale Study. paper

safe

Learning Safe Policies with Expert Guidance. paper
Adversarial Attacks on Stochastic Bandits. paper

multitask

Multi-Task Learning as Multi-Objective Optimization. paper
Learning to Multitask. paper

Interpretability

Human-in-the-Loop Interpretability Prior. paper
Towards Robust Interpretability with Self-Explaining Neural Networks. paper

undetermined

End-to-End Differentiable Physics for Learning and Control. paper code
Recurrent World Models Facilitate Policy Evolution. paper
Learning to Play with Intrinsically-Motivated Self-Aware Agents. paper
Reward learning from human preferences and demonstrations in Atari. paper
On Learning Intrinsic Rewards for Policy Gradient Methods. paper
DeepProbLog: Neural Probabilistic Logic Programming. paper
Scalable End-to-End Autonomous Vehicle Testing via Rare-event Simulation. paper
Relational recurrent neural networks. paper code
How Does Batch Normalization Help Optimization? paper
Randomized Prior Functions for Deep Reinforcement Learning. paper
Transfer Learning with Neural AutoML. paper
Neural Guided Constraint Logic Programming for Program Synthesis. paper code

reinforcement learning base

counterfactual regret minimization (game theory)

open source library

deep reinforcement learning games

newest papers (wait to reassign)

NeurIPS 2018

model-based

model free

value function approximation

policy gradient

Hierarchical

Integrating learning and planning

Imitation rl

Inverse reinforcement learning

exploration

CV related

meta-learning

Generative Models

safe

multitask

Interpretability

undetermined


achao2013/game-sky
