Skip to content

play game with reinforcement learning, game theory

Notifications You must be signed in to change notification settings

achao2013/game-sky

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Game-Sky

This is a resource about game playing with reinforcement learning, game theory and so on.

This project will gather newest papers and classify them gradually.(f.e. neurips 2018)

It will be updated continuously, welcome to contribute to us!.

reinforcement learning base

Theory

Tutorials

platforms

code

demos

Applications

counterfactual regret mimization(game theory)

focus on texas hold'em and so on

cfr

open source library

cppcfr

deep reinforcement learning games

include alpha go series

deep-rl

newest papers(wait to reasign)

NeurIPS 2018

model-based

  • Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion. paper
  • Data-efficient model-based reinforcement learning with deep probabilistic dynamics models. paper
  • Iterative Value-Aware Model Learning. paper
  • Data center cooling using model-predictive control. paper

model free

  • Breaking the Curse of Horizon: Infinite-Horizon Off-Policy Estimation. paper
  • Representation Balancing MDPs for Off-Policy Policy Evaluation. paper
  • Simple random search provides a competitive approach to reinforcement learning. paper

value function approximation

  • Non-delusional Q-learning and value iteration. paper

policy gradient

  • Actor-Critic Policy Optimization in Partially Observable Multiagent Environments. paper
  • Learning Temporal Point Processes via Reinforcement Learning. paper
  • Memory Augmented Policy Optimization for Program Synthesis and Semantic Parsing. paper code

Hierarchical

  • Flexible Neural Representation for Physics Prediction. paper
  • Data-Efficient Hierarchical Reinforcement Learning. paper
  • Learning Abstract Options. paper

Intergrating learning and planing

  • Dual Policy Iteration. paper
  • differentiable mpc for end-to-end planning and control. paper
  • Learning Plannable Representations with Causal InfoGAN. paper

Imitation rl

  • Multi-Agent Generative Adversarial Imitation Learning. paper

Inverse reinforcement learning

  • an event-based framework for task specification and control. paper

exploration

  • Context-Dependent Upper-Confidence Bounds for Directed Exploration. paper
  • Playing hard exploration games by watching YouTube. paper

CV related

  • Unsupervised Video Object Segmentation for Deep Reinforcement Learning. paper
  • Hybrid Retrieval-Generation Reinforced Agent for Medical Image Report Generation. paper
  • Visual Reinforcement Learning with Imagined Goals. paper
  • Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding. paper

meta-learning

  • Meta-Reinforcement Learning of Structured Exploration Strategies. paper
  • Evolved Policy Gradients. paper
  • Neural Arithmetic Logic Units. paper
  • Meta-Learning MCMC Proposals. paper
  • Probabilistic Model-Agnostic Meta-Learning. paper
  • Meta-Gradient Reinforcement Learning. paper

Generative Models

  • Deep Generative Models with Learnable Knowledge Constraints. paper
  • Are GANs Created Equal? A Large-Scale Study. paper

safe

  • Learning Safe Policies with Expert Guidance. paper
  • Adversarial Attacks on Stochastic Bandits. paper

multitask

  • Multi-Task Learning as Multi-Objective Optimization. paper
  • Learning to Multitask. paper

Interpretability

  • Human-in-the-Loop Interpretability Prior. paper
  • Towards Robust Interpretability with Self-Explaining Neural Networks. paper

undeterminded

  • end-to-end differentiable physics for learning and control. paper code
  • Recurrent World Models Facilitate Policy Evolution. paper
  • Learning to Play with Intrinsically-Motivated Self-Aware Agents. paper
  • Reward learning from human preferences and demonstrations in Atari. paper
  • On Learning Intrinsic Rewards for Policy Gradient Methods. paper
  • DeepProbLog: Neural Probabilistic Logic Programming. paper
  • Scalable End-to-End Autonomous Vehicle Testing via Rare-event Simulation. paper
  • Relational recurrent neural networks. paper code
  • How Does Batch Normalization Help Optimization? paper
  • Randomized Prior Functions for Deep Reinforcement Learning. paper
  • Transfer Learning with Neural AutoML. paper
  • Neural Guided Constraint Logic Programming for Program Synthesis. paper code

About

play game with reinforcement learning, game theory

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published