Skip to content

Paper Collection of Reinforcement Learning Exploration covers Exploration of Muti-Arm-Bandit, Reinforcement Learning and Multi-agent Reinforcement Learning.

Notifications You must be signed in to change notification settings

apexrl/RL-Exploration-Paper-Lists

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

39 Commits
 
 

Repository files navigation

RL-Exploration-Paper-Lists

Paper Collection of Reinforcement Learning Exploration covers Exploration of Muti-Arm-Bandit, Reinforcement Learning and Multi-agent Reinforcement Learning.

Exploration Reinforcement Learning is an important topic in Reinforcement Learning research area, which is to essentially improve the sample efficiency in a MDP setting. Naive survey Slides and Document(Chinese) of Exploration problem are available.

A simple form of Exploration-exploitation dilemma can be seen from the Multi-Arm Bandit problems, and we include MAB papers because many theoretical idea can be drived from MAB studies.

In the early stage, most Exploration study focus on the sample efficiency of specific algorithm, many of them design different Exploration bonus to lead agents explore sufficient trajectories.

Recently, most DRL Exploration researches are focus on sparse reward settings and the target is a little different with the former studies, however we still classify those methods based on their methodology.

Many learning algorithm also consider the problem of efficient Exploration so that we also contain such work.

Exploration is a big topic so this paper list is just a scratch. Collected papers are sorted by time and classification. Any suggestions and pull requests are welcome.

The sharing principle of these references here is for research. If any authors do not want their paper to be listed here, please feel free to contact me (Email: ericliuof97 [AT] gmail.com).

Overview

MAB Exploration

MAB Review Papers

Decaying Parameter

Provable Algorithms

Beyesian Bandit

Gittin Indices

Comparative Experiments

RL Exploration

Reward based Exploration (Intrinsic Reward \ Exploration Bonus \ Surprise \ Curiosity \ Uncertainty)

Intrinsic Reward Review Papers

Counts based Methods

PAC-MDP
Beyesian Reinforcement Learning
Psudo Count
State Representation

Information Theory based Methods

Mutual Information
Information Gain

Prediction / Prediction error based Methods

Policy based Exploration

Search Exploration

Others

MARL Exploration

Coordinated Exploration

About

Paper Collection of Reinforcement Learning Exploration covers Exploration of Muti-Arm-Bandit, Reinforcement Learning and Multi-agent Reinforcement Learning.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published