Set of hyper-parameters to reproduce LOLA DICE #11

Manuscrit · 2020-11-07T10:34:42Z

Are the current default hyper-parameters the one used to produce the results of the DICE paper?
Current default HP are (from scripts/run_lola_dice.py):

batch-size=64
runs=5
epochs=200
use_dice=True

gamma=.96,
lr_inner=.1,
lr_outer=.2,
lr_value=.1,
lr_om=.1,
inner_asymm=True,
n_agents=2,
n_inner_steps=2,
value_batch_size=16,
value_epochs=0,
om_batch_size=16,
om_epochs=0,
use_baseline=False,

Or should we use the default from lola_dice/rpg.py?

epochs=100,
gamma=.96,
lr_inner=1.,          # lr for the inner loop steps
lr_outer=1.,          # lr for the outer loop steps
lr_value=.1,          # lr for the value function estimator
lr_om=.1,             # lr for opponent modeling
n_agents=2,
n_inner_steps=1,
inner_asymm=True,
om_batch_size=64,     # batch size used for fitting opponent models
om_epochs=5,          # epochs per iteration to fit opponent models
value_batch_size=64,  # batch size used for fitting the values
value_epochs=5,       # epochs per iteration to fit value functions
use_baseline=True,
use_dice=True,
use_opp_modeling=False,

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Set of hyper-parameters to reproduce LOLA DICE #11

Set of hyper-parameters to reproduce LOLA DICE #11

Manuscrit commented Nov 7, 2020

Set of hyper-parameters to reproduce LOLA DICE #11

Set of hyper-parameters to reproduce LOLA DICE #11

Comments

Manuscrit commented Nov 7, 2020