Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Set of hyper-parameters to reproduce LOLA DICE #11

Open
Manuscrit opened this issue Nov 7, 2020 · 0 comments
Open

Set of hyper-parameters to reproduce LOLA DICE #11

Manuscrit opened this issue Nov 7, 2020 · 0 comments

Comments

@Manuscrit
Copy link

Are the current default hyper-parameters the one used to produce the results of the DICE paper?
Current default HP are (from scripts/run_lola_dice.py):

batch-size=64
runs=5
epochs=200
use_dice=True

gamma=.96,
lr_inner=.1,
lr_outer=.2,
lr_value=.1,
lr_om=.1,
inner_asymm=True,
n_agents=2,
n_inner_steps=2,
value_batch_size=16,
value_epochs=0,
om_batch_size=16,
om_epochs=0,
use_baseline=False,

Or should we use the default from lola_dice/rpg.py?

epochs=100,
gamma=.96,
lr_inner=1.,          # lr for the inner loop steps
lr_outer=1.,          # lr for the outer loop steps
lr_value=.1,          # lr for the value function estimator
lr_om=.1,             # lr for opponent modeling
n_agents=2,
n_inner_steps=1,
inner_asymm=True,
om_batch_size=64,     # batch size used for fitting opponent models
om_epochs=5,          # epochs per iteration to fit opponent models
value_batch_size=64,  # batch size used for fitting the values
value_epochs=5,       # epochs per iteration to fit value functions
use_baseline=True,
use_dice=True,
use_opp_modeling=False,
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant