Task-aware world model learning
with meta weighting via bi-level optimization
(NeurIPS 2023)

This is a implementation of Task-aware Environment Modeling Pipline with Bi-level Optimization (TEMPO) built on top of the official DreamerV2 code.

If you find this code useful, please reference our paper in your work:

@inproceedings{
yuan2023taskaware,
title={Task-aware world model learning with meta weighting via bi-level optimization},
author={Huining Yuan and Hongkun Dou and Xingyu Jiang and Yue Deng},
booktitle={Thirty-seventh Conference on Neural Information Processing Systems},
year={2023},
url={https://openreview.net/forum?id=IN3hQx1BrC}
}

Dependencies

A conda environment configuration of our experiments is provided in env_config.yml.

Core dependencies can be installed using pip:

tensorflow==2.6.0
tensorflow-probability==0.14.0
ruamel-yaml==0.17.21
gym==0.19.0
atari-py==0.2.9
dm-control==1.0.11

Instructions

This code work in a similar fashion to DreamerV2. Our main modifications are concentrated in agent.py and common/nets.py.

Default hyperparameters for our experiments are provided in config.yaml. By default, the code evaluates agent's performance with one episode every 1e5 training steps for Atari and 1e4 training steps for DMC.

Train on Atari:

python3 train.py --logdir ./logs/atari_pong --configs atari --task atari_pong

Train on DMC:

python3 train.py --logdir ./logs/dmc_walker_walk --configs dmc_vision --task dmc_walker_walk

Monitor results:

tensorboard --logdir ./logs/atari_pong --samples_per_plugin "images=1000"

Results

Results on DMC:

Results on Atari:

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
common		common
figs		figs
LICENSE		LICENSE
README.md		README.md
agent.py		agent.py
api.py		api.py
configs.yaml		configs.yaml
env_configs.yml		env_configs.yml
expl.py		expl.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Task-aware world model learning
with meta weighting via bi-level optimization
(NeurIPS 2023)

Dependencies

Instructions

Results

About

Releases

Packages

Languages

License

deng-ai-lab/TEMPO

Folders and files

Latest commit

History

Repository files navigation

Task-aware world model learning with meta weighting via bi-level optimization (NeurIPS 2023)

Dependencies

Instructions

Results

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Task-aware world model learning
with meta weighting via bi-level optimization
(NeurIPS 2023)

Packages