Skip to content

Task-aware world model learning with meta weighting via bi-level optimization

License

Notifications You must be signed in to change notification settings

deng-ai-lab/TEMPO

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Task-aware world model learning
with meta weighting via bi-level optimization
(NeurIPS 2023)

This is a implementation of Task-aware Environment Modeling Pipline with Bi-level Optimization (TEMPO) built on top of the official DreamerV2 code.

If you find this code useful, please reference our paper in your work:

@inproceedings{
yuan2023taskaware,
title={Task-aware world model learning with meta weighting via bi-level optimization},
author={Huining Yuan and Hongkun Dou and Xingyu Jiang and Yue Deng},
booktitle={Thirty-seventh Conference on Neural Information Processing Systems},
year={2023},
url={https://openreview.net/forum?id=IN3hQx1BrC}
}

Dependencies

A conda environment configuration of our experiments is provided in env_config.yml.

Core dependencies can be installed using pip:

tensorflow==2.6.0
tensorflow-probability==0.14.0
ruamel-yaml==0.17.21
gym==0.19.0
atari-py==0.2.9
dm-control==1.0.11

Instructions

This code work in a similar fashion to DreamerV2. Our main modifications are concentrated in agent.py and common/nets.py.

Default hyperparameters for our experiments are provided in config.yaml. By default, the code evaluates agent's performance with one episode every 1e5 training steps for Atari and 1e4 training steps for DMC.

Train on Atari:

python3 train.py --logdir ./logs/atari_pong --configs atari --task atari_pong

Train on DMC:

python3 train.py --logdir ./logs/dmc_walker_walk --configs dmc_vision --task dmc_walker_walk

Monitor results:

tensorboard --logdir ./logs/atari_pong --samples_per_plugin "images=1000"

Results

Results on DMC:

Results on Atari:

About

Task-aware world model learning with meta weighting via bi-level optimization

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages