Iterated Q-Network: Beyond One-Step Bellman Updates in Deep Reinforcement Learning
Paper
•
2403.02107
•
Published
i-DQN and i-IQN
This repository contains the model parameters trained with i-DQN on 56 Atari games and trained with i-IQN on 20 Atari games 🎮 5 seeds are available for each configuration which makes a total of 380 available models 📈
The evaluate.ipynb notebook contains a minimal example to evaluate to model parameters 🧑🏫 It uses JAX 🚀 The hyperparameters used during training are reported in config.json 🔧
To the training code 👉💻
ps: The set of 20 Atari games is included in the set of 56 Atari games.
i-DQN and i-IQN are improvements of DQN and IQN. Published at TMLR✨ List of games trained with Alien, Amidar, Assault, Asterix, Asteroids, Atlantis, BankHeist, BattleZone, BeamRider, Berzerk, Bowling, Boxing, Breakout, Centipede, ChopperCommand, CrazyClimber, DemonAttack, DoubleDunk, Enduro, FishingDerby, Freeway, Frostbite, Gopher, Gravitar, Hero, IceHockey, Jamesbond, Kangaroo, Krull, KungFuMaster, MontezumaRevenge, MsPacman, NameThisGame, Phoenix, Pitfall, Pong, Pooyan, PrivateEye, Qbert, Riverraid, RoadRunner, Robotank, Seaquest, Skiing, Solaris, SpaceInvaders, StarGunner, Tennis, TimePilot, Tutankham, UpNDown, Venture, VideoPinball, WizardOfWor, YarsRevenge, Zaxxon. |
![]() |
|---|
Python 3.10 is recommended. Create a Python virtual environment, activate it, update pip and install the package and its dependencies in editable mode:
python3.10 -m venv env
source env/bin/activate
pip install --upgrade pip
pip install numpy==1.23.5 # to avoid numpy==2.XX
pip install -r requirements.txt
pip install --upgrade "jax[cuda12_pip]==0.4.13" -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html
iterated Q-Network
@article{vincent2024iterated,
title={Iterated $ Q $-Network: Beyond the One-Step Bellman Operator},
author={Vincent, Th{\'e}o and Palenicek, Daniel and Belousov, Boris and Peters, Jan and D'Eramo, Carlo},
journal={Transactions on Machine Learning Research},
year={2025}
}