Skip to content

bmabsout/AQS

Repository files navigation

EXPRESSIVE REINFORCEMENT LEARNING VIA ALGEBRAIC Q-VALUE SCALARIZATION

ant_walking_resized.webm

We can learn controllers such as for Hopper-v4 in 7000 steps!

Screencast.from.2024-09-12.19-47-55.webm

Installation

Via nix

Via Conda

  • conda env create --file environment.yml
  • conda activate cmorl_env

Run training

python envs/Pendulum/train_pendulum.py, this uses ddpg to train the algorithm and produces automatically checkpoints and logs in the trained folder python envs/Pendulum/test_pendulum.py -lr training

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published