Experiments used in Average-Reward Soft Actor-Critic by Jacob Adamczyk, Volodymyr Makarenko, Stas Tiomkin, and Rahul V. Kulkarni. Environments: Gridworlds, Gymnasium's classic control and Mujoco.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results