tinkoff-ai · vkurenkov · May 23, 2023 · May 21, 2023 · May 22, 2023 · May 23, 2023
diff --git a/README.md b/README.md
@@ -28,16 +28,16 @@ docker run --gpus=all -it --rm --name <container_name> <image_name>
 
 | Algorithm                                                                                                                       | Variants Implemented                               | Wandb Report |
 |---------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------| ----------- |
-| ✅ Behavioral Cloning <br>(BC)                                                                                                   | [`any_percent_bc.py`](algorithms/any_percent_bc.py) |  [`Gym-MuJoCo, Maze2D`](https://wandb.ai/tlab/CORL/reports/BC-D4RL-Results--VmlldzoyNzA2MjE1)
-| ✅ Behavioral Cloning-10% <br>(BC-10%)                                                                                           | [`any_percent_bc.py`](algorithms/any_percent_bc.py) |  [`Gym-MuJoCo, Maze2D`](https://wandb.ai/tlab/CORL/reports/BC-10-D4RL-Results--VmlldzoyNzEwMjcx)
-| ✅ [Conservative Q-Learning for Offline Reinforcement Learning <br>(CQL)](https://arxiv.org/abs/2006.04779)                      | [`cql.py`](algorithms/cql.py)                      | [`Gym-MuJoCo, Maze2D`](https://wandb.ai/tlab/CORL/reports/CQL-D4RL-Results--VmlldzoyNzA2MTk5)
-| ✅ [Accelerating Online Reinforcement Learning with Offline Datasets <br>(AWAC)](https://arxiv.org/abs/2006.09359)               | [`awac.py`](algorithms/awac.py)                    | [`Gym-MuJoCo, Maze2D`](https://wandb.ai/tlab/CORL/reports/AWAC-D4RL-Results--VmlldzoyNzA2MjE3)
-| ✅ [Offline Reinforcement Learning with Implicit Q-Learning <br>(IQL)](https://arxiv.org/abs/2110.06169)                         | [`iql.py`](algorithms/iql.py)                      | [`Gym-MuJoCo, Maze2D`](https://wandb.ai/tlab/CORL/reports/IQL-D4RL-Results--VmlldzoyNzA2MTkx)
-| ✅ [A Minimalist Approach to Offline Reinforcement Learning <br>(TD3+BC)](https://arxiv.org/abs/2106.06860)                      | [`td3_bc.py`](algorithms/td3_bc.py)                | [`Gym-MuJoCo, Maze2D`](https://wandb.ai/tlab/CORL/reports/TD3-BC-D4RL-Results--VmlldzoyNzA2MjA0)
-| ✅ [Decision Transformer: Reinforcement Learning via Sequence Modeling <br>(DT)](https://arxiv.org/abs/2106.01345)               | [`dt.py`](algorithms/dt.py)                        | [`Gym-MuJoCo, Maze2D`](https://wandb.ai/tlab/CORL/reports/DT-D4RL-Results--VmlldzoyNzA2MTk3)
-| ✅ [Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble <br>(SAC-N)](https://arxiv.org/abs/2110.01548)  | [`sac_n.py`](algorithms/sac_n.py)                  | [`Gym-MuJoCo, Maze2D`](https://wandb.ai/tlab/CORL/reports/SAC-N-D4RL-Results--VmlldzoyNzA1NTY1)
-| ✅ [Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble <br>(EDAC)](https://arxiv.org/abs/2110.01548)   | [`edac.py`](algorithms/edac.py)                    | [`Gym-MuJoCo, Maze2D`](https://wandb.ai/tlab/CORL/reports/EDAC-D4RL-Results--VmlldzoyNzA5ODUw)
-| ✅ [Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size <br>(LB-SAC)](https://arxiv.org/abs/2211.11092) | [`lb_sac.py`](algorithms/lb_sac.py)                | [`Gym-MuJoCo`](https://wandb.ai/tlab/CORL/reports/LB-SAC-D4RL-Results--VmlldzozNjIxMDY1)
+| ✅ Behavioral Cloning <br>(BC)                                                                                                   | [`any_percent_bc.py`](algorithms/offline/any_percent_bc.py) |  [`Gym-MuJoCo, Maze2D`](https://wandb.ai/tlab/CORL/reports/BC-D4RL-Results--VmlldzoyNzA2MjE1)
+| ✅ Behavioral Cloning-10% <br>(BC-10%)                                                                                           | [`any_percent_bc.py`](algorithms/offline/any_percent_bc.py) |  [`Gym-MuJoCo, Maze2D`](https://wandb.ai/tlab/CORL/reports/BC-10-D4RL-Results--VmlldzoyNzEwMjcx)
+| ✅ [Conservative Q-Learning for Offline Reinforcement Learning <br>(CQL)](https://arxiv.org/abs/2006.04779)                      | [`cql.py`](algorithms/offline/cql.py)                      | [`Gym-MuJoCo, Maze2D`](https://wandb.ai/tlab/CORL/reports/CQL-D4RL-Results--VmlldzoyNzA2MTk5)
+| ✅ [Accelerating Online Reinforcement Learning with Offline Datasets <br>(AWAC)](https://arxiv.org/abs/2006.09359)               | [`awac.py`](algorithms/offline/awac.py)                    | [`Gym-MuJoCo, Maze2D`](https://wandb.ai/tlab/CORL/reports/AWAC-D4RL-Results--VmlldzoyNzA2MjE3)
+| ✅ [Offline Reinforcement Learning with Implicit Q-Learning <br>(IQL)](https://arxiv.org/abs/2110.06169)                         | [`iql.py`](algorithms/offline/iql.py)                      | [`Gym-MuJoCo, Maze2D`](https://wandb.ai/tlab/CORL/reports/IQL-D4RL-Results--VmlldzoyNzA2MTkx)
+| ✅ [A Minimalist Approach to Offline Reinforcement Learning <br>(TD3+BC)](https://arxiv.org/abs/2106.06860)                      | [`td3_bc.py`](algorithms/offline/td3_bc.py)                | [`Gym-MuJoCo, Maze2D`](https://wandb.ai/tlab/CORL/reports/TD3-BC-D4RL-Results--VmlldzoyNzA2MjA0)
+| ✅ [Decision Transformer: Reinforcement Learning via Sequence Modeling <br>(DT)](https://arxiv.org/abs/2106.01345)               | [`dt.py`](algorithms/offline/dt.py)                        | [`Gym-MuJoCo, Maze2D`](https://wandb.ai/tlab/CORL/reports/DT-D4RL-Results--VmlldzoyNzA2MTk3)
+| ✅ [Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble <br>(SAC-N)](https://arxiv.org/abs/2110.01548)  | [`sac_n.py`](algorithms/offline/sac_n.py)                  | [`Gym-MuJoCo, Maze2D`](https://wandb.ai/tlab/CORL/reports/SAC-N-D4RL-Results--VmlldzoyNzA1NTY1)
+| ✅ [Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble <br>(EDAC)](https://arxiv.org/abs/2110.01548)   | [`edac.py`](algorithms/offline/edac.py)                    | [`Gym-MuJoCo, Maze2D`](https://wandb.ai/tlab/CORL/reports/EDAC-D4RL-Results--VmlldzoyNzA5ODUw)
+| ✅ [Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size <br>(LB-SAC)](https://arxiv.org/abs/2211.11092) | [`lb_sac.py`](algorithms/offline/lb_sac.py)                | [`Gym-MuJoCo`](https://wandb.ai/tlab/CORL/reports/LB-SAC-D4RL-Results--VmlldzozNjIxMDY1)
 
 ## D4RL Benchmarks
 For learning curves and all the details, you can check the links above. Here, we report reproduced **final** and **best** scores. Note that they differ by a big margin, and some papers use different approaches without always making explicit which reporting methodology they chose.

diff --git a/algorithms/__init__.py b/algorithms/__init__.py
diff --git a/algorithms/any_percent_bc.py → algorithms/offline/any_percent_bc.py b/algorithms/any_percent_bc.py → algorithms/offline/any_percent_bc.py
diff --git a/algorithms/awac.py → algorithms/offline/awac.py b/algorithms/awac.py → algorithms/offline/awac.py
diff --git a/algorithms/cql.py → algorithms/offline/cql.py b/algorithms/cql.py → algorithms/offline/cql.py
diff --git a/algorithms/dt.py → algorithms/offline/dt.py b/algorithms/dt.py → algorithms/offline/dt.py
diff --git a/algorithms/edac.py → algorithms/offline/edac.py b/algorithms/edac.py → algorithms/offline/edac.py
diff --git a/algorithms/iql.py → algorithms/offline/iql.py b/algorithms/iql.py → algorithms/offline/iql.py
diff --git a/algorithms/lb_sac.py → algorithms/offline/lb_sac.py b/algorithms/lb_sac.py → algorithms/offline/lb_sac.py
diff --git a/algorithms/sac_n.py → algorithms/offline/sac_n.py b/algorithms/sac_n.py → algorithms/offline/sac_n.py
diff --git a/algorithms/td3_bc.py → algorithms/offline/td3_bc.py b/algorithms/td3_bc.py → algorithms/offline/td3_bc.py
diff --git a/configs/awac/antmaze/large_play_v0.yaml → ...s/offline/awac/antmaze/large_play_v0.yaml b/configs/awac/antmaze/large_play_v0.yaml → ...s/offline/awac/antmaze/large_play_v0.yaml
diff --git a/configs/awac/antmaze/medium_play_v0.yaml → .../offline/awac/antmaze/medium_play_v0.yaml b/configs/awac/antmaze/medium_play_v0.yaml → .../offline/awac/antmaze/medium_play_v0.yaml
diff --git a/configs/awac/antmaze/umaze_v0.yaml → configs/offline/awac/antmaze/umaze_v0.yaml b/configs/awac/antmaze/umaze_v0.yaml → configs/offline/awac/antmaze/umaze_v0.yaml
diff --git a/configs/awac/halfcheetah/expert_v2.yaml → ...s/offline/awac/halfcheetah/expert_v2.yaml b/configs/awac/halfcheetah/expert_v2.yaml → ...s/offline/awac/halfcheetah/expert_v2.yaml
diff --git a/configs/awac/halfcheetah/full_replay_v2.yaml → ...line/awac/halfcheetah/full_replay_v2.yaml b/configs/awac/halfcheetah/full_replay_v2.yaml → ...line/awac/halfcheetah/full_replay_v2.yaml
diff --git a/...gs/awac/halfcheetah/medium_expert_v2.yaml → ...ne/awac/halfcheetah/medium_expert_v2.yaml b/...gs/awac/halfcheetah/medium_expert_v2.yaml → ...ne/awac/halfcheetah/medium_expert_v2.yaml
diff --git a/...gs/awac/halfcheetah/medium_replay_v2.yaml → ...ne/awac/halfcheetah/medium_replay_v2.yaml b/...gs/awac/halfcheetah/medium_replay_v2.yaml → ...ne/awac/halfcheetah/medium_replay_v2.yaml
diff --git a/configs/awac/halfcheetah/medium_v2.yaml → ...s/offline/awac/halfcheetah/medium_v2.yaml b/configs/awac/halfcheetah/medium_v2.yaml → ...s/offline/awac/halfcheetah/medium_v2.yaml
diff --git a/configs/awac/halfcheetah/random_v2.yaml → ...s/offline/awac/halfcheetah/random_v2.yaml b/configs/awac/halfcheetah/random_v2.yaml → ...s/offline/awac/halfcheetah/random_v2.yaml
diff --git a/configs/awac/hopper/expert_v2.yaml → configs/offline/awac/hopper/expert_v2.yaml b/configs/awac/hopper/expert_v2.yaml → configs/offline/awac/hopper/expert_v2.yaml
diff --git a/configs/awac/hopper/full_replay_v2.yaml → ...s/offline/awac/hopper/full_replay_v2.yaml b/configs/awac/hopper/full_replay_v2.yaml → ...s/offline/awac/hopper/full_replay_v2.yaml
diff --git a/configs/awac/hopper/medium_expert_v2.yaml → ...offline/awac/hopper/medium_expert_v2.yaml b/configs/awac/hopper/medium_expert_v2.yaml → ...offline/awac/hopper/medium_expert_v2.yaml
diff --git a/configs/awac/hopper/medium_replay_v2.yaml → ...offline/awac/hopper/medium_replay_v2.yaml b/configs/awac/hopper/medium_replay_v2.yaml → ...offline/awac/hopper/medium_replay_v2.yaml
diff --git a/configs/awac/hopper/medium_v2.yaml → configs/offline/awac/hopper/medium_v2.yaml b/configs/awac/hopper/medium_v2.yaml → configs/offline/awac/hopper/medium_v2.yaml
diff --git a/configs/awac/hopper/random_v2.yaml → configs/offline/awac/hopper/random_v2.yaml b/configs/awac/hopper/random_v2.yaml → configs/offline/awac/hopper/random_v2.yaml
diff --git a/configs/awac/maze2d/large_dense_v1.yaml → ...s/offline/awac/maze2d/large_dense_v1.yaml b/configs/awac/maze2d/large_dense_v1.yaml → ...s/offline/awac/maze2d/large_dense_v1.yaml
diff --git a/configs/awac/maze2d/large_v1.yaml → configs/offline/awac/maze2d/large_v1.yaml b/configs/awac/maze2d/large_v1.yaml → configs/offline/awac/maze2d/large_v1.yaml
diff --git a/configs/awac/maze2d/medium_dense_v1.yaml → .../offline/awac/maze2d/medium_dense_v1.yaml b/configs/awac/maze2d/medium_dense_v1.yaml → .../offline/awac/maze2d/medium_dense_v1.yaml
diff --git a/configs/awac/maze2d/medium_v1.yaml → configs/offline/awac/maze2d/medium_v1.yaml b/configs/awac/maze2d/medium_v1.yaml → configs/offline/awac/maze2d/medium_v1.yaml
diff --git a/configs/awac/maze2d/umaze_dense_v1.yaml → ...s/offline/awac/maze2d/umaze_dense_v1.yaml b/configs/awac/maze2d/umaze_dense_v1.yaml → ...s/offline/awac/maze2d/umaze_dense_v1.yaml
diff --git a/configs/awac/maze2d/umaze_v1.yaml → configs/offline/awac/maze2d/umaze_v1.yaml b/configs/awac/maze2d/umaze_v1.yaml → configs/offline/awac/maze2d/umaze_v1.yaml
diff --git a/configs/awac/walker2d/expert_v2.yaml → configs/offline/awac/walker2d/expert_v2.yaml b/configs/awac/walker2d/expert_v2.yaml → configs/offline/awac/walker2d/expert_v2.yaml
diff --git a/configs/awac/walker2d/full_replay_v2.yaml → ...offline/awac/walker2d/full_replay_v2.yaml b/configs/awac/walker2d/full_replay_v2.yaml → ...offline/awac/walker2d/full_replay_v2.yaml
diff --git a/configs/awac/walker2d/medium_expert_v2.yaml → ...fline/awac/walker2d/medium_expert_v2.yaml b/configs/awac/walker2d/medium_expert_v2.yaml → ...fline/awac/walker2d/medium_expert_v2.yaml
diff --git a/configs/awac/walker2d/medium_replay_v2.yaml → ...fline/awac/walker2d/medium_replay_v2.yaml b/configs/awac/walker2d/medium_replay_v2.yaml → ...fline/awac/walker2d/medium_replay_v2.yaml
diff --git a/configs/awac/walker2d/medium_v2.yaml → configs/offline/awac/walker2d/medium_v2.yaml b/configs/awac/walker2d/medium_v2.yaml → configs/offline/awac/walker2d/medium_v2.yaml
diff --git a/configs/awac/walker2d/random_v2.yaml → configs/offline/awac/walker2d/random_v2.yaml b/configs/awac/walker2d/random_v2.yaml → configs/offline/awac/walker2d/random_v2.yaml
diff --git a/configs/bc/antmaze/large_play_v0.yaml → ...igs/offline/bc/antmaze/large_play_v0.yaml b/configs/bc/antmaze/large_play_v0.yaml → ...igs/offline/bc/antmaze/large_play_v0.yaml
diff --git a/configs/bc/antmaze/medium_play_v0.yaml → ...gs/offline/bc/antmaze/medium_play_v0.yaml b/configs/bc/antmaze/medium_play_v0.yaml → ...gs/offline/bc/antmaze/medium_play_v0.yaml
diff --git a/configs/bc/antmaze/umaze_v0.yaml → configs/offline/bc/antmaze/umaze_v0.yaml b/configs/bc/antmaze/umaze_v0.yaml → configs/offline/bc/antmaze/umaze_v0.yaml
diff --git a/configs/bc/halfcheetah/expert_v2.yaml → ...igs/offline/bc/halfcheetah/expert_v2.yaml b/configs/bc/halfcheetah/expert_v2.yaml → ...igs/offline/bc/halfcheetah/expert_v2.yaml
diff --git a/configs/bc/halfcheetah/full_replay_v2.yaml → ...ffline/bc/halfcheetah/full_replay_v2.yaml b/configs/bc/halfcheetah/full_replay_v2.yaml → ...ffline/bc/halfcheetah/full_replay_v2.yaml
diff --git a/configs/bc/halfcheetah/medium_expert_v2.yaml → ...line/bc/halfcheetah/medium_expert_v2.yaml b/configs/bc/halfcheetah/medium_expert_v2.yaml → ...line/bc/halfcheetah/medium_expert_v2.yaml
diff --git a/configs/bc/halfcheetah/medium_replay_v2.yaml → ...line/bc/halfcheetah/medium_replay_v2.yaml b/configs/bc/halfcheetah/medium_replay_v2.yaml → ...line/bc/halfcheetah/medium_replay_v2.yaml
diff --git a/configs/bc/halfcheetah/medium_v2.yaml → ...igs/offline/bc/halfcheetah/medium_v2.yaml b/configs/bc/halfcheetah/medium_v2.yaml → ...igs/offline/bc/halfcheetah/medium_v2.yaml
diff --git a/configs/bc/halfcheetah/random_v2.yaml → ...igs/offline/bc/halfcheetah/random_v2.yaml b/configs/bc/halfcheetah/random_v2.yaml → ...igs/offline/bc/halfcheetah/random_v2.yaml
diff --git a/configs/bc/hopper/expert_v2.yaml → configs/offline/bc/hopper/expert_v2.yaml b/configs/bc/hopper/expert_v2.yaml → configs/offline/bc/hopper/expert_v2.yaml
diff --git a/configs/bc/hopper/full_replay_v2.yaml → ...igs/offline/bc/hopper/full_replay_v2.yaml b/configs/bc/hopper/full_replay_v2.yaml → ...igs/offline/bc/hopper/full_replay_v2.yaml
diff --git a/configs/bc/hopper/medium_expert_v2.yaml → ...s/offline/bc/hopper/medium_expert_v2.yaml b/configs/bc/hopper/medium_expert_v2.yaml → ...s/offline/bc/hopper/medium_expert_v2.yaml
diff --git a/configs/bc/hopper/medium_replay_v2.yaml → ...s/offline/bc/hopper/medium_replay_v2.yaml b/configs/bc/hopper/medium_replay_v2.yaml → ...s/offline/bc/hopper/medium_replay_v2.yaml
diff --git a/configs/bc/hopper/medium_v2.yaml → configs/offline/bc/hopper/medium_v2.yaml b/configs/bc/hopper/medium_v2.yaml → configs/offline/bc/hopper/medium_v2.yaml
diff --git a/configs/bc/hopper/random_v2.yaml → configs/offline/bc/hopper/random_v2.yaml b/configs/bc/hopper/random_v2.yaml → configs/offline/bc/hopper/random_v2.yaml
diff --git a/configs/bc/maze2d/large_dense_v1.yaml → ...igs/offline/bc/maze2d/large_dense_v1.yaml b/configs/bc/maze2d/large_dense_v1.yaml → ...igs/offline/bc/maze2d/large_dense_v1.yaml
diff --git a/configs/bc/maze2d/large_v1.yaml → configs/offline/bc/maze2d/large_v1.yaml b/configs/bc/maze2d/large_v1.yaml → configs/offline/bc/maze2d/large_v1.yaml
diff --git a/configs/bc/maze2d/medium_dense_v1.yaml → ...gs/offline/bc/maze2d/medium_dense_v1.yaml b/configs/bc/maze2d/medium_dense_v1.yaml → ...gs/offline/bc/maze2d/medium_dense_v1.yaml
diff --git a/configs/bc/maze2d/medium_v1.yaml → configs/offline/bc/maze2d/medium_v1.yaml b/configs/bc/maze2d/medium_v1.yaml → configs/offline/bc/maze2d/medium_v1.yaml
diff --git a/configs/bc/maze2d/umaze_dense_v1.yaml → ...igs/offline/bc/maze2d/umaze_dense_v1.yaml b/configs/bc/maze2d/umaze_dense_v1.yaml → ...igs/offline/bc/maze2d/umaze_dense_v1.yaml
diff --git a/configs/bc/maze2d/umaze_v1.yaml → configs/offline/bc/maze2d/umaze_v1.yaml b/configs/bc/maze2d/umaze_v1.yaml → configs/offline/bc/maze2d/umaze_v1.yaml
diff --git a/configs/bc/walker2d/expert_v2.yaml → configs/offline/bc/walker2d/expert_v2.yaml b/configs/bc/walker2d/expert_v2.yaml → configs/offline/bc/walker2d/expert_v2.yaml
diff --git a/configs/bc/walker2d/full_replay_v2.yaml → ...s/offline/bc/walker2d/full_replay_v2.yaml b/configs/bc/walker2d/full_replay_v2.yaml → ...s/offline/bc/walker2d/full_replay_v2.yaml
diff --git a/configs/bc/walker2d/medium_expert_v2.yaml → ...offline/bc/walker2d/medium_expert_v2.yaml b/configs/bc/walker2d/medium_expert_v2.yaml → ...offline/bc/walker2d/medium_expert_v2.yaml
diff --git a/configs/bc/walker2d/medium_replay_v2.yaml → ...offline/bc/walker2d/medium_replay_v2.yaml b/configs/bc/walker2d/medium_replay_v2.yaml → ...offline/bc/walker2d/medium_replay_v2.yaml
diff --git a/configs/bc/walker2d/medium_v2.yaml → configs/offline/bc/walker2d/medium_v2.yaml b/configs/bc/walker2d/medium_v2.yaml → configs/offline/bc/walker2d/medium_v2.yaml
diff --git a/configs/bc/walker2d/random_v2.yaml → configs/offline/bc/walker2d/random_v2.yaml b/configs/bc/walker2d/random_v2.yaml → configs/offline/bc/walker2d/random_v2.yaml
diff --git a/configs/bc_10/antmaze/large_play_v0.yaml → .../offline/bc_10/antmaze/large_play_v0.yaml b/configs/bc_10/antmaze/large_play_v0.yaml → .../offline/bc_10/antmaze/large_play_v0.yaml
diff --git a/configs/bc_10/antmaze/medium_play_v0.yaml → ...offline/bc_10/antmaze/medium_play_v0.yaml b/configs/bc_10/antmaze/medium_play_v0.yaml → ...offline/bc_10/antmaze/medium_play_v0.yaml
diff --git a/configs/bc_10/antmaze/umaze_v0.yaml → configs/offline/bc_10/antmaze/umaze_v0.yaml b/configs/bc_10/antmaze/umaze_v0.yaml → configs/offline/bc_10/antmaze/umaze_v0.yaml
diff --git a/configs/bc_10/halfcheetah/expert_v2.yaml → .../offline/bc_10/halfcheetah/expert_v2.yaml b/configs/bc_10/halfcheetah/expert_v2.yaml → .../offline/bc_10/halfcheetah/expert_v2.yaml
diff --git a/...igs/bc_10/halfcheetah/full_replay_v2.yaml → ...ine/bc_10/halfcheetah/full_replay_v2.yaml b/...igs/bc_10/halfcheetah/full_replay_v2.yaml → ...ine/bc_10/halfcheetah/full_replay_v2.yaml
diff --git a/...s/bc_10/halfcheetah/medium_expert_v2.yaml → ...e/bc_10/halfcheetah/medium_expert_v2.yaml b/...s/bc_10/halfcheetah/medium_expert_v2.yaml → ...e/bc_10/halfcheetah/medium_expert_v2.yaml
diff --git a/...s/bc_10/halfcheetah/medium_replay_v2.yaml → ...e/bc_10/halfcheetah/medium_replay_v2.yaml b/...s/bc_10/halfcheetah/medium_replay_v2.yaml → ...e/bc_10/halfcheetah/medium_replay_v2.yaml
diff --git a/configs/bc_10/halfcheetah/medium_v2.yaml → .../offline/bc_10/halfcheetah/medium_v2.yaml b/configs/bc_10/halfcheetah/medium_v2.yaml → .../offline/bc_10/halfcheetah/medium_v2.yaml
diff --git a/configs/bc_10/halfcheetah/random_v2.yaml → .../offline/bc_10/halfcheetah/random_v2.yaml b/configs/bc_10/halfcheetah/random_v2.yaml → .../offline/bc_10/halfcheetah/random_v2.yaml
diff --git a/configs/bc_10/hopper/expert_v2.yaml → configs/offline/bc_10/hopper/expert_v2.yaml b/configs/bc_10/hopper/expert_v2.yaml → configs/offline/bc_10/hopper/expert_v2.yaml
diff --git a/configs/bc_10/hopper/full_replay_v2.yaml → .../offline/bc_10/hopper/full_replay_v2.yaml b/configs/bc_10/hopper/full_replay_v2.yaml → .../offline/bc_10/hopper/full_replay_v2.yaml
diff --git a/configs/bc_10/hopper/medium_expert_v2.yaml → ...ffline/bc_10/hopper/medium_expert_v2.yaml b/configs/bc_10/hopper/medium_expert_v2.yaml → ...ffline/bc_10/hopper/medium_expert_v2.yaml
diff --git a/configs/bc_10/hopper/medium_replay_v2.yaml → ...ffline/bc_10/hopper/medium_replay_v2.yaml b/configs/bc_10/hopper/medium_replay_v2.yaml → ...ffline/bc_10/hopper/medium_replay_v2.yaml
diff --git a/configs/bc_10/hopper/medium_v2.yaml → configs/offline/bc_10/hopper/medium_v2.yaml b/configs/bc_10/hopper/medium_v2.yaml → configs/offline/bc_10/hopper/medium_v2.yaml
diff --git a/configs/bc_10/hopper/random_v2.yaml → configs/offline/bc_10/hopper/random_v2.yaml b/configs/bc_10/hopper/random_v2.yaml → configs/offline/bc_10/hopper/random_v2.yaml
diff --git a/configs/bc_10/maze2d/large_dense_v1.yaml → .../offline/bc_10/maze2d/large_dense_v1.yaml b/configs/bc_10/maze2d/large_dense_v1.yaml → .../offline/bc_10/maze2d/large_dense_v1.yaml
diff --git a/configs/bc_10/maze2d/large_v1.yaml → configs/offline/bc_10/maze2d/large_v1.yaml b/configs/bc_10/maze2d/large_v1.yaml → configs/offline/bc_10/maze2d/large_v1.yaml
diff --git a/configs/bc_10/maze2d/medium_dense_v1.yaml → ...offline/bc_10/maze2d/medium_dense_v1.yaml b/configs/bc_10/maze2d/medium_dense_v1.yaml → ...offline/bc_10/maze2d/medium_dense_v1.yaml
diff --git a/configs/bc_10/maze2d/medium_v1.yaml → configs/offline/bc_10/maze2d/medium_v1.yaml b/configs/bc_10/maze2d/medium_v1.yaml → configs/offline/bc_10/maze2d/medium_v1.yaml
diff --git a/configs/bc_10/maze2d/umaze_dense_v1.yaml → .../offline/bc_10/maze2d/umaze_dense_v1.yaml b/configs/bc_10/maze2d/umaze_dense_v1.yaml → .../offline/bc_10/maze2d/umaze_dense_v1.yaml
diff --git a/configs/bc_10/maze2d/umaze_v1.yaml → configs/offline/bc_10/maze2d/umaze_v1.yaml b/configs/bc_10/maze2d/umaze_v1.yaml → configs/offline/bc_10/maze2d/umaze_v1.yaml
diff --git a/configs/bc_10/walker2d/expert_v2.yaml → ...igs/offline/bc_10/walker2d/expert_v2.yaml b/configs/bc_10/walker2d/expert_v2.yaml → ...igs/offline/bc_10/walker2d/expert_v2.yaml
diff --git a/configs/bc_10/walker2d/full_replay_v2.yaml → ...ffline/bc_10/walker2d/full_replay_v2.yaml b/configs/bc_10/walker2d/full_replay_v2.yaml → ...ffline/bc_10/walker2d/full_replay_v2.yaml
diff --git a/configs/bc_10/walker2d/medium_expert_v2.yaml → ...line/bc_10/walker2d/medium_expert_v2.yaml b/configs/bc_10/walker2d/medium_expert_v2.yaml → ...line/bc_10/walker2d/medium_expert_v2.yaml
diff --git a/configs/bc_10/walker2d/medium_replay_v2.yaml → ...line/bc_10/walker2d/medium_replay_v2.yaml b/configs/bc_10/walker2d/medium_replay_v2.yaml → ...line/bc_10/walker2d/medium_replay_v2.yaml
diff --git a/configs/bc_10/walker2d/medium_v2.yaml → ...igs/offline/bc_10/walker2d/medium_v2.yaml b/configs/bc_10/walker2d/medium_v2.yaml → ...igs/offline/bc_10/walker2d/medium_v2.yaml
diff --git a/configs/bc_10/walker2d/random_v2.yaml → ...igs/offline/bc_10/walker2d/random_v2.yaml b/configs/bc_10/walker2d/random_v2.yaml → ...igs/offline/bc_10/walker2d/random_v2.yaml
diff --git a/configs/cql/antmaze/large_play_v0.yaml → ...gs/offline/cql/antmaze/large_play_v0.yaml b/configs/cql/antmaze/large_play_v0.yaml → ...gs/offline/cql/antmaze/large_play_v0.yaml
diff --git a/configs/cql/antmaze/medium_play_v0.yaml → ...s/offline/cql/antmaze/medium_play_v0.yaml b/configs/cql/antmaze/medium_play_v0.yaml → ...s/offline/cql/antmaze/medium_play_v0.yaml
diff --git a/configs/cql/antmaze/umaze_v0.yaml → configs/offline/cql/antmaze/umaze_v0.yaml b/configs/cql/antmaze/umaze_v0.yaml → configs/offline/cql/antmaze/umaze_v0.yaml
diff --git a/configs/cql/halfcheetah/expert_v2.yaml → ...gs/offline/cql/halfcheetah/expert_v2.yaml b/configs/cql/halfcheetah/expert_v2.yaml → ...gs/offline/cql/halfcheetah/expert_v2.yaml
diff --git a/configs/cql/halfcheetah/full_replay_v2.yaml → ...fline/cql/halfcheetah/full_replay_v2.yaml b/configs/cql/halfcheetah/full_replay_v2.yaml → ...fline/cql/halfcheetah/full_replay_v2.yaml
diff --git a/...igs/cql/halfcheetah/medium_expert_v2.yaml → ...ine/cql/halfcheetah/medium_expert_v2.yaml b/...igs/cql/halfcheetah/medium_expert_v2.yaml → ...ine/cql/halfcheetah/medium_expert_v2.yaml
diff --git a/...igs/cql/halfcheetah/medium_replay_v2.yaml → ...ine/cql/halfcheetah/medium_replay_v2.yaml b/...igs/cql/halfcheetah/medium_replay_v2.yaml → ...ine/cql/halfcheetah/medium_replay_v2.yaml
diff --git a/configs/cql/halfcheetah/medium_v2.yaml → ...gs/offline/cql/halfcheetah/medium_v2.yaml b/configs/cql/halfcheetah/medium_v2.yaml → ...gs/offline/cql/halfcheetah/medium_v2.yaml
diff --git a/configs/cql/halfcheetah/random_v2.yaml → ...gs/offline/cql/halfcheetah/random_v2.yaml b/configs/cql/halfcheetah/random_v2.yaml → ...gs/offline/cql/halfcheetah/random_v2.yaml
diff --git a/configs/cql/hopper/expert_v2.yaml → configs/offline/cql/hopper/expert_v2.yaml b/configs/cql/hopper/expert_v2.yaml → configs/offline/cql/hopper/expert_v2.yaml
diff --git a/configs/cql/hopper/full_replay_v2.yaml → ...gs/offline/cql/hopper/full_replay_v2.yaml b/configs/cql/hopper/full_replay_v2.yaml → ...gs/offline/cql/hopper/full_replay_v2.yaml
diff --git a/configs/cql/hopper/medium_expert_v2.yaml → .../offline/cql/hopper/medium_expert_v2.yaml b/configs/cql/hopper/medium_expert_v2.yaml → .../offline/cql/hopper/medium_expert_v2.yaml
diff --git a/configs/cql/hopper/medium_replay_v2.yaml → .../offline/cql/hopper/medium_replay_v2.yaml b/configs/cql/hopper/medium_replay_v2.yaml → .../offline/cql/hopper/medium_replay_v2.yaml
diff --git a/configs/cql/hopper/medium_v2.yaml → configs/offline/cql/hopper/medium_v2.yaml b/configs/cql/hopper/medium_v2.yaml → configs/offline/cql/hopper/medium_v2.yaml
diff --git a/configs/cql/hopper/random_v2.yaml → configs/offline/cql/hopper/random_v2.yaml b/configs/cql/hopper/random_v2.yaml → configs/offline/cql/hopper/random_v2.yaml
diff --git a/configs/cql/maze2d/large_dense_v1.yaml → ...gs/offline/cql/maze2d/large_dense_v1.yaml b/configs/cql/maze2d/large_dense_v1.yaml → ...gs/offline/cql/maze2d/large_dense_v1.yaml
diff --git a/configs/cql/maze2d/large_v1.yaml → configs/offline/cql/maze2d/large_v1.yaml b/configs/cql/maze2d/large_v1.yaml → configs/offline/cql/maze2d/large_v1.yaml
diff --git a/configs/cql/maze2d/medium_dense_v1.yaml → ...s/offline/cql/maze2d/medium_dense_v1.yaml b/configs/cql/maze2d/medium_dense_v1.yaml → ...s/offline/cql/maze2d/medium_dense_v1.yaml
diff --git a/configs/cql/maze2d/medium_v1.yaml → configs/offline/cql/maze2d/medium_v1.yaml b/configs/cql/maze2d/medium_v1.yaml → configs/offline/cql/maze2d/medium_v1.yaml
diff --git a/configs/cql/maze2d/umaze_dense_v1.yaml → ...gs/offline/cql/maze2d/umaze_dense_v1.yaml b/configs/cql/maze2d/umaze_dense_v1.yaml → ...gs/offline/cql/maze2d/umaze_dense_v1.yaml
diff --git a/configs/cql/maze2d/umaze_v1.yaml → configs/offline/cql/maze2d/umaze_v1.yaml b/configs/cql/maze2d/umaze_v1.yaml → configs/offline/cql/maze2d/umaze_v1.yaml
diff --git a/configs/cql/walker2d/expert_v2.yaml → configs/offline/cql/walker2d/expert_v2.yaml b/configs/cql/walker2d/expert_v2.yaml → configs/offline/cql/walker2d/expert_v2.yaml
diff --git a/configs/cql/walker2d/full_replay_v2.yaml → .../offline/cql/walker2d/full_replay_v2.yaml b/configs/cql/walker2d/full_replay_v2.yaml → .../offline/cql/walker2d/full_replay_v2.yaml
diff --git a/configs/cql/walker2d/medium_expert_v2.yaml → ...ffline/cql/walker2d/medium_expert_v2.yaml b/configs/cql/walker2d/medium_expert_v2.yaml → ...ffline/cql/walker2d/medium_expert_v2.yaml
diff --git a/configs/cql/walker2d/medium_replay_v2.yaml → ...ffline/cql/walker2d/medium_replay_v2.yaml b/configs/cql/walker2d/medium_replay_v2.yaml → ...ffline/cql/walker2d/medium_replay_v2.yaml
diff --git a/configs/cql/walker2d/medium_v2.yaml → configs/offline/cql/walker2d/medium_v2.yaml b/configs/cql/walker2d/medium_v2.yaml → configs/offline/cql/walker2d/medium_v2.yaml
diff --git a/configs/cql/walker2d/random_v2.yaml → configs/offline/cql/walker2d/random_v2.yaml b/configs/cql/walker2d/random_v2.yaml → configs/offline/cql/walker2d/random_v2.yaml
diff --git a/configs/dt/antmaze/large_play_v0.yaml → ...igs/offline/dt/antmaze/large_play_v0.yaml b/configs/dt/antmaze/large_play_v0.yaml → ...igs/offline/dt/antmaze/large_play_v0.yaml
diff --git a/configs/dt/antmaze/medium_play_v0.yaml → ...gs/offline/dt/antmaze/medium_play_v0.yaml b/configs/dt/antmaze/medium_play_v0.yaml → ...gs/offline/dt/antmaze/medium_play_v0.yaml
diff --git a/configs/dt/antmaze/umaze_v0.yaml → configs/offline/dt/antmaze/umaze_v0.yaml b/configs/dt/antmaze/umaze_v0.yaml → configs/offline/dt/antmaze/umaze_v0.yaml
diff --git a/configs/dt/halfcheetah/medium_expert_v2.yaml → ...line/dt/halfcheetah/medium_expert_v2.yaml b/configs/dt/halfcheetah/medium_expert_v2.yaml → ...line/dt/halfcheetah/medium_expert_v2.yaml
diff --git a/configs/dt/halfcheetah/medium_replay_v2.yaml → ...line/dt/halfcheetah/medium_replay_v2.yaml b/configs/dt/halfcheetah/medium_replay_v2.yaml → ...line/dt/halfcheetah/medium_replay_v2.yaml
diff --git a/configs/dt/halfcheetah/medium_v2.yaml → ...igs/offline/dt/halfcheetah/medium_v2.yaml b/configs/dt/halfcheetah/medium_v2.yaml → ...igs/offline/dt/halfcheetah/medium_v2.yaml
diff --git a/configs/dt/hopper/medium_expert_v2.yaml → ...s/offline/dt/hopper/medium_expert_v2.yaml b/configs/dt/hopper/medium_expert_v2.yaml → ...s/offline/dt/hopper/medium_expert_v2.yaml
diff --git a/configs/dt/hopper/medium_replay_v2.yaml → ...s/offline/dt/hopper/medium_replay_v2.yaml b/configs/dt/hopper/medium_replay_v2.yaml → ...s/offline/dt/hopper/medium_replay_v2.yaml
diff --git a/configs/dt/hopper/medium_v2.yaml → configs/offline/dt/hopper/medium_v2.yaml b/configs/dt/hopper/medium_v2.yaml → configs/offline/dt/hopper/medium_v2.yaml
diff --git a/configs/dt/maze2d/large_v1.yaml → configs/offline/dt/maze2d/large_v1.yaml b/configs/dt/maze2d/large_v1.yaml → configs/offline/dt/maze2d/large_v1.yaml
diff --git a/configs/dt/maze2d/medium_v1.yaml → configs/offline/dt/maze2d/medium_v1.yaml b/configs/dt/maze2d/medium_v1.yaml → configs/offline/dt/maze2d/medium_v1.yaml
diff --git a/configs/dt/maze2d/umaze_v1.yaml → configs/offline/dt/maze2d/umaze_v1.yaml b/configs/dt/maze2d/umaze_v1.yaml → configs/offline/dt/maze2d/umaze_v1.yaml
diff --git a/configs/dt/walker2d/medium_expert_v2.yaml → ...offline/dt/walker2d/medium_expert_v2.yaml b/configs/dt/walker2d/medium_expert_v2.yaml → ...offline/dt/walker2d/medium_expert_v2.yaml
diff --git a/configs/dt/walker2d/medium_replay_v2.yaml → ...offline/dt/walker2d/medium_replay_v2.yaml b/configs/dt/walker2d/medium_replay_v2.yaml → ...offline/dt/walker2d/medium_replay_v2.yaml
diff --git a/configs/dt/walker2d/medium_v2.yaml → configs/offline/dt/walker2d/medium_v2.yaml b/configs/dt/walker2d/medium_v2.yaml → configs/offline/dt/walker2d/medium_v2.yaml
diff --git a/configs/edac/antmaze/large_play_v0.yaml → ...s/offline/edac/antmaze/large_play_v0.yaml b/configs/edac/antmaze/large_play_v0.yaml → ...s/offline/edac/antmaze/large_play_v0.yaml
diff --git a/configs/edac/antmaze/medium_play_v0.yaml → .../offline/edac/antmaze/medium_play_v0.yaml b/configs/edac/antmaze/medium_play_v0.yaml → .../offline/edac/antmaze/medium_play_v0.yaml
diff --git a/configs/edac/antmaze/umaze_v0.yaml → configs/offline/edac/antmaze/umaze_v0.yaml b/configs/edac/antmaze/umaze_v0.yaml → configs/offline/edac/antmaze/umaze_v0.yaml
diff --git a/...gs/edac/halfcheetah/medium_expert_v2.yaml → ...ne/edac/halfcheetah/medium_expert_v2.yaml b/...gs/edac/halfcheetah/medium_expert_v2.yaml → ...ne/edac/halfcheetah/medium_expert_v2.yaml
diff --git a/...gs/edac/halfcheetah/medium_replay_v2.yaml → ...ne/edac/halfcheetah/medium_replay_v2.yaml b/...gs/edac/halfcheetah/medium_replay_v2.yaml → ...ne/edac/halfcheetah/medium_replay_v2.yaml
diff --git a/configs/edac/halfcheetah/medium_v2.yaml → ...s/offline/edac/halfcheetah/medium_v2.yaml b/configs/edac/halfcheetah/medium_v2.yaml → ...s/offline/edac/halfcheetah/medium_v2.yaml
diff --git a/configs/edac/hopper/medium_expert_v2.yaml → ...offline/edac/hopper/medium_expert_v2.yaml b/configs/edac/hopper/medium_expert_v2.yaml → ...offline/edac/hopper/medium_expert_v2.yaml
diff --git a/configs/edac/hopper/medium_replay_v2.yaml → ...offline/edac/hopper/medium_replay_v2.yaml b/configs/edac/hopper/medium_replay_v2.yaml → ...offline/edac/hopper/medium_replay_v2.yaml
diff --git a/configs/edac/hopper/medium_v2.yaml → configs/offline/edac/hopper/medium_v2.yaml b/configs/edac/hopper/medium_v2.yaml → configs/offline/edac/hopper/medium_v2.yaml
diff --git a/configs/edac/maze2d/large_v1.yaml → configs/offline/edac/maze2d/large_v1.yaml b/configs/edac/maze2d/large_v1.yaml → configs/offline/edac/maze2d/large_v1.yaml
diff --git a/configs/edac/maze2d/medium_v1.yaml → configs/offline/edac/maze2d/medium_v1.yaml b/configs/edac/maze2d/medium_v1.yaml → configs/offline/edac/maze2d/medium_v1.yaml
diff --git a/configs/edac/maze2d/umaze_v1.yaml → configs/offline/edac/maze2d/umaze_v1.yaml b/configs/edac/maze2d/umaze_v1.yaml → configs/offline/edac/maze2d/umaze_v1.yaml
diff --git a/configs/edac/walker2d/medium_expert_v2.yaml → ...fline/edac/walker2d/medium_expert_v2.yaml b/configs/edac/walker2d/medium_expert_v2.yaml → ...fline/edac/walker2d/medium_expert_v2.yaml
diff --git a/configs/edac/walker2d/medium_replay_v2.yaml → ...fline/edac/walker2d/medium_replay_v2.yaml b/configs/edac/walker2d/medium_replay_v2.yaml → ...fline/edac/walker2d/medium_replay_v2.yaml
diff --git a/configs/edac/walker2d/medium_v2.yaml → configs/offline/edac/walker2d/medium_v2.yaml b/configs/edac/walker2d/medium_v2.yaml → configs/offline/edac/walker2d/medium_v2.yaml
diff --git a/configs/iql/antmaze/large_play_v0.yaml → ...gs/offline/iql/antmaze/large_play_v0.yaml b/configs/iql/antmaze/large_play_v0.yaml → ...gs/offline/iql/antmaze/large_play_v0.yaml
diff --git a/configs/iql/antmaze/medium_play_v0.yaml → ...s/offline/iql/antmaze/medium_play_v0.yaml b/configs/iql/antmaze/medium_play_v0.yaml → ...s/offline/iql/antmaze/medium_play_v0.yaml
diff --git a/configs/iql/antmaze/umaze_v0.yaml → configs/offline/iql/antmaze/umaze_v0.yaml b/configs/iql/antmaze/umaze_v0.yaml → configs/offline/iql/antmaze/umaze_v0.yaml
diff --git a/configs/iql/halfcheetah/expert_v2.yaml → ...gs/offline/iql/halfcheetah/expert_v2.yaml b/configs/iql/halfcheetah/expert_v2.yaml → ...gs/offline/iql/halfcheetah/expert_v2.yaml
diff --git a/configs/iql/halfcheetah/full_replay_v2.yaml → ...fline/iql/halfcheetah/full_replay_v2.yaml b/configs/iql/halfcheetah/full_replay_v2.yaml → ...fline/iql/halfcheetah/full_replay_v2.yaml
diff --git a/...igs/iql/halfcheetah/medium_expert_v2.yaml → ...ine/iql/halfcheetah/medium_expert_v2.yaml b/...igs/iql/halfcheetah/medium_expert_v2.yaml → ...ine/iql/halfcheetah/medium_expert_v2.yaml
diff --git a/...igs/iql/halfcheetah/medium_replay_v2.yaml → ...ine/iql/halfcheetah/medium_replay_v2.yaml b/...igs/iql/halfcheetah/medium_replay_v2.yaml → ...ine/iql/halfcheetah/medium_replay_v2.yaml
diff --git a/configs/iql/halfcheetah/medium_v2.yaml → ...gs/offline/iql/halfcheetah/medium_v2.yaml b/configs/iql/halfcheetah/medium_v2.yaml → ...gs/offline/iql/halfcheetah/medium_v2.yaml
diff --git a/configs/iql/halfcheetah/random_v2.yaml → ...gs/offline/iql/halfcheetah/random_v2.yaml b/configs/iql/halfcheetah/random_v2.yaml → ...gs/offline/iql/halfcheetah/random_v2.yaml
diff --git a/configs/iql/hopper/expert_v2.yaml → configs/offline/iql/hopper/expert_v2.yaml b/configs/iql/hopper/expert_v2.yaml → configs/offline/iql/hopper/expert_v2.yaml
diff --git a/configs/iql/hopper/full_replay_v2.yaml → ...gs/offline/iql/hopper/full_replay_v2.yaml b/configs/iql/hopper/full_replay_v2.yaml → ...gs/offline/iql/hopper/full_replay_v2.yaml
diff --git a/configs/iql/hopper/medium_expert_v2.yaml → .../offline/iql/hopper/medium_expert_v2.yaml b/configs/iql/hopper/medium_expert_v2.yaml → .../offline/iql/hopper/medium_expert_v2.yaml
diff --git a/configs/iql/hopper/medium_replay_v2.yaml → .../offline/iql/hopper/medium_replay_v2.yaml b/configs/iql/hopper/medium_replay_v2.yaml → .../offline/iql/hopper/medium_replay_v2.yaml
diff --git a/configs/iql/hopper/medium_v2.yaml → configs/offline/iql/hopper/medium_v2.yaml b/configs/iql/hopper/medium_v2.yaml → configs/offline/iql/hopper/medium_v2.yaml
diff --git a/configs/iql/hopper/random_v2.yaml → configs/offline/iql/hopper/random_v2.yaml b/configs/iql/hopper/random_v2.yaml → configs/offline/iql/hopper/random_v2.yaml
diff --git a/configs/iql/maze2d/large_dense_v1.yaml → ...gs/offline/iql/maze2d/large_dense_v1.yaml b/configs/iql/maze2d/large_dense_v1.yaml → ...gs/offline/iql/maze2d/large_dense_v1.yaml
diff --git a/configs/iql/maze2d/large_v1.yaml → configs/offline/iql/maze2d/large_v1.yaml b/configs/iql/maze2d/large_v1.yaml → configs/offline/iql/maze2d/large_v1.yaml
diff --git a/configs/iql/maze2d/medium_dense_v1.yaml → ...s/offline/iql/maze2d/medium_dense_v1.yaml b/configs/iql/maze2d/medium_dense_v1.yaml → ...s/offline/iql/maze2d/medium_dense_v1.yaml
diff --git a/configs/iql/maze2d/medium_v1.yaml → configs/offline/iql/maze2d/medium_v1.yaml b/configs/iql/maze2d/medium_v1.yaml → configs/offline/iql/maze2d/medium_v1.yaml
diff --git a/configs/iql/maze2d/umaze_dense_v1.yaml → ...gs/offline/iql/maze2d/umaze_dense_v1.yaml b/configs/iql/maze2d/umaze_dense_v1.yaml → ...gs/offline/iql/maze2d/umaze_dense_v1.yaml
diff --git a/configs/iql/maze2d/umaze_v1.yaml → configs/offline/iql/maze2d/umaze_v1.yaml b/configs/iql/maze2d/umaze_v1.yaml → configs/offline/iql/maze2d/umaze_v1.yaml
diff --git a/configs/iql/walker2d/expert_v2.yaml → configs/offline/iql/walker2d/expert_v2.yaml b/configs/iql/walker2d/expert_v2.yaml → configs/offline/iql/walker2d/expert_v2.yaml
diff --git a/configs/iql/walker2d/full_replay_v2.yaml → .../offline/iql/walker2d/full_replay_v2.yaml b/configs/iql/walker2d/full_replay_v2.yaml → .../offline/iql/walker2d/full_replay_v2.yaml
diff --git a/configs/iql/walker2d/medium_expert_v2.yaml → ...ffline/iql/walker2d/medium_expert_v2.yaml b/configs/iql/walker2d/medium_expert_v2.yaml → ...ffline/iql/walker2d/medium_expert_v2.yaml
diff --git a/configs/iql/walker2d/medium_replay_v2.yaml → ...ffline/iql/walker2d/medium_replay_v2.yaml b/configs/iql/walker2d/medium_replay_v2.yaml → ...ffline/iql/walker2d/medium_replay_v2.yaml
diff --git a/configs/iql/walker2d/medium_v2.yaml → configs/offline/iql/walker2d/medium_v2.yaml b/configs/iql/walker2d/medium_v2.yaml → configs/offline/iql/walker2d/medium_v2.yaml
diff --git a/configs/iql/walker2d/random_v2.yaml → configs/offline/iql/walker2d/random_v2.yaml b/configs/iql/walker2d/random_v2.yaml → configs/offline/iql/walker2d/random_v2.yaml
diff --git a/configs/lb-sac/halfcheetah/expert_v2.yaml → ...offline/lb-sac/halfcheetah/expert_v2.yaml b/configs/lb-sac/halfcheetah/expert_v2.yaml → ...offline/lb-sac/halfcheetah/expert_v2.yaml
diff --git a/...gs/lb-sac/halfcheetah/full_replay_v2.yaml → ...ne/lb-sac/halfcheetah/full_replay_v2.yaml b/...gs/lb-sac/halfcheetah/full_replay_v2.yaml → ...ne/lb-sac/halfcheetah/full_replay_v2.yaml
diff --git a/.../lb-sac/halfcheetah/medium_expert_v2.yaml → .../lb-sac/halfcheetah/medium_expert_v2.yaml b/.../lb-sac/halfcheetah/medium_expert_v2.yaml → .../lb-sac/halfcheetah/medium_expert_v2.yaml
diff --git a/.../lb-sac/halfcheetah/medium_replay_v2.yaml → .../lb-sac/halfcheetah/medium_replay_v2.yaml b/.../lb-sac/halfcheetah/medium_replay_v2.yaml → .../lb-sac/halfcheetah/medium_replay_v2.yaml
diff --git a/configs/lb-sac/halfcheetah/medium_v2.yaml → ...offline/lb-sac/halfcheetah/medium_v2.yaml b/configs/lb-sac/halfcheetah/medium_v2.yaml → ...offline/lb-sac/halfcheetah/medium_v2.yaml
diff --git a/configs/lb-sac/halfcheetah/random_v2.yaml → ...offline/lb-sac/halfcheetah/random_v2.yaml b/configs/lb-sac/halfcheetah/random_v2.yaml → ...offline/lb-sac/halfcheetah/random_v2.yaml
diff --git a/configs/lb-sac/hopper/expert_v2.yaml → configs/offline/lb-sac/hopper/expert_v2.yaml b/configs/lb-sac/hopper/expert_v2.yaml → configs/offline/lb-sac/hopper/expert_v2.yaml
diff --git a/configs/lb-sac/hopper/full_replay_v2.yaml → ...offline/lb-sac/hopper/full_replay_v2.yaml b/configs/lb-sac/hopper/full_replay_v2.yaml → ...offline/lb-sac/hopper/full_replay_v2.yaml
diff --git a/configs/lb-sac/hopper/medium_expert_v2.yaml → ...fline/lb-sac/hopper/medium_expert_v2.yaml b/configs/lb-sac/hopper/medium_expert_v2.yaml → ...fline/lb-sac/hopper/medium_expert_v2.yaml
diff --git a/configs/lb-sac/hopper/medium_replay_v2.yaml → ...fline/lb-sac/hopper/medium_replay_v2.yaml b/configs/lb-sac/hopper/medium_replay_v2.yaml → ...fline/lb-sac/hopper/medium_replay_v2.yaml
diff --git a/configs/lb-sac/hopper/medium_v2.yaml → configs/offline/lb-sac/hopper/medium_v2.yaml b/configs/lb-sac/hopper/medium_v2.yaml → configs/offline/lb-sac/hopper/medium_v2.yaml
diff --git a/configs/lb-sac/hopper/random_v2.yaml → configs/offline/lb-sac/hopper/random_v2.yaml b/configs/lb-sac/hopper/random_v2.yaml → configs/offline/lb-sac/hopper/random_v2.yaml
diff --git a/configs/lb-sac/walker2d/expert_v2.yaml → ...gs/offline/lb-sac/walker2d/expert_v2.yaml b/configs/lb-sac/walker2d/expert_v2.yaml → ...gs/offline/lb-sac/walker2d/expert_v2.yaml
diff --git a/configs/lb-sac/walker2d/full_replay_v2.yaml → ...fline/lb-sac/walker2d/full_replay_v2.yaml b/configs/lb-sac/walker2d/full_replay_v2.yaml → ...fline/lb-sac/walker2d/full_replay_v2.yaml
diff --git a/...igs/lb-sac/walker2d/medium_expert_v2.yaml → ...ine/lb-sac/walker2d/medium_expert_v2.yaml b/...igs/lb-sac/walker2d/medium_expert_v2.yaml → ...ine/lb-sac/walker2d/medium_expert_v2.yaml
diff --git a/...igs/lb-sac/walker2d/medium_replay_v2.yaml → ...ine/lb-sac/walker2d/medium_replay_v2.yaml b/...igs/lb-sac/walker2d/medium_replay_v2.yaml → ...ine/lb-sac/walker2d/medium_replay_v2.yaml
diff --git a/configs/lb-sac/walker2d/medium_v2.yaml → ...gs/offline/lb-sac/walker2d/medium_v2.yaml b/configs/lb-sac/walker2d/medium_v2.yaml → ...gs/offline/lb-sac/walker2d/medium_v2.yaml
diff --git a/configs/lb-sac/walker2d/random_v2.yaml → ...gs/offline/lb-sac/walker2d/random_v2.yaml b/configs/lb-sac/walker2d/random_v2.yaml → ...gs/offline/lb-sac/walker2d/random_v2.yaml
diff --git a/configs/sac_n/antmaze/large_play_v0.yaml → .../offline/sac_n/antmaze/large_play_v0.yaml b/configs/sac_n/antmaze/large_play_v0.yaml → .../offline/sac_n/antmaze/large_play_v0.yaml
diff --git a/configs/sac_n/antmaze/medium_play_v0.yaml → ...offline/sac_n/antmaze/medium_play_v0.yaml b/configs/sac_n/antmaze/medium_play_v0.yaml → ...offline/sac_n/antmaze/medium_play_v0.yaml
diff --git a/configs/sac_n/antmaze/umaze_v0.yaml → configs/offline/sac_n/antmaze/umaze_v0.yaml b/configs/sac_n/antmaze/umaze_v0.yaml → configs/offline/sac_n/antmaze/umaze_v0.yaml
diff --git a/...s/sac_n/halfcheetah/medium_expert_v2.yaml → ...e/sac_n/halfcheetah/medium_expert_v2.yaml b/...s/sac_n/halfcheetah/medium_expert_v2.yaml → ...e/sac_n/halfcheetah/medium_expert_v2.yaml
diff --git a/...s/sac_n/halfcheetah/medium_replay_v2.yaml → ...e/sac_n/halfcheetah/medium_replay_v2.yaml b/...s/sac_n/halfcheetah/medium_replay_v2.yaml → ...e/sac_n/halfcheetah/medium_replay_v2.yaml
diff --git a/configs/sac_n/halfcheetah/medium_v2.yaml → .../offline/sac_n/halfcheetah/medium_v2.yaml b/configs/sac_n/halfcheetah/medium_v2.yaml → .../offline/sac_n/halfcheetah/medium_v2.yaml
diff --git a/configs/sac_n/hopper/medium_expert_v2.yaml → ...ffline/sac_n/hopper/medium_expert_v2.yaml b/configs/sac_n/hopper/medium_expert_v2.yaml → ...ffline/sac_n/hopper/medium_expert_v2.yaml
diff --git a/configs/sac_n/hopper/medium_replay_v2.yaml → ...ffline/sac_n/hopper/medium_replay_v2.yaml b/configs/sac_n/hopper/medium_replay_v2.yaml → ...ffline/sac_n/hopper/medium_replay_v2.yaml
diff --git a/configs/sac_n/hopper/medium_v2.yaml → configs/offline/sac_n/hopper/medium_v2.yaml b/configs/sac_n/hopper/medium_v2.yaml → configs/offline/sac_n/hopper/medium_v2.yaml
diff --git a/configs/sac_n/maze2d/large_v1.yaml → configs/offline/sac_n/maze2d/large_v1.yaml b/configs/sac_n/maze2d/large_v1.yaml → configs/offline/sac_n/maze2d/large_v1.yaml
diff --git a/configs/sac_n/maze2d/medium_v1.yaml → configs/offline/sac_n/maze2d/medium_v1.yaml b/configs/sac_n/maze2d/medium_v1.yaml → configs/offline/sac_n/maze2d/medium_v1.yaml
diff --git a/configs/sac_n/maze2d/umaze_v1.yaml → configs/offline/sac_n/maze2d/umaze_v1.yaml b/configs/sac_n/maze2d/umaze_v1.yaml → configs/offline/sac_n/maze2d/umaze_v1.yaml
diff --git a/configs/sac_n/walker2d/medium_expert_v2.yaml → ...line/sac_n/walker2d/medium_expert_v2.yaml b/configs/sac_n/walker2d/medium_expert_v2.yaml → ...line/sac_n/walker2d/medium_expert_v2.yaml
diff --git a/configs/sac_n/walker2d/medium_replay_v2.yaml → ...line/sac_n/walker2d/medium_replay_v2.yaml b/configs/sac_n/walker2d/medium_replay_v2.yaml → ...line/sac_n/walker2d/medium_replay_v2.yaml
diff --git a/configs/sac_n/walker2d/medium_v2.yaml → ...igs/offline/sac_n/walker2d/medium_v2.yaml b/configs/sac_n/walker2d/medium_v2.yaml → ...igs/offline/sac_n/walker2d/medium_v2.yaml
diff --git a/configs/td3_bc/antmaze/large_play_v0.yaml → ...offline/td3_bc/antmaze/large_play_v0.yaml b/configs/td3_bc/antmaze/large_play_v0.yaml → ...offline/td3_bc/antmaze/large_play_v0.yaml
diff --git a/configs/td3_bc/antmaze/medium_play_v0.yaml → ...ffline/td3_bc/antmaze/medium_play_v0.yaml b/configs/td3_bc/antmaze/medium_play_v0.yaml → ...ffline/td3_bc/antmaze/medium_play_v0.yaml
diff --git a/configs/td3_bc/antmaze/umaze_v0.yaml → configs/offline/td3_bc/antmaze/umaze_v0.yaml b/configs/td3_bc/antmaze/umaze_v0.yaml → configs/offline/td3_bc/antmaze/umaze_v0.yaml
diff --git a/configs/td3_bc/halfcheetah/expert_v2.yaml → ...offline/td3_bc/halfcheetah/expert_v2.yaml b/configs/td3_bc/halfcheetah/expert_v2.yaml → ...offline/td3_bc/halfcheetah/expert_v2.yaml
diff --git a/...gs/td3_bc/halfcheetah/full_replay_v2.yaml → ...ne/td3_bc/halfcheetah/full_replay_v2.yaml b/...gs/td3_bc/halfcheetah/full_replay_v2.yaml → ...ne/td3_bc/halfcheetah/full_replay_v2.yaml
diff --git a/.../td3_bc/halfcheetah/medium_expert_v2.yaml → .../td3_bc/halfcheetah/medium_expert_v2.yaml b/.../td3_bc/halfcheetah/medium_expert_v2.yaml → .../td3_bc/halfcheetah/medium_expert_v2.yaml
diff --git a/.../td3_bc/halfcheetah/medium_replay_v2.yaml → .../td3_bc/halfcheetah/medium_replay_v2.yaml b/.../td3_bc/halfcheetah/medium_replay_v2.yaml → .../td3_bc/halfcheetah/medium_replay_v2.yaml
diff --git a/configs/td3_bc/halfcheetah/medium_v2.yaml → ...offline/td3_bc/halfcheetah/medium_v2.yaml b/configs/td3_bc/halfcheetah/medium_v2.yaml → ...offline/td3_bc/halfcheetah/medium_v2.yaml
diff --git a/configs/td3_bc/halfcheetah/random_v2.yaml → ...offline/td3_bc/halfcheetah/random_v2.yaml b/configs/td3_bc/halfcheetah/random_v2.yaml → ...offline/td3_bc/halfcheetah/random_v2.yaml
diff --git a/configs/td3_bc/hopper/expert_v2.yaml → configs/offline/td3_bc/hopper/expert_v2.yaml b/configs/td3_bc/hopper/expert_v2.yaml → configs/offline/td3_bc/hopper/expert_v2.yaml
diff --git a/configs/td3_bc/hopper/full_replay_v2.yaml → ...offline/td3_bc/hopper/full_replay_v2.yaml b/configs/td3_bc/hopper/full_replay_v2.yaml → ...offline/td3_bc/hopper/full_replay_v2.yaml
diff --git a/configs/td3_bc/hopper/medium_expert_v2.yaml → ...fline/td3_bc/hopper/medium_expert_v2.yaml b/configs/td3_bc/hopper/medium_expert_v2.yaml → ...fline/td3_bc/hopper/medium_expert_v2.yaml
diff --git a/configs/td3_bc/hopper/medium_replay_v2.yaml → ...fline/td3_bc/hopper/medium_replay_v2.yaml b/configs/td3_bc/hopper/medium_replay_v2.yaml → ...fline/td3_bc/hopper/medium_replay_v2.yaml
diff --git a/configs/td3_bc/hopper/medium_v2.yaml → configs/offline/td3_bc/hopper/medium_v2.yaml b/configs/td3_bc/hopper/medium_v2.yaml → configs/offline/td3_bc/hopper/medium_v2.yaml
diff --git a/configs/td3_bc/hopper/random_v2.yaml → configs/offline/td3_bc/hopper/random_v2.yaml b/configs/td3_bc/hopper/random_v2.yaml → configs/offline/td3_bc/hopper/random_v2.yaml
diff --git a/configs/td3_bc/maze2d/large_dense_v1.yaml → ...offline/td3_bc/maze2d/large_dense_v1.yaml b/configs/td3_bc/maze2d/large_dense_v1.yaml → ...offline/td3_bc/maze2d/large_dense_v1.yaml
diff --git a/configs/td3_bc/maze2d/large_v1.yaml → configs/offline/td3_bc/maze2d/large_v1.yaml b/configs/td3_bc/maze2d/large_v1.yaml → configs/offline/td3_bc/maze2d/large_v1.yaml
diff --git a/configs/td3_bc/maze2d/medium_dense_v1.yaml → ...ffline/td3_bc/maze2d/medium_dense_v1.yaml b/configs/td3_bc/maze2d/medium_dense_v1.yaml → ...ffline/td3_bc/maze2d/medium_dense_v1.yaml
diff --git a/configs/td3_bc/maze2d/medium_v1.yaml → configs/offline/td3_bc/maze2d/medium_v1.yaml b/configs/td3_bc/maze2d/medium_v1.yaml → configs/offline/td3_bc/maze2d/medium_v1.yaml
diff --git a/configs/td3_bc/maze2d/umaze_dense_v1.yaml → ...offline/td3_bc/maze2d/umaze_dense_v1.yaml b/configs/td3_bc/maze2d/umaze_dense_v1.yaml → ...offline/td3_bc/maze2d/umaze_dense_v1.yaml
diff --git a/configs/td3_bc/maze2d/umaze_v1.yaml → configs/offline/td3_bc/maze2d/umaze_v1.yaml b/configs/td3_bc/maze2d/umaze_v1.yaml → configs/offline/td3_bc/maze2d/umaze_v1.yaml
diff --git a/configs/td3_bc/walker2d/expert_v2.yaml → ...gs/offline/td3_bc/walker2d/expert_v2.yaml b/configs/td3_bc/walker2d/expert_v2.yaml → ...gs/offline/td3_bc/walker2d/expert_v2.yaml
diff --git a/configs/td3_bc/walker2d/full_replay_v2.yaml → ...fline/td3_bc/walker2d/full_replay_v2.yaml b/configs/td3_bc/walker2d/full_replay_v2.yaml → ...fline/td3_bc/walker2d/full_replay_v2.yaml
diff --git a/...igs/td3_bc/walker2d/medium_expert_v2.yaml → ...ine/td3_bc/walker2d/medium_expert_v2.yaml b/...igs/td3_bc/walker2d/medium_expert_v2.yaml → ...ine/td3_bc/walker2d/medium_expert_v2.yaml
diff --git a/...igs/td3_bc/walker2d/medium_replay_v2.yaml → ...ine/td3_bc/walker2d/medium_replay_v2.yaml b/...igs/td3_bc/walker2d/medium_replay_v2.yaml → ...ine/td3_bc/walker2d/medium_replay_v2.yaml
diff --git a/configs/td3_bc/walker2d/medium_v2.yaml → ...gs/offline/td3_bc/walker2d/medium_v2.yaml b/configs/td3_bc/walker2d/medium_v2.yaml → ...gs/offline/td3_bc/walker2d/medium_v2.yaml
diff --git a/configs/td3_bc/walker2d/random_v2.yaml → ...gs/offline/td3_bc/walker2d/random_v2.yaml b/configs/td3_bc/walker2d/random_v2.yaml → ...gs/offline/td3_bc/walker2d/random_v2.yaml