Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training quickly diverges with default settings #3

Closed
dementrock opened this issue Mar 10, 2025 · 4 comments
Closed

Training quickly diverges with default settings #3

dementrock opened this issue Mar 10, 2025 · 4 comments
Assignees

Comments

@dementrock
Copy link

This is what I got following the default training command

${ISAACLAB_PATH:?}/isaaclab.sh -p scripts/rsl_rl/train_teacher_policy.py \
    --num_envs 10 \
    --reference_motion_path neural_wbc/data/data/motions/stable_punch.pkl
Image

Zooming in

Image

What should be changed?

@huihuaNvidia2023
Copy link
Collaborator

Just to clarify, are you using the retargeted punch.pkl dataset?

@huihuaNvidia2023
Copy link
Collaborator

huihuaNvidia2023 commented Mar 10, 2025

I just followed the tutorial and trained on the punch.pkl locally. Here is the tensor board curve

Image. It converges quite well even after 2000 steps.

In the Omnih2o homepage, it does provide a dataset via CMU

Here is the command I run

${ISAACLAB_PATH:?}/isaaclab.sh -p scripts/rsl_rl/train_teacher_policy.py     --num_envs 1024     --reference_motion_path neural_wbc/data/data/motions/punch.pkl  --headless

@huihuaNvidia2023 huihuaNvidia2023 self-assigned this Mar 10, 2025
@dementrock
Copy link
Author

Thanks @huihuaNvidia2023 ! To confirm did you get the pkl from https://github.com/LeCAR-Lab/human2humanoid/blob/main/legged_gym/resources/motions/h1/stable_punch.pkl?

I'll try your setting override of num_envs=1024

@dementrock
Copy link
Author

I think I figured out the reason... It was indeed because of the num_envs option. To avoid confusion, it might be better to use this 1024 default value in https://github.com/NVlabs/HOVER?tab=readme-ov-file#teacher-policy rather than using 10?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants