Questions about reproducing results #18

humanbins · 2021-07-26T05:16:18Z

Hi, thanks for providing the code of the paper. I have tried to reproduce the result, but get some trouble.
I want to reproduce the result of the default settings in DMC envs, such as walker_walk.
However, it seems weird that my results seem bad. Since the default setting is without distractors, so I remove --resource_files and --img_source in the run_local.sh file:

DOMAIN=walker

TASK=walk
SEED=1
SAVEDIR=./save

MUJOCO_GL="egl" LD_PRELOAD=/usr/lib/x86_64-linux-gnu/libGLEW.so:/usr/lib/nvidia-430/libGL.so.1.7.0  CUDA_VISIBLE_DEVICES=0 nohup python -u train.py \
    --domain_name ${DOMAIN} \
    --task_name ${TASK} \
    --agent 'bisim' \
    --init_steps 1000 \
    --num_train_steps 500000 \
    --encoder_type pixel \
    --decoder_type pixel \
    --transition_model_type 'ensemble' \
    --action_repeat 2 \
    --critic_tau 0.01 \
    --encoder_tau 0.05 \
    --decoder_weight_lambda 0.0000001 \
    --hidden_dim 1024 \
    --total_frames 1000 \
    --num_layers 4 \
    --num_filters 32 \
    --batch_size 128 \
    --init_temperature 0.1 \
    --alpha_lr 1e-4 \
    --alpha_beta 0.5 \
    --work_dir ${SAVEDIR}/${DOMAIN}_${TASK}_${SEED} \
    --save_tb \
    --seed ${SEED} $@ > ${DOMAIN}_${TASK}_${SEED}.log &`

I wonder if I have set some wrong hyperparameters? Or did I miss something important?

Here's some result:

The text was updated successfully, but these errors were encountered:

amyzhang · 2021-07-27T15:56:45Z

hi! you should use the hyperparameters set in the run_cluster.sh file.

humanbins · 2021-07-27T16:13:01Z

Thanks for the fast reply! I have checked the run_cluster.sh file, and I find that the batch_size is 512. I will try it again. Besides, should I reserve the --IMG_SOURCE and --resource_files as run_cluster.sh suggested even if I want to test in the default environments? Or I can ignore these two things?

amyzhang · 2021-07-27T18:47:04Z

you can ignore those, the default values should correspond to the default env with no distractors.

humanbins · 2021-07-28T04:47:19Z

Hi, Amy
I have tried to change the batch_size to 512. And test it with setting seed as 1, the performance seems still bad. At 130K step, the batch_reward is about 0.13, and the reward is about 100-150. I have checked some details, but still have no idea why this happens. Can the different setting of MUJOCO_GL affect the performance? It seems osmesa will cause some error in my server, so I changed to egl.

ywh19980519 · 2022-04-26T12:55:15Z

Hi, Amy I have tried to change the batch_size to 512. And test it with setting seed as 1, the performance seems still bad. At 130K step, the batch_reward is about 0.13, and the reward is about 100-150. I have checked some details, but still have no idea why this happens. Can the different setting of MUJOCO_GL affect the performance? It seems osmesa will cause some error in my server, so I changed to egl.

Hi, humanbins
Have you solved this problem now？Could you please share your experience?Thank you!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Questions about reproducing results #18

Questions about reproducing results #18

humanbins commented Jul 26, 2021

amyzhang commented Jul 27, 2021

humanbins commented Jul 27, 2021

amyzhang commented Jul 27, 2021

humanbins commented Jul 28, 2021 •

edited

Loading

ywh19980519 commented Apr 26, 2022

Questions about reproducing results #18

Questions about reproducing results #18

Comments

humanbins commented Jul 26, 2021

amyzhang commented Jul 27, 2021

humanbins commented Jul 27, 2021

amyzhang commented Jul 27, 2021

humanbins commented Jul 28, 2021 • edited Loading

ywh19980519 commented Apr 26, 2022

humanbins commented Jul 28, 2021 •

edited

Loading