-
Notifications
You must be signed in to change notification settings - Fork 154
Open
Labels
bugSomething isn't workingSomething isn't working
Description
In the RL training pipeline (for SAC and PPO), during evaluation runs, there seems to be an issue with computed/tracked mse values. They neither match with mse in "info" from env.step nor with rmse results from directly policy evaluation through rl_experiment.sh (A deeper dive suggests issue in how mse is handled in "RecordEpisodeStatistics")
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't working