Skip to content

RL: Add Hook/Callback Mechanism to Rollout Workers #1740

@rjpower

Description

@rjpower

RolloutWorker currently has hardcoded evaluation logic in its main run loop that executes micro-evaluations every 10 steps and full curriculum evaluations every 100 steps. This tight coupling makes it difficult to customize evaluation behavior, add monitoring hooks, or test the worker without monkey-patching. We need to replace this with a flexible hook/callback system that allows arbitrary hooks to be registered while preserving existing evaluation behavior through a default EvaluationHook.

Parent Issue: #1738

Relevant Code

Metadata

Metadata

Assignees

Labels

p2Do before next releaserl

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions