RL: Add RolloutManager to allow testing rollouts without worker overhead #1746

@rjpower

Description

Testing rollout generation currently requires the full RolloutWorker infrastructure: JAX/mesh setup, background InferenceServer threads, weight transfer via Arrow Flight, Ray curriculum actors, and rollout storage backends. As a result, tests are slow and carry heavy setup overhead (100+ lines for simple cases), and failures are hard to debug because the relevant logic is buried in async workers.

We need a lightweight RolloutManager class that generates rollouts synchronously for unit tests and debugging, while keeping its output compatible with the production pipeline's formats.
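To make the request concrete, here is a minimal sketch of what a synchronous interface could look like. Everything in it is an assumption for illustration, not the existing codebase: the `Rollout` and `RolloutBatch` dataclasses stand in for whatever the production pipeline emits, and `generate_fn`, `reward_fn`, and `sample_rollouts` are hypothetical names. The only point is that generation happens in the calling thread, with no workers, actors, or storage backends involved.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Sequence


@dataclass(frozen=True)
class Rollout:
    """Hypothetical stand-in for the production rollout record."""
    prompt: str
    response: str
    reward: float


@dataclass
class RolloutBatch:
    """Hypothetical container mirroring a production batch of rollouts."""
    rollouts: List[Rollout] = field(default_factory=list)


class RolloutManager:
    """Sketch only: generates rollouts synchronously in the caller's thread."""

    def __init__(
        self,
        generate_fn: Callable[[str], str],
        reward_fn: Callable[[str, str], float],
    ) -> None:
        # Both callables are injected so a test can pass cheap fakes
        # instead of a real inference server or environment.
        self._generate_fn = generate_fn
        self._reward_fn = reward_fn

    def sample_rollouts(
        self, prompts: Sequence[str], n_generations: int = 1
    ) -> RolloutBatch:
        if n_generations < 1:
            raise ValueError("n_generations must be at least 1")
        batch = RolloutBatch()
        for prompt in prompts:
            for _ in range(n_generations):
                response = self._generate_fn(prompt)
                reward = self._reward_fn(prompt, response)
                batch.rollouts.append(Rollout(prompt, response, reward))
        return batch


# Usage with fake components, as a unit test might do.
def echo_policy(prompt: str) -> str:
    return prompt.upper()


def exact_match_reward(prompt: str, response: str) -> float:
    return 1.0 if response == prompt.upper() else 0.0


manager = RolloutManager(echo_policy, exact_match_reward)
batch = manager.sample_rollouts(["2+2", "hello"], n_generations=2)
```

Injecting the generation and reward callables is one possible design choice: it keeps the test free of JAX, Ray, and Arrow Flight setup, while a production-facing variant could wrap the real inference path behind the same two callables.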

Parent Issue: #1738

Relevant Code

Metadata

Assignees

Labels

p2 (Do before next release), rl

