Support recording workflow running status #397

pan-x-c · 2025-11-20T10:47:26Z

Description

As the title says

Checklist

Please check the following items before code is ready to be reviewed.

Code has passed all tests
Docstrings have been added/updated in Google Style
Documentation has been updated
Code is ready for review

gemini-code-assist · 2025-11-20T10:47:52Z

Summary of Changes

Hello @pan-x-c, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly improves the observability and debugging capabilities of workflows by introducing a robust system for recording and monitoring their running status. It establishes a clear mechanism for workflows to report their internal state via the ModelWrapper, which is then aggregated and tracked by individual WorkflowRunner instances. The Scheduler is enhanced with a dedicated monitoring loop to periodically collect and expose these detailed runner states, providing a real-time overview of ongoing workflow executions.

Highlights

Enhanced Workflow State Management: The ModelWrapper now includes a dedicated workflow_state dictionary and thread-safe asynchronous methods (set_workflow_state, clean_workflow_state, get_workflow_state) to store and manage workflow-specific progress and data.
Detailed Runner Status Tracking: The WorkflowRunner has been augmented to maintain its own runner_state, capturing key execution details such as runner_id, running_workflow_id, model_version, begin_time, and terminate_time. It also provides a get_runner_state method that aggregates its internal state with the workflow's state from the ModelWrapper.
Centralized Scheduler Monitoring: A new configurable runner_state_report_interval has been added to the ExplorerConfig. The Scheduler now incorporates a _monitor_runner_state_loop that periodically fetches and updates the states of all active RunnerWrapper instances, offering methods (get_key_state, get_runner_state, get_all_state) for comprehensive state retrieval.
Comprehensive Test Coverage: New dummy workflows (DummyWorkflowWithState, StateRecordingWorkflow) and extensive test cases (TestRunnerStateCollection, TestWorkflowStateRecording, test_workflow_runner_get_state) have been introduced to thoroughly validate the correct recording, updating, and retrieval of both workflow and runner states.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces a mechanism for recording and monitoring the running status of workflows. The overall implementation is good, but there are a few critical issues that need to be addressed. I've identified a busy-loop in the scheduler's monitoring task that could lead to high CPU usage, and several race conditions in the workflow runner when updating state, which could result in inconsistent state reads. Additionally, there's some unused code and a leftover print statement in a test. I've provided detailed comments and suggestions to resolve these issues.

trinity/explorer/scheduler.py

trinity/explorer/workflow_runner.py

tests/explorer/workflow_test.py

trinity/common/config.py

trinity/explorer/scheduler.py

pan-x-c · 2025-11-20T11:28:54Z

/unittest-diff

github-actions · 2025-11-20T11:50:52Z

Summary

Tests 📝	Passed ✅	Failed ❌	Skipped ⏭️	Other ❓	Flaky 🍂	Duration ⏱️
82	80	1	1	0	0	20m 19s

Failed Tests

Failed Tests ❌	Fail Message
❌ tests/explorer/workflow_test.py::TestWorkflowRunner::test_workflow_runner	The test failed in the call phase due to an assertion error

Skipped

Tests	Status
tests/explorer/workflow_test.py::TestAgentScopeWorkflowAdapter::test_adapter	skipped ⏭️

Tests

Test Name	Status	Duration
tests/common/config_test.py::TestConfig::test_all_examples_are_valid	✅	33.0s
tests/common/config_test.py::TestConfig::test_chat_template_path	✅	93ms
tests/common/config_test.py::TestConfig::test_config_flatten	✅	41ms
tests/common/config_test.py::TestConfig::test_continue_from_checkpoint_is_valid	✅	191ms
tests/common/config_test.py::TestConfig::test_default_workflow	✅	91ms
tests/common/config_test.py::TestConfig::test_load_default_config	✅	33.3s
tests/common/config_test.py::TestConfig::test_max_token_len_per_gpu_set_correctly	✅	94ms
tests/common/config_test.py::TestConfig::test_optimizer_config_propagation	✅	92ms
tests/common/config_test.py::TestConfig::test_update_config_from_ray_cluster	✅	1.9s
tests/common/experience_test.py::TestEID::test_eid_properties	✅	1ms
tests/common/experience_test.py::TestExperience::test_action_mask_and_logprobs_type	✅	1ms
tests/common/experience_test.py::TestExperience::test_assertions	✅	1ms
tests/common/experience_test.py::TestExperience::test_dpo_experience	✅	1ms
tests/common/experience_test.py::TestExperience::test_gather	✅	1ms
tests/common/experience_test.py::TestExperience::test_hf_datasets_conversion	✅	15ms
tests/common/experience_test.py::TestExperience::test_multi_turn_experience	✅	1ms
tests/common/experience_test.py::TestExperience::test_serialize_deserialize	✅	2ms
tests/common/experience_test.py::TestExperience::test_single_turn_experience	✅	1ms
tests/common/experience_test.py::TestExperience::test_to_dict	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_batch_conversion	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_dpo_experience_batch_conversion	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_experience_model_experience_conversion	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_gather_experiences_with_custom_fields	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_multiturn_experience_batch_converstion	✅	1ms
tests/common/vllm_test.py::ModelWrapperTest_0::test_generate	✅	52.9s
tests/common/vllm_test.py::ModelWrapperTest_1::test_generate	✅	31.5s
tests/common/vllm_test.py::ModelWrapperTest_2::test_generate	✅	43.3s
tests/common/vllm_test.py::TestModelLen_0::test_model_len	✅	17.5s
tests/common/vllm_test.py::TestModelLen_1::test_model_len	✅	16.9s
tests/common/vllm_test.py::TestAPIServer::test_api	✅	23.0s
tests/common/vllm_test.py::TestLogprobs::test_logprobs	✅	19.1s
tests/common/vllm_test.py::TestAsyncAPIServer::test_api_async	✅	22.5s
tests/common/vllm_test.py::TestTokenizer::test_action_mask	✅	232ms
tests/common/vllm_test.py::TestTokenizer::test_action_mask_with_tools	✅	231ms
tests/common/vllm_test.py::TestAPIServerToolCall_0_deepseek_r1::test_api_tool_calls	✅	19.3s
tests/common/vllm_test.py::TestAPIServerToolCall_1::test_api_tool_calls	✅	17.3s
tests/common/vllm_test.py::TestSuperLongGeneration::test_generate	✅	52.8s
tests/explorer/explorer_test.py::TestExplorerCountdownEval::test_explorer	✅	1m 4s
tests/explorer/explorer_test.py::TestExplorerGSM8KRULERNoEval::test_explorer	✅	1m 42s
tests/explorer/explorer_test.py::TestExplorerGSM8k::test_explorer	✅	3m 37s
tests/explorer/explorer_test.py::ServeTest::test_serve	✅	1m 21s
tests/explorer/scheduler_test.py::SchedulerTest::test_async_workflow	✅	12.7s
tests/explorer/scheduler_test.py::SchedulerTest::test_concurrent_operations	✅	12.2s
tests/explorer/scheduler_test.py::SchedulerTest::test_dynamic_timeout	✅	19.4s
tests/explorer/scheduler_test.py::SchedulerTest::test_get_results	✅	27.5s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_non_repeatable_workflow_0	✅	12.4s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_non_repeatable_workflow_1	✅	12.2s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_repeatable_workflow_0	✅	12.4s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_repeatable_workflow_1	✅	12.3s
tests/explorer/scheduler_test.py::SchedulerTest::test_multi_step_execution	✅	12.2s
tests/explorer/scheduler_test.py::SchedulerTest::test_non_repeatable_workflow	✅	12.4s
tests/explorer/scheduler_test.py::SchedulerTest::test_over_rollout_min_wait	✅	16.1s
tests/explorer/scheduler_test.py::SchedulerTest::test_scheduler_all_methods	✅	22.4s
tests/explorer/scheduler_test.py::SchedulerTest::test_scheduler_restart_after_stop	✅	23.9s
tests/explorer/scheduler_test.py::SchedulerTest::test_split_tasks	✅	15.7s
tests/explorer/scheduler_test.py::SchedulerTest::test_stepwise_experience_eid	✅	32.5s
tests/explorer/scheduler_test.py::SchedulerTest::test_wait_all	✅	15.2s
tests/explorer/scheduler_test.py::SchedulerTest::test_wait_all_timeout_with_multi_batch	✅	20.9s
tests/explorer/scheduler_test.py::TestRunnerStateCollection::test_runner_state_collection	✅	17.4s
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_reward_propagation_workflow_0	✅	2ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_reward_propagation_workflow_1	✅	602ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_step_wise_reward_workflow_0	✅	1ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_step_wise_reward_workflow_1	✅	1.0s
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_workflows_raise_error	✅	1ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_workflows_stop_at_max_env_steps	✅	1.0s
tests/explorer/workflow_test.py::WorkflowTest::test_gsm8k_workflow	✅	35ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_boxed_workflow	✅	24ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_complex_workflow	✅	176ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_eval_workflow	✅	4ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_fraction_workflow	✅	13ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_workflow	✅	8ms
tests/explorer/workflow_test.py::WorkflowTest::test_rm_gallery_workflow	✅	87ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_repeatable_0	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_repeatable_1	✅	101ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_resettable_0	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_resettable_1	✅	202ms
tests/explorer/workflow_test.py::MultiTurnWorkflowTest_0::test_multi_turn_workflow	✅	15.1s
tests/explorer/workflow_test.py::MultiTurnWorkflowTest_1::test_multi_turn_workflow	✅	14.5s
tests/explorer/workflow_test.py::TestWorkflowStateRecording::test_workflow_state_recording	✅	4.0s
tests/explorer/workflow_test.py::TestAgentScopeWorkflowAdapter::test_adapter	⏭️	1ms
tests/explorer/workflow_test.py::TestWorkflowRunner::test_workflow_runner	❌	200ms
tests/explorer/workflow_test.py::TestWorkflowRunner::test_workflow_runner_get_state	✅	8.1s

Github Test Reporter by CTRF 💚

pan-x-c · 2025-11-20T11:56:38Z

/unittest-module-explorer

github-actions · 2025-11-20T12:12:26Z

Summary

Tests 📝	Passed ✅	Failed ❌	Skipped ⏭️	Other ❓	Flaky 🍂	Duration ⏱️
45	44	0	1	0	0	14m 11s

Skipped

Tests	Status
tests/explorer/workflow_test.py::TestAgentScopeWorkflowAdapter::test_adapter	skipped ⏭️

Tests

Test Name	Status	Duration
tests/explorer/explorer_test.py::TestExplorerCountdownEval::test_explorer	✅	1m 13s
tests/explorer/explorer_test.py::TestExplorerGSM8KRULERNoEval::test_explorer	✅	1m 44s
tests/explorer/explorer_test.py::TestExplorerGSM8k::test_explorer	✅	3m 36s
tests/explorer/explorer_test.py::ServeTest::test_serve	✅	1m 27s
tests/explorer/scheduler_test.py::SchedulerTest::test_async_workflow	✅	12.4s
tests/explorer/scheduler_test.py::SchedulerTest::test_concurrent_operations	✅	12.4s
tests/explorer/scheduler_test.py::SchedulerTest::test_dynamic_timeout	✅	19.6s
tests/explorer/scheduler_test.py::SchedulerTest::test_get_results	✅	27.7s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_non_repeatable_workflow_0	✅	12.4s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_non_repeatable_workflow_1	✅	12.5s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_repeatable_workflow_0	✅	12.6s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_repeatable_workflow_1	✅	12.4s
tests/explorer/scheduler_test.py::SchedulerTest::test_multi_step_execution	✅	12.6s
tests/explorer/scheduler_test.py::SchedulerTest::test_non_repeatable_workflow	✅	12.5s
tests/explorer/scheduler_test.py::SchedulerTest::test_over_rollout_min_wait	✅	16.4s
tests/explorer/scheduler_test.py::SchedulerTest::test_scheduler_all_methods	✅	22.1s
tests/explorer/scheduler_test.py::SchedulerTest::test_scheduler_restart_after_stop	✅	23.3s
tests/explorer/scheduler_test.py::SchedulerTest::test_split_tasks	✅	15.6s
tests/explorer/scheduler_test.py::SchedulerTest::test_stepwise_experience_eid	✅	32.1s
tests/explorer/scheduler_test.py::SchedulerTest::test_wait_all	✅	15.5s
tests/explorer/scheduler_test.py::SchedulerTest::test_wait_all_timeout_with_multi_batch	✅	21.1s
tests/explorer/scheduler_test.py::TestRunnerStateCollection::test_runner_state_collection	✅	17.2s
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_reward_propagation_workflow_0	✅	1ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_reward_propagation_workflow_1	✅	602ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_step_wise_reward_workflow_0	✅	2ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_step_wise_reward_workflow_1	✅	1.0s
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_workflows_raise_error	✅	1ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_workflows_stop_at_max_env_steps	✅	1.0s
tests/explorer/workflow_test.py::WorkflowTest::test_gsm8k_workflow	✅	34ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_boxed_workflow	✅	25ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_complex_workflow	✅	174ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_eval_workflow	✅	3ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_fraction_workflow	✅	13ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_workflow	✅	8ms
tests/explorer/workflow_test.py::WorkflowTest::test_rm_gallery_workflow	✅	85ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_repeatable_0	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_repeatable_1	✅	100ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_resettable_0	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_resettable_1	✅	201ms
tests/explorer/workflow_test.py::MultiTurnWorkflowTest_0::test_multi_turn_workflow	✅	14.9s
tests/explorer/workflow_test.py::MultiTurnWorkflowTest_1::test_multi_turn_workflow	✅	14.8s
tests/explorer/workflow_test.py::TestWorkflowStateRecording::test_workflow_state_recording	✅	4.0s
tests/explorer/workflow_test.py::TestAgentScopeWorkflowAdapter::test_adapter	⏭️	1ms
tests/explorer/workflow_test.py::TestWorkflowRunner::test_workflow_runner	✅	293ms
tests/explorer/workflow_test.py::TestWorkflowRunner::test_workflow_runner_get_state	✅	8.1s

Github Test Reporter by CTRF 💚

pan-x-c · 2025-11-21T04:08:26Z

/unittest-diff

github-actions · 2025-11-21T04:32:19Z

Summary

Tests 📝	Passed ✅	Failed ❌	Skipped ⏭️	Other ❓	Flaky 🍂	Duration ⏱️
82	81	0	1	0	0	22m 12s

Skipped

Tests	Status
tests/explorer/workflow_test.py::TestAgentScopeWorkflowAdapter::test_adapter	skipped ⏭️

Tests

Test Name	Status	Duration
tests/common/config_test.py::TestConfig::test_all_examples_are_valid	✅	33.0s
tests/common/config_test.py::TestConfig::test_chat_template_path	✅	95ms
tests/common/config_test.py::TestConfig::test_config_flatten	✅	41ms
tests/common/config_test.py::TestConfig::test_continue_from_checkpoint_is_valid	✅	195ms
tests/common/config_test.py::TestConfig::test_default_workflow	✅	93ms
tests/common/config_test.py::TestConfig::test_load_default_config	✅	38.6s
tests/common/config_test.py::TestConfig::test_max_token_len_per_gpu_set_correctly	✅	93ms
tests/common/config_test.py::TestConfig::test_optimizer_config_propagation	✅	92ms
tests/common/config_test.py::TestConfig::test_update_config_from_ray_cluster	✅	1.9s
tests/common/experience_test.py::TestEID::test_eid_properties	✅	1ms
tests/common/experience_test.py::TestExperience::test_action_mask_and_logprobs_type	✅	1ms
tests/common/experience_test.py::TestExperience::test_assertions	✅	1ms
tests/common/experience_test.py::TestExperience::test_dpo_experience	✅	1ms
tests/common/experience_test.py::TestExperience::test_gather	✅	1ms
tests/common/experience_test.py::TestExperience::test_hf_datasets_conversion	✅	15ms
tests/common/experience_test.py::TestExperience::test_multi_turn_experience	✅	1ms
tests/common/experience_test.py::TestExperience::test_serialize_deserialize	✅	2ms
tests/common/experience_test.py::TestExperience::test_single_turn_experience	✅	1ms
tests/common/experience_test.py::TestExperience::test_to_dict	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_batch_conversion	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_dpo_experience_batch_conversion	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_experience_model_experience_conversion	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_gather_experiences_with_custom_fields	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_multiturn_experience_batch_converstion	✅	1ms
tests/common/vllm_test.py::ModelWrapperTest_0::test_generate	✅	53.3s
tests/common/vllm_test.py::ModelWrapperTest_1::test_generate	✅	32.1s
tests/common/vllm_test.py::ModelWrapperTest_2::test_generate	✅	43.5s
tests/common/vllm_test.py::TestModelLen_0::test_model_len	✅	17.5s
tests/common/vllm_test.py::TestModelLen_1::test_model_len	✅	17.4s
tests/common/vllm_test.py::TestAPIServer::test_api	✅	22.8s
tests/common/vllm_test.py::TestLogprobs::test_logprobs	✅	19.3s
tests/common/vllm_test.py::TestAsyncAPIServer::test_api_async	✅	22.6s
tests/common/vllm_test.py::TestTokenizer::test_action_mask	✅	236ms
tests/common/vllm_test.py::TestTokenizer::test_action_mask_with_tools	✅	238ms
tests/common/vllm_test.py::TestAPIServerToolCall_0_deepseek_r1::test_api_tool_calls	✅	19.4s
tests/common/vllm_test.py::TestAPIServerToolCall_1::test_api_tool_calls	✅	17.4s
tests/common/vllm_test.py::TestSuperLongGeneration::test_generate	✅	2m 31s
tests/explorer/explorer_test.py::TestExplorerCountdownEval::test_explorer	✅	1m 11s
tests/explorer/explorer_test.py::TestExplorerGSM8KRULERNoEval::test_explorer	✅	1m 42s
tests/explorer/explorer_test.py::TestExplorerGSM8k::test_explorer	✅	3m 38s
tests/explorer/explorer_test.py::ServeTest::test_serve	✅	1m 19s
tests/explorer/scheduler_test.py::SchedulerTest::test_async_workflow	✅	12.2s
tests/explorer/scheduler_test.py::SchedulerTest::test_concurrent_operations	✅	12.4s
tests/explorer/scheduler_test.py::SchedulerTest::test_dynamic_timeout	✅	19.5s
tests/explorer/scheduler_test.py::SchedulerTest::test_get_results	✅	27.7s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_non_repeatable_workflow_0	✅	12.4s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_non_repeatable_workflow_1	✅	12.3s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_repeatable_workflow_0	✅	12.2s
tests/explorer/scheduler_test.py::SchedulerTest::test_metric_calculation_with_repeatable_workflow_1	✅	12.5s
tests/explorer/scheduler_test.py::SchedulerTest::test_multi_step_execution	✅	12.7s
tests/explorer/scheduler_test.py::SchedulerTest::test_non_repeatable_workflow	✅	12.5s
tests/explorer/scheduler_test.py::SchedulerTest::test_over_rollout_min_wait	✅	16.4s
tests/explorer/scheduler_test.py::SchedulerTest::test_scheduler_all_methods	✅	22.5s
tests/explorer/scheduler_test.py::SchedulerTest::test_scheduler_restart_after_stop	✅	23.7s
tests/explorer/scheduler_test.py::SchedulerTest::test_split_tasks	✅	15.8s
tests/explorer/scheduler_test.py::SchedulerTest::test_stepwise_experience_eid	✅	32.5s
tests/explorer/scheduler_test.py::SchedulerTest::test_wait_all	✅	15.5s
tests/explorer/scheduler_test.py::SchedulerTest::test_wait_all_timeout_with_multi_batch	✅	21.0s
tests/explorer/scheduler_test.py::TestRunnerStateCollection::test_runner_state_collection	✅	17.2s
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_reward_propagation_workflow_0	✅	2ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_reward_propagation_workflow_1	✅	602ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_step_wise_reward_workflow_0	✅	2ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_step_wise_reward_workflow_1	✅	1.0s
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_workflows_raise_error	✅	1ms
tests/explorer/step_wise_workflow_test.py::WorkflowTest::test_workflows_stop_at_max_env_steps	✅	1.0s
tests/explorer/workflow_test.py::WorkflowTest::test_gsm8k_workflow	✅	34ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_boxed_workflow	✅	24ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_complex_workflow	✅	176ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_eval_workflow	✅	423ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_fraction_workflow	✅	14ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_workflow	✅	8ms
tests/explorer/workflow_test.py::WorkflowTest::test_rm_gallery_workflow	✅	92ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_repeatable_0	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_repeatable_1	✅	102ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_resettable_0	✅	2ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_resettable_1	✅	203ms
tests/explorer/workflow_test.py::MultiTurnWorkflowTest_0::test_multi_turn_workflow	✅	14.7s
tests/explorer/workflow_test.py::MultiTurnWorkflowTest_1::test_multi_turn_workflow	✅	14.5s
tests/explorer/workflow_test.py::TestWorkflowStateRecording::test_workflow_state_recording	✅	4.0s
tests/explorer/workflow_test.py::TestAgentScopeWorkflowAdapter::test_adapter	⏭️	1ms
tests/explorer/workflow_test.py::TestWorkflowRunner::test_workflow_runner	✅	300ms
tests/explorer/workflow_test.py::TestWorkflowRunner::test_workflow_runner_get_state	✅	8.1s

Github Test Reporter by CTRF 💚

hiyuchang · 2025-11-21T05:53:10Z

docs/sphinx_doc/source/tutorial/trinity_configs.md

  dynamic_timeout:
    enable: false
    ratio: 3.0
+  runner_state_report_interval: 0


Minor comment: 默认是-1会不会好一些？
runner_state_report_interval=0可以误以为每隔0秒就report一次（i.e. 持续report）

pan-x-c added 3 commits November 20, 2025 16:11

fix conflict

731eb8a

support collecting workflow status

b438bfc

fix pre-commit

c104202

gemini-code-assist bot reviewed Nov 20, 2025

View reviewed changes

pan-x-c added 2 commits November 20, 2025 19:16

fix comments

b7143ac

fix comments

5a44c03

fix tests

14939cf

pan-x-c added 3 commits November 20, 2025 20:52

print runner state

b6d3a27

update doc

b60ae7f

simplify workflow runner state

9faf849

Merge branch 'main' into feature/report_workflow_state

fb525eb

hiyuchang reviewed Nov 21, 2025

View reviewed changes

remove confusing log

54179be

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support recording workflow running status #397

Support recording workflow running status #397

Uh oh!

pan-x-c commented Nov 20, 2025

Uh oh!

gemini-code-assist bot commented Nov 20, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pan-x-c commented Nov 20, 2025

Uh oh!

github-actions bot commented Nov 20, 2025

Uh oh!

pan-x-c commented Nov 20, 2025

Uh oh!

github-actions bot commented Nov 20, 2025

Uh oh!

pan-x-c commented Nov 21, 2025

Uh oh!

github-actions bot commented Nov 21, 2025

Uh oh!

hiyuchang Nov 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Support recording workflow running status #397

Are you sure you want to change the base?

Support recording workflow running status #397

Uh oh!

Conversation

pan-x-c commented Nov 20, 2025

Description

Checklist

Uh oh!

gemini-code-assist bot commented Nov 20, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pan-x-c commented Nov 20, 2025

Uh oh!

github-actions bot commented Nov 20, 2025

Summary

Failed Tests

Skipped

Tests

Uh oh!

pan-x-c commented Nov 20, 2025

Uh oh!

github-actions bot commented Nov 20, 2025

Summary

Skipped

Tests

Uh oh!

pan-x-c commented Nov 21, 2025

Uh oh!

github-actions bot commented Nov 21, 2025

Summary

Skipped

Tests

Uh oh!

hiyuchang Nov 21, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants