-
-
Notifications
You must be signed in to change notification settings - Fork 11.8k
[Bugfix] fix --scheduling-policy=priority & n>1 crashes engine #29764
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Signed-off-by: chaunceyjiang <[email protected]>
Signed-off-by: chaunceyjiang <[email protected]>
Signed-off-by: chaunceyjiang <[email protected]>
|
Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits. |
njhill
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @chaunceyjiang! I'm wondering whether it would be better to just add id(request) into the tuple in the priority request queue? I'm not sure about baking specific ordering logic into the Request type itself (and it also kind of duplicates the ordering defined by the tuples).
@njhill Actually, this problem is solved by using the request_id for sorting. When n=2, the request_id format is 0_AAaabb, 1_AAaabb. Using id(request) is just a fallback measure. |
|
@chaunceyjiang I mean including one or the other of these in the tuple in the priority queue heap rather than implementing |
Signed-off-by: chaunceyjiang <[email protected]>
@njhill Done. PTAL. |
Signed-off-by: chaunceyjiang <[email protected]>
Signed-off-by: chaunceyjiang <[email protected]>
|
@chaunceyjiang apologies, after seeing the changes and thinking some more, maybe your original change to make the requests comparable would be better. Since priority and arrival time are properties of the request itself, I think this does make sense as a canonical "default" ordering after all. We can then also simplify the priority heap to contain just the requests rather than wrapping them in tuples. If it's too late in the day for you to update this, I can make the change quickly so that we can still include this in the release. If you don't respond here soon I'll do that! |
Signed-off-by: Nick Hill <[email protected]>
njhill
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks again @chaunceyjiang
Signed-off-by: chaunceyjiang <[email protected]> Signed-off-by: Nick Hill <[email protected]> Co-authored-by: Nick Hill <[email protected]> (cherry picked from commit 0a9caca)
…project#29764) Signed-off-by: chaunceyjiang <[email protected]> Signed-off-by: Nick Hill <[email protected]> Co-authored-by: Nick Hill <[email protected]> Signed-off-by: Xingyu Liu <[email protected]>
FIX #29747
Purpose
test
before
this pr
Test Plan
see e2e
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model.