Segmentation fault during orbax checkpoint restore for pi0_fast_droid #328
Unanswered
hypercosmac
asked this question in
Q&A
Replies: 0 comments
I'm running into a segmentation fault when building a Docker image from a slightly modified serve_policy.Dockerfile to run the pi0_fast_droid model for inference on Modal GPUs. The issue appears to be in the Orbax checkpoint restore.
I've double-checked the paths (e.g. the metadata is read in successfully); screenshot attached. I'm not sure whether it's a float32/float16 issue. Has anyone faced anything similar?
This is where the code stalls before running into the seg fault:
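Separately, for anyone trying to reproduce this: a segfault normally kills the process without a Python traceback, so one way to see which Python call was active is the standard-library `faulthandler` module. The snippet below is only a minimal sketch. The helper name `enable_crash_traces` and the log path are hypothetical, and it assumes the helper is called at the very top of the serving script, before the checkpoint is restored.

```python
import faulthandler
import os
import tempfile

# Kept at module level so the file stays open for the life of the process;
# faulthandler writes to its file descriptor when the crash happens.
_trace_file = None


def enable_crash_traces(path=None):
    """Dump every thread's Python traceback on SIGSEGV and other fatal signals."""
    global _trace_file
    if path is None:
        # Hypothetical default location; any writable path works.
        path = os.path.join(tempfile.gettempdir(), "segfault_trace.log")
    _trace_file = open(path, "w")
    faulthandler.enable(file=_trace_file, all_threads=True)
    return path


if __name__ == "__main__":
    log_path = enable_crash_traces()
    print(f"fatal-signal tracebacks will be written to {log_path}")
    # Placeholder: run the policy / checkpoint loading code here.
```

The same handler can be switched on without touching the code by setting the `PYTHONFAULTHANDLER=1` environment variable or by running `python -X faulthandler`, in which case the traceback goes to stderr. It only shows the Python frames that led into the crashing native call, so it narrows down where the restore fails but not why.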