Prerequisites
Problem Description
Reading model weights from disk is slow: we achieve only 2 GB/s on an SSD capable of 12 GB/s.
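To make the gap easy to reproduce, here is a minimal sketch of a sequential-read benchmark. It is not part of sllm_store; the function name `measure_read_throughput` and the 16 MiB chunk size are assumptions chosen for illustration, and it only measures plain file reads, not the actual loading path.

```python
import time


def measure_read_throughput(path, chunk_size=16 * 1024 * 1024):
    """Read `path` sequentially and return (bytes_read, gigabytes_per_second).

    Hypothetical helper for illustration only, not an sllm_store API.
    """
    buf = bytearray(chunk_size)  # reused buffer, avoids a new allocation per chunk
    view = memoryview(buf)
    total = 0
    start = time.perf_counter()
    # buffering=0 opens an unbuffered file, so each readinto is a direct read
    with open(path, "rb", buffering=0) as f:
        while True:
            n = f.readinto(view)
            if not n:  # 0 bytes read means end of file
                break
            total += n
    elapsed = time.perf_counter() - start
    gbps = total / elapsed / 1e9 if elapsed > 0 else float("inf")
    return total, gbps
```

Note that a second run over the same file may be served from the OS page cache and report a much higher number than the disk can deliver, so the first cold read is the one to compare against the SSD's rated bandwidth.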
Proposed Solution
- Add a shared-memory (shm) interface to sllm_store
- Change the model loading pipeline to use state_dict
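A rough sketch of what the two points above could look like together, using only the Python standard library. This is an assumption about the shape of the interface, not the existing sllm_store API: `publish_to_shm` and `attach_state_dict` are hypothetical names, and raw byte buffers stand in for tensors.

```python
from multiprocessing import shared_memory


def publish_to_shm(tensors, name=None):
    """Copy named byte buffers into one shared-memory segment.

    Hypothetical helper. Returns (shm, index), where index maps
    each key to its (offset, size) inside the segment.
    """
    total = sum(len(data) for data in tensors.values())
    # A zero-size segment is not allowed, so allocate at least one byte.
    shm = shared_memory.SharedMemory(create=True, size=max(total, 1), name=name)
    index = {}
    offset = 0
    for key, data in tensors.items():
        shm.buf[offset:offset + len(data)] = data
        index[key] = (offset, len(data))
        offset += len(data)
    return shm, index


def attach_state_dict(shm_name, index):
    """Attach to an existing segment and build a state_dict-like mapping.

    Hypothetical helper. Values are zero-copy memoryview slices
    into the shared segment, keyed the same way as the index.
    """
    shm = shared_memory.SharedMemory(name=shm_name)
    state = {key: shm.buf[off:off + size] for key, (off, size) in index.items()}
    return shm, state
```

With PyTorch, each view could presumably be wrapped with `torch.frombuffer` and the resulting mapping passed to `load_state_dict`, though dtype and shape metadata would need to travel with the index. The views must be released before the segment is closed, otherwise `close()` raises `BufferError`.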
Alternatives Considered
No response
Additional Context
No response
Importance
Nice to have
Usage Statistics (Optional)
No response