Skip to content

Commit 502826c

Browse files
committed
Compile PyTorch, torchvision, onnxruntime from source for numpy 2.x support
- PyTorch 2.8.0 with Jetson Orin optimizations (arch 8.7, ARM+CUDA linker optimization) - Disabled unnecessary features (NCCL, QNNPACK, XNNPACK, FBGEMM, Kineto, etc.) - torchvision 0.23.0 with CUDA support - onnxruntime 1.20.0 with TensorRT EP - flash-attn 2.8.3 (latest version) Performance: 65.7 FPS (vs 62.2 FPS baseline = 5.6% faster) Image size: 6.74GB (vs 8.28GB baseline = 18.6% smaller) Size optimizations: - cuDNN/TensorRT symlink preservation: ~2GB saved - Remove test directories, dev tools, examples: ~500MB saved - Conservative cleanup preserving public APIs (numpy.testing, torch.testing) TensorRT optimization: - FP16 precision enabled - Engine caching enabled with 2GB workspace - Builder optimization level 3 - Aux streams optimized for memory efficiency
1 parent 12882e3 commit 502826c

File tree

2 files changed

+259
-112
lines changed

2 files changed

+259
-112
lines changed

.github/workflows/docker.jetson.6.2.0.yml

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -19,8 +19,9 @@ env:
1919
jobs:
2020
docker:
2121
runs-on:
22-
labels: ubuntu-latest
23-
timeout-minutes: 120
22+
labels: depot-ubuntu-24.04-4
23+
group: public-depot
24+
timeout-minutes: 360
2425
permissions:
2526
id-token: write
2627
contents: read

0 commit comments

Comments
 (0)