onnx-coreml backend #2319

borg323 · 2025-10-16T09:50:26Z

Currently the onnxruntime CoreML provider doesn't support everything required. The following three patches are needed for both fp32 and fp16 with a fixed batch size (the default for now):
~~microsoft/onnxruntime#26443~~ (merged)
microsoft/onnxruntime#26442
~~microsoft/onnxruntime#26462~~ (merged)

For variable batch size, hopefully the fix for issue microsoft/onnxruntime#26328 is simple.

If someone wants to try it out, the default onnxruntime branch should work. The last outstanding patch adds fp16 support for Gather. Gather is the last kernel before the policy output, so running it on the CPU shouldn't cause a huge performance drop.

Copilot

Pull Request Overview

This PR adds support for the CoreML execution provider to the ONNX backend, enabling hardware acceleration on Apple Silicon devices. The changes register a new "onnx-coreml" backend option and configure it to use the MLProgram model format with compute plan profiling.

- Adds COREML as a new OnnxProvider enum value
- Implements CoreML provider configuration with MLProgram format and profiling enabled
- Registers the "onnx-coreml" backend with priority 59
- Updates CI pipeline to build with ONNX runtime and test the CoreML backend on macOS ARM64
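For readers who want to try a comparable configuration outside lc0, here is a minimal sketch using the onnxruntime Python API. It is not the code from this PR, which is C++ in `network_onnx.cc`. The option keys `ModelFormat` and `ProfileComputePlan` are taken from the onnxruntime CoreML execution provider documentation, and the helper names `coreml_providers` and `make_session` are hypothetical.

```python
def coreml_providers(profile_compute_plan=True):
    """Build a providers list suitable for onnxruntime.InferenceSession.

    Assumption: option keys follow the onnxruntime CoreML EP documentation;
    the actual options set by this PR are not shown in the conversation.
    """
    options = {
        # Use the MLProgram model format rather than the older NeuralNetwork one.
        "ModelFormat": "MLProgram",
        # Ask CoreML to log which hardware each operator is dispatched to.
        "ProfileComputePlan": "1" if profile_compute_plan else "0",
    }
    # Listing the CPU provider second lets unsupported nodes fall back to the CPU.
    return [("CoreMLExecutionProvider", options), "CPUExecutionProvider"]


def make_session(model_path):
    """Create a session; requires an onnxruntime build with CoreML support."""
    import onnxruntime as ort  # imported lazily so the helper above has no dependencies

    return ort.InferenceSession(model_path, providers=coreml_providers())
```

Keeping the CPU provider in the list matches the situation described in the PR description, where kernels CoreML cannot handle yet run on the CPU.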

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

| File | Description |
| --- | --- |
| src/neural/backends/network_onnx.cc | Adds CoreML provider enum, configuration logic, and backend registration |
| .circleci/config.yml | Adds ONNX runtime installation, build configuration, and CoreML backend testing on macOS ARM64 |


borg323 · 2025-11-02T14:34:25Z

Some preliminary tests using lc0 bench with 791556 on an Apple M3 Pro.

fp32:

Total time (ms) : 5217
Nodes searched  : 13762
Nodes/second    : 2637

fp16:

Total time (ms) : 5203
Nodes searched  : 20807
Nodes/second    : 3998

fp16 with PR26442 applied:

Total time (ms) : 5179
Nodes searched  : 26833
Nodes/second    : 5180
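
To make the three runs easier to compare, the short sketch below computes relative throughput from the reported nodes/second figures. The numbers are copied from the results above. The dictionary labels and the `speedups` helper are only illustrative.

```python
# Nodes/second as reported by the three bench runs above.
NPS = {
    "fp32": 2637,
    "fp16": 3998,
    "fp16 + PR26442": 5180,
}


def speedups(results, baseline):
    """Return each result's throughput relative to the chosen baseline."""
    base = results[baseline]
    return {name: round(value / base, 2) for name, value in results.items()}


if __name__ == "__main__":
    for name, ratio in speedups(NPS, "fp32").items():
        print(f"{name}: {ratio}x vs fp32")
```

On these figures, fp16 is roughly 1.5x faster than fp32, and fp16 with PR26442 applied is roughly 1.3x faster than plain fp16 (about 1.96x over fp32). These are single preliminary runs, so the ratios are indicative only.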

onnx-coreml test version

00c0f66

borg323 requested a review from Copilot October 16, 2025 09:50

Copilot AI reviewed Oct 16, 2025


borg323 force-pushed the onnx-coreml branch 8 times, most recently from 091e474 to 0f71b12 Compare October 17, 2025 17:20

borg323 added 4 commits October 23, 2025 03:26

verbose output for tests

5ef2c6f

try AllowLowPrecisionAccumulationOnGPU

8b66ffe

use fixed batch size by default

ef094cb

coreml doesn't have selu so use an alternative

95c893b

borg323 force-pushed the onnx-coreml branch from 69d0ed2 to 95c893b Compare October 23, 2025 00:28

Merge branch 'master' into onnx-coreml

f3237a8

try alt mish in native float format

6a3858d

borg323 force-pushed the onnx-coreml branch from 678001e to 6a3858d Compare November 3, 2025 14:36
