-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature Request] Add official support for onnxruntime-gpu on ARM64/aarch64 platforms #22903
Comments
Do you have a specific use case? Different GPUs require different things. e.g. CUDA vs AMD vs integrated. Mobile vs server is also very different. You mention AWS Graviton but I don't see a GPU option with that. |
I'm also interested in this feature. We'd like to use onnxruntime with CUDA on Linux and arm64 Nvidia Grace CPUs. It would be great if we could install wheel's from PyPI for this rather than need to build onnxruntime ourselves. I imagine that space pressure on PyPI is a concern, as you already need to remove old versions to make space for new releases (eg. #22747), and adding another architecture would make this worse. If that's a blocker to adding a new architecture then maybe providing builds from an Azure devops feed like you do for CUDA 11 builds would be an option?
G5g instances are Graviton machines with an Nvidia GPU. |
If this is something that would be accepted, I or someone else in my team could help update the CI pipelines to add arm64 CUDA builds. We could probably use the existing |
Describe the feature request
Issue Description
Currently, onnxruntime-gpu package lacks official support for ARM64/aarch64 architecture, limiting GPU acceleration capabilities on increasingly popular ARM-based platforms.
Current Situation
No official pre-built wheels for onnxruntime-gpu on ARM64/aarch64
Limited documentation for ARM64 GPU deployment
Technical Details
Architecture: ARM64/aarch64
Platform: Various (like AWS Graviton, etc.)
Python Versions: 3.8+ compatibility needed
Proposed Solution
Official pre-built wheels for ARM64/aarch64
CI/CD pipeline additions for ARM64 builds
Would appreciate any feedback or guidance on how to make this happen?
Describe scenario use case
Use Case :
Growing adoption of ARM64 in edge/hpc computing.
Cloud deployments on ARM64-based servers (AWS Graviton, etc.)
Machine learning workloads on newer ARM-based development machines
IoT and embedded systems requiring GPU acceleration
The text was updated successfully, but these errors were encountered: