Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update range of gpu arch #23309

Open
wants to merge 2 commits into
base: main
Choose a base branch
from
Open

Update range of gpu arch #23309

wants to merge 2 commits into from

Conversation

yf711
Copy link
Contributor

@yf711 yf711 commented Jan 9, 2025

Description

Remove deprecated gpu arch and reduce nuget/python package size (latest TRT supports sm75 Turing and newer arch)

Test on pkg CI Python-cuda12 Nuget-cuda12
Before Linux: 279MB Win: 267MB Linux: 247MB Win: 235MB
After Linux: 174MB Win: 162MB Linux: 168MB Win: 156MB

Motivation and Context

snnn
snnn previously approved these changes Jan 9, 2025
@tianleiwu
Copy link
Contributor

If we drop older arch, shall we also drop ort package for cuda 11.8 in next release?

@snnn
Copy link
Member

snnn commented Jan 9, 2025

If we drop older arch, shall we also drop ort package for cuda 11.8 in next release?

I highly recommend doing so. Now we only have two people working on build pipelines. We should focus more on the main targets.

@yf711 yf711 requested a review from jywu-msft January 10, 2025 00:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants