Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

FastSortFusedNew Function Hangs on Compute Capability 7.5 (Q6000) but Runs Fine on Compute Capability 8.6 (3090/A4500) #55

Open
xiaoc57 opened this issue Aug 21, 2024 · 0 comments

Comments

@xiaoc57
Copy link

xiaoc57 commented Aug 21, 2024

I have converted a C++ codebase into a PyTorch extension, and it runs perfectly on GPUs with Compute Capability 8.6, specifically on the RTX 3090 and A4500. However, when testing on a Quadro RTX 6000 with Compute Capability 7.5, the FastSortFusedNew function hangs. The function either stalls upon first entry or hangs immediately.

Details:

PyTorch Version: 2.1.0
CUDA Version: 11.8
Operating System (working): Ubuntu 20.04
Operating System (failing): Ubuntu 18.04 or 22.04
I suspect the issue might not be related to the OS version since I encountered the same problem on both Ubuntu 18.04 and 22.04. The function runs without issues on the same codebase on GPUs with Compute Capability 8.6.

Has anyone experienced a similar issue, or does anyone have insights into why this might be happening?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant