Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[pallas:triton] The lowering now uses PTX instead of Triton IR #25903

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

copybara-service[bot]
Copy link

[pallas:triton] The lowering now uses PTX instead of Triton IR

This change improves the stability and backward compatibility of Pallas Triton
calls, because unlike PTX, the Triton dialect has no stability guarantees
and does change in practice.

A few notes

  • The implementation only supports CUDA at the moment. More work is needed
    to support ROCm.
  • Pallas Triton no longer delegates compilation to PTX to XLA:GPU. Instead,
    compilation is done via a new PjRt extension, which uses its own compilation
    pipeline mirrored after the one in the Triton Python bindings.
  • The implementation of the old custom call used by Pallas Triton is
    deprecated and will be removed after 6 months as per
    [compatibility guarantees] *

This change improves the stability and backward compatibility of Pallas Triton
calls, because unlike PTX, the Triton dialect has no stability guarantees
and does change in practice.

A few notes

* The implementation only supports CUDA at the moment. More work is needed
  to support ROCm.
* Pallas Triton no longer delegates compilation to PTX to XLA:GPU. Instead,
  compilation is done via a new PjRt extension, which uses its own compilation
  pipeline mirrored after the one in the Triton Python bindings.
* The implementation of the old custom call used by Pallas Triton is
  deprecated and will be removed after 6 months as per
  [compatibility guarantees] [*]

[*]: https://jax.readthedocs.io/en/latest/export/export.html#compatibility-guarantees

PiperOrigin-RevId: 715379979
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant