
Introduce PerAxis Quantized constraint for StableHLO Quantized OPs #2007

Merged (2 commits) on Feb 27, 2024

Conversation

@abhigunj (Member) commented on Feb 14, 2024:

StableHLO ops supporting quantization can be categorized into the following three types:
a. only per-tensor quantized tensors
b. only per-axis quantized tensors (no op in this category for now)
c. both per-tensor and per-axis quantized tensors
The per-axis constraint from this PR allows only per-tensor quantized tensor inputs to type (a) ops and rejects per-axis quantized tensors.
Also:
Added negative test cases to validate this behavior
Added positive test cases for per-tensor and per-axis quantized tensor support

Excluded OPs

  • Deprecated: Broadcast and Call
  • Not specced:
     dynamic_iota, create_token, dynamic_broadcast_in_dim, cross-replica-sum, einsum, unary_einsum, dynamic_reshape, set_dimension_size, trace, return, torch_index_select, real_dynamic_slice, dynamic_pad, dynamic_gather, dynamic_conv, dynamic_reshape_shape, cstr_reshapable
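For reference, the distinction between the two kinds of quantized tensors can be illustrated in MLIR type notation (the scales and zero points below are illustrative, not taken from this PR's tests):

```mlir
// Per-tensor quantized: a single scale (0.1) and zero point (-30)
// applies to every element of the tensor.
tensor<2x2x!quant.uniform<i8:f32, 0.1:-30>>

// Per-axis quantized: quantization_dimension = 1, with one
// scale/zero-point pair per slice along that dimension.
tensor<2x2x!quant.uniform<i8:f32:1, {0.1:-30, 0.2:-20}>>
```

Type (a) ops accept only the first form; the constraint added in this PR rejects the second form for those ops.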

@GleasonK (Member) commented:

Delegating my review to @sdasgup3. I took a scan over the code / tests and from a high level everything LGTM.

@sdasgup3 (Member) left a comment:

Thanks again for the work. Here are some of my initial review comments, which I want to publish soon. I will check the test files in my follow-up review.

Here is the remaining list of ops which are not yet covered; the following support needs to be added.

  • constant: currently uses HLO_StaticShapeTensor, which per this change has a per-tensor output type. It should be both.
  • iota: uses HLO_StaticShapeIntFpOrComplexTensor, which is not updated. iota is expected to support per-tensor.
  • infeed: accepts both per-tensor and per-axis.
  • recv: should take both per-tensor and per-axis.
  • If, Case, While: the output supports both.
  • GetTupleElement / Tuple: supports both.
  • broadcast: update the type constraints, treating the in/out types similarly to broadcast_in_dim.
  • custom-call: both.
  • fft: fix the result type as per spec.
  • transpose: both are supported.
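To illustrate the per-axis case for transpose, a sketch in MLIR (types are illustrative, and attribute syntax may vary by StableHLO version): when the operand is per-axis quantized, the result's quantization_dimension is the position the operand's quantized dimension is permuted to.

```mlir
// Operand is per-axis quantized along dim 0 (two scales for the size-2 dim).
// After transposing with permutation [1, 0], that dimension becomes result
// dim 1, so the result type is quantized along dimension 1.
%0 = "stablehlo.transpose"(%arg0) {
  permutation = array<i64: 1, 0>
} : (tensor<2x3x!quant.uniform<i8:f32:0, {0.1:-30, 0.5:-20}>>)
 -> tensor<3x2x!quant.uniform<i8:f32:1, {0.1:-30, 0.5:-20}>>
```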

Nit:
 -  dynamic_iota, create_token, dynamic_broadcast_in_dim, cross-replica-sum, einsum, unary_einsum, dynamic_reshape, set_dimension_size, trace, return, torch_index_select, real_dynamic_slice, dynamic_pad, dynamic_gather, dynamic_conv, dynamic_reshape_shape, cstr_reshapable: Let's mention in the description that we are excluding the above stablehlo ops as they are not specced.

As follow-up PRs:

  • collective_broadcast should also support per-tensor type like other distribution ops.
  • get_dimension_size: fix the spec.

Please feel free to open PRs for the above items.

@abhigunj (Member, Author) commented:

> [quoting the review comment above on constant, iota, infeed, recv, If/Case/While, GetTupleElement/Tuple, broadcast, custom-call, and fft]

Thanks for the thorough review. These ops fall into the quantized-result category, which I incorrectly ignored during the audit. Updated the op defs and the audit sheet.

  • transpose: both are supported.

Already taken care of?

> [quoting the nit above about mentioning the excluded unspecced ops in the PR description]

Done

> As follow-up PRs:
>
>   • collective_broadcast should also support per-tensor type like other distribution ops.
>   • get_dimension_size: fix the spec.
>
> Please feel free to open PRs for the above items.

Yes, will create separate PR as it involves changes to the spec.

@sdasgup3 (Member) left a comment:

LGTM with some minor comments.

@GleasonK (Member) left a comment:

I'll think more on naming, but that shouldn't block this from going in since tablegen variable names don't impact the generated code / impl.

@abhigunj abhigunj merged commit 19fd17d into openxla:main Feb 27, 2024
10 checks passed
abhigunj added a commit that referenced this pull request on Feb 27, 2024:

and make Quantized test compatible. This was missed while resolving a merge conflict for #2007.
GleasonK added a commit that referenced this pull request Feb 27, 2024
Custom call op permits just about everything: Tuple, Tensor, Token,
Unranked, all quantization, etc. This change restores its ability to
operate on unranked tensors.

This was squashed in a merge conflict between #2045 and #2007. Adding a
testpoint to avoid this issue going forward.
@abhigunj abhigunj added the Migrate to MHLO PR that needs to be migrated to MLIR-HLO label Feb 28, 2024
abhigunj added a commit that referenced this pull request Mar 5, 2024
* made `isCompatibleElementTypeForHloTypeInference` stricter to return an error for the {not quantized, quantized} and {per-axis quantized, per-tensor quantized} cases
* `AddOp` VHLO test failures: addressed test failures because the {not quantized, quantized} mix is not allowed
* corrected traits for `CholeskyOp` and `ClampOp` to match them with the spec

Note: This PR is based on the in-review PR #2007.

A follow-up PR will add/update op verifiers for ops which need special handling.
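To make the stricter element-type compatibility concrete, here are illustrative operand type pairs (examples, not taken from the test files) of the kind the tightened check rejects or accepts:

```mlir
// Rejected: {not quantized, quantized} mix.
//   tensor<2xf32>                                        vs
//   tensor<2x!quant.uniform<i8:f32, 0.1:0>>

// Rejected: {per-axis quantized, per-tensor quantized} mix.
//   tensor<2x!quant.uniform<i8:f32:0, {0.1:0, 0.2:0}>>   vs
//   tensor<2x!quant.uniform<i8:f32, 0.1:0>>

// Accepted: matching per-tensor quantized element types.
//   tensor<2x!quant.uniform<i8:f32, 0.1:0>>              vs
//   tensor<2x!quant.uniform<i8:f32, 0.1:0>>
```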
Labels: Migrate to MHLO (PR that needs to be migrated to MLIR-HLO), Quantization