Why is non-decreasing dimension order on block ptr not supported? #207
-
In #140 a check against the `order` field of block pointers was introduced. With the following kernel snippet:

```python
pid_n = tl.program_id(axis=0)  # x
pid_m = tl.program_id(axis=1)  # y
offset_m = pid_m * BLOCK_SIZE_M
offset_n = pid_n * BLOCK_SIZE_N
in_block_ptr = tl.make_block_ptr(
    base=in_ptr,
    shape=(m, n),
    strides=(in_stride_m, in_stride_n),
    offsets=(offset_m, offset_n),
    block_shape=(BLOCK_SIZE_M, BLOCK_SIZE_N),
    order=(0, 1),  # increasing.
)
```

the check fails. I wonder what's the purpose of this check, doesn't
-
I am also interested, since this check does not allow running FlashAttention v2 from here:
-
Thanks for the question! This check was introduced because our lowering does not yet take the `order` field into account; it prevents us from producing incorrect code, since the conversion to `memref.reinterpret_cast` currently assumes a row-major layout. Using increasing order on a row-major tensor would therefore behave as if there were an implicit transpose. We are currently reworking some of the passes to support arbitrary pointer patterns; once that is done, we would appreciate any help updating the structured-to-memref pass to take the `order` field into account.
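To make the "implicit transpose" point concrete, here is a plain-Python sketch (not Triton code; the variable names and sizes are illustrative) of what the `order` field declares and why ignoring it changes which element each `(i, j)` index refers to. `order=(1, 0)` means the last dimension is fastest-varying in memory (row-major); `order=(0, 1)` means the first dimension is fastest-varying (column-major):

```python
# One flat buffer of m * n elements, viewed under two different orders.
m, n = 4, 3
buf = list(range(m * n))

# order=(1, 0): row-major, element (i, j) lives at offset i * n + j.
row_major = [[buf[i * n + j] for j in range(n)] for i in range(m)]

# order=(0, 1): column-major, element (i, j) lives at offset i + j * m.
col_major = [[buf[i + j * m] for j in range(n)] for i in range(m)]

# A lowering that assumes row-major layout would read the column-major
# view as if it were a row-major (n, m) view of the same bytes,
# transposed -- i.e., an implicit transpose sneaks in:
row_major_nm = [[buf[j * m + i] for i in range(m)] for j in range(n)]   # shape (n, m)
transpose_of_nm = [[row_major_nm[j][i] for j in range(n)] for i in range(m)]
assert col_major == transpose_of_nm
assert col_major != row_major
```

This is why the check rejects increasing order for now: silently reading the wrong elements would be worse than failing loudly.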