
VectorToXeGPU: Allows lowering vector.transfer_read and vector.transfer_write to XeGPU #773

Open · wants to merge 4 commits into main
Conversation

@Scarlet1ssimo commented Jun 10, 2024

Please review these guidelines to help with the review process:

  • Have you provided a meaningful PR description? Hope so.
  • Have you added a test, a reproducer, or a reference to an issue with a reproducer?
  • Have you tested your changes locally for CPU and GPU devices?
  • Have you made sure that new changes do not introduce compiler warnings?
  • If this PR is a work in progress, are you filing the PR as a draft?
  • Have you organized your commits logically and ensured each can be built by itself?

This patch adds a lowering of vector.transfer_read and vector.transfer_write, which commonly appear after vectorization, to the corresponding XeGPU dialect ops.
Namely, it first creates a descriptor and then applies either a LoadNdOp or a StoreNdOp.
Directly accessing a 1-D vector runs into an as-yet-undiagnosed issue: compilation and execution report no errors, but the resulting memory access pattern is wrong. As a temporary workaround, a 1-D access is first lowered as an access to a 1x? vector, which is then reshaped back to 1-D. This works as expected; hopefully the underlying issue can be fixed elsewhere.
Tested on a PVC device.
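
For illustration, a rough IR-level sketch of the intended lowering (the shapes, operand names, and exact XeGPU assembly syntax here are illustrative assumptions, not copied from the patch; the op names follow the CreateNdDescOp/LoadNdOp mentioned above):

    // Input: a 2-D transfer_read from a memref into a vector
    %v = vector.transfer_read %src[%c0, %c0], %pad
           : memref<8x16xf32>, vector<8x16xf32>

    // After lowering (sketch): create a descriptor, then load through it
    %desc = xegpu.create_nd_tdesc %src[%c0, %c0]
              : memref<8x16xf32> -> !xegpu.tensor_desc<8x16xf32>
    %v = xegpu.load_nd %desc
           : !xegpu.tensor_desc<8x16xf32> -> vector<8x16xf32>

    // 1-D workaround (sketch): access through a 1xN shape, then cast back
    %v2d = xegpu.load_nd %desc1d
             : !xegpu.tensor_desc<1x16xf32> -> vector<1x16xf32>
    %v1d = vector.shape_cast %v2d : vector<1x16xf32> to vector<16xf32>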

@silee2 self-requested a review June 17, 2024 20:20

@silee2 (Contributor) commented Jun 17, 2024

@Scarlet1ssimo I have a general question about the placement of this code.
Both the source and target dialects of this conversion pass are upstream MLIR dialects, so the pass could live in upstream MLIR or in your target project instead.
Is there a specific reason IMEX is the best place for it?

mlir::Value desc;
if (auto MemRefTypedSource =
        mlir::cast<mlir::TypedValue<mlir::MemRefType>>(source)) {
  desc = rewriter.create<mlir::xegpu::CreateNdDescOp>(
Shouldn't there be a check for memref rank here? XeGPU supports limited ranks.
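
A minimal sketch of such a guard, assuming the supported set is ranks 1 and 2 (the exact set should follow the XeGPU spec) and that this code sits inside a matchAndRewrite returning mlir::LogicalResult:

    // Hypothetical rank check before building the descriptor:
    auto memrefType = mlir::cast<mlir::MemRefType>(source.getType());
    if (memrefType.getRank() != 1 && memrefType.getRank() != 2)
      return mlir::failure(); // unsupported shape, give up on this pattern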

mlir::Value desc;
if (auto MemRefTypedSource =
        mlir::cast<mlir::TypedValue<mlir::MemRefType>>(source)) {
  desc = rewriter.create<mlir::xegpu::CreateNdDescOp>(

Same as my comment above: check the rank and return failure for unsupported shapes.

@@ -0,0 +1,114 @@
// RUN: %python_executable %imex_runner --requires=l0-runtime -i %s --pass-pipeline-file=%p/vector-to-llvm.pp \

This test is an integration test running on a GPU.
I would suggest placing it somewhere like
test/Integration/Dialect/Vector/

@@ -0,0 +1,18 @@
builtin.module(

This pipeline file should move to the integration test folder as well, along with the test case above.
