Added 64-bit support for CUDA Calls. #147

bviyer · 2023-06-15T19:41:33Z

IREE Compiler was converting 64-bit datatype to 32-bit and to support that several CUDA function calls were converted to 32-bit FP (float). This patch will reverse this and allow 64-bit datatype (double).

IREE Compiler was converting 64-bit datatype to 32-bit and to support that several CUDA function calls were converted to 32-bit FP (`float`). This patch will reverse this and allow 64-bit datatype (`double`).

ezhulenev · 2023-06-15T19:45:58Z

runtime/src/openxla/runtime/nvgpu/cudnn/test/conv2d.mlir

-cudnn.graph @conv2d(%x: !cudnn.tensor<8x32x4x4xf32, NHWC>,
-                    %w: !cudnn.tensor<32x32x1x1xf32, KHWC>)
-                    -> !cudnn.tensor<8x32x4x4xf32, NHWC> {
+cudnn.graph @conv2d(%x: !cudnn.tensor<8x32x4x4xf64, NHWC>,


tensors should stay in f32, only alpha/beta should be update to f64

Fixed. Pretty much reverted this file. see: af73d6b

ezhulenev · 2023-06-15T19:46:36Z

compiler/src/openxla/compiler/nvgpu/Dialect/CUDNN/Conversion/ConvertCUDNNToRuntime.cpp

@@ -576,13 +576,13 @@ struct ConvertCudnnBinaryOp : public CudnnOpConversionPattern<T> {
    MLIRContext *ctx = rewriter.getContext();
    ImplicitLocOpBuilder b(op->getLoc(), rewriter);

-    auto f32 = rewriter.getF32Type();
+    auto newType = rewriter.getF64Type();


newType=>f64 (and in few other places as well)

Fixed. (see af73d6b

ezhulenev · 2023-06-16T02:44:22Z

This still fails at IREE head with INTERNAL; import function signature mismatch between module and source cudnn; expected 0rfrfi_r but got 0rFrFi_r; resolving module 'module' imports; creating VM context; creating run context, what the PR that fixes the problem on IREE side?

bviyer · 2023-06-16T16:15:57Z

This still fails at IREE head with INTERNAL; import function signature mismatch between module and source cudnn; expected 0rfrfi_r but got 0rFrFi_r; resolving module 'module' imports; creating VM context; creating run context, what the PR that fixes the problem on IREE side?

iree-org/iree#14114

Added 64-bit support for CUDA Calls.

541467f

IREE Compiler was converting 64-bit datatype to 32-bit and to support that several CUDA function calls were converted to 32-bit FP (`float`). This patch will reverse this and allow 64-bit datatype (`double`).

ezhulenev reviewed Jun 15, 2023

View reviewed changes

Fixed issues mentioned by ezhulenev

af73d6b

bviyer requested a review from ezhulenev June 16, 2023 16:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added 64-bit support for CUDA Calls. #147

Added 64-bit support for CUDA Calls. #147

bviyer commented Jun 15, 2023

ezhulenev Jun 15, 2023

bviyer Jun 15, 2023 •

edited

Loading

ezhulenev Jun 15, 2023

bviyer Jun 15, 2023 •

edited

Loading

ezhulenev commented Jun 16, 2023

bviyer commented Jun 16, 2023

Added 64-bit support for CUDA Calls. #147

Are you sure you want to change the base?

Added 64-bit support for CUDA Calls. #147

Conversation

bviyer commented Jun 15, 2023

ezhulenev Jun 15, 2023

Choose a reason for hiding this comment

bviyer Jun 15, 2023 • edited Loading

Choose a reason for hiding this comment

ezhulenev Jun 15, 2023

Choose a reason for hiding this comment

bviyer Jun 15, 2023 • edited Loading

Choose a reason for hiding this comment

ezhulenev commented Jun 16, 2023

bviyer commented Jun 16, 2023

bviyer Jun 15, 2023 •

edited

Loading

bviyer Jun 15, 2023 •

edited

Loading