Illegal instruction error with oneMKL mt2203 instruction #441

ghost · 2024-01-23T17:18:57Z

Summary

MersenneTwisterGP11213 Sample migrated from CUDA fails on NVIDIA GPU with opensource oneMKL library - Illegal instruction (core dumped)

checkCudaErrors(DPCT_CHECK_ERROR(prngGPU = dpct::rng::create_host_rng(
                                       dpct::rng::random_engine_type::mt2203)));

Version

onemkl version - https://github.com/oneapi-src/oneMKL/releases/tag/v0.3

Environment

HW - Tesla P100-PCIE-12GB
OS name and version - Ubuntu* 22.04
Compiler version - clang version 18.0.0git (https://github.com/intel/llvm 3468f496c04ca9091da19ff5c73aa5f7ef792924)
output log -

./a.out Starting...

 Allocating data for 2400000 samples...
 Seeding with 777 ...

 Illegal instruction (core dumped)

Steps to reproduce

Code Repo - MersenneTwister
Compile command - clang++ -fsycl -fsycl-targets=nvptx64-nvidia-cuda MersenneTwister.cpp.dp.cpp -I../../../Common/ -I ../../../include/ -I -L -lonemkl_rng_curand -lonemkl

Observed behavior

Sample executes successfully with Intel oneMKL library on intel hardware(Gen9, Gen11)

Expected output

/a.out Starting...

Allocating data for 2400000 samples...
Seeding with 777 ...
Generating random numbers on GPU...


Reading back the results...
Shutting down...

The text was updated successfully, but these errors were encountered:

ghost · 2024-02-02T04:56:26Z

Hi, Any update on this issue, can someone help with this ?

ghost · 2024-02-09T05:14:46Z

@aelizaro any update?

aelizaro · 2024-02-09T12:39:47Z

Hi, @Shwetha-Selma, sorry for the delayed responce, I missed this issue.

MT2203 generator is not yet available via oneMKL interfaces project, only via Intel oneMKL and so it can be used only on x86 CPUs and Intel's GPUs from the oneAPI package (however illegal instruction issue is a surprise, I can check it within dpct tool). In order to run on Tesla P100-PCIE-12GB, please, try another generator available in OneMKL Interfaces with cuRAND backend (Philox4x32x10 or mrg32k3a)

or if the point is the sample migration itself - you can build it with icpx compiler, link it against Intel oneMKL but it won't work on NVIDIA HW yet, you can validate it on x86 CPU:

icpx -fsycl -DMKL_ILP64 -I"${MKLROOT}/include MersenneTwister.cpp.dp.cpp -L${MKLROOT}/lib -lmkl_sycl_rng -lmkl_intel_ilp64 -lmkl_tbb_thread -lmkl_core -lsycl -lpthread -lm -ldl

ghost · 2024-02-12T09:31:44Z

Thanks @aelizaro for confirming MT2203 generator is not yet available for the NVIDIA backend. And It would be helpful if you check about the 'Illegal instruction' error.

Also Is there any plan to include support for MT2203 generator with open-source oneMKL?

aelizaro · 2024-02-12T09:36:09Z

@Shwetha-Selma, yes, we plan to extend oneMKL Interfaces with more generators, the ones which are used in DPCT as the first priority, but no exact timeline yet.

AvijitBag07 · 2024-07-12T08:43:26Z

Hi, any update for MT2203 support in oneMKL ??

paveldyakov · 2024-07-12T11:51:56Z

Hi @AvijitBag07, no updates yet.
Please let us know if it blocks your work - we can try to prioritize it

AvijitBag07 · 2024-07-15T05:30:40Z

Yes it blocks our work, it will be beneficial if it is completed earlier.

paveldyakov · 2024-07-16T08:45:58Z

@AvijitBag07, got it, thank you for the feedback. We will increase priority for this engine

Just some clarification - we can introduce MT2203 in oneMKL Interfaces, but it won't support Nvidia backend. Is that fine for you?

AvijitBag07 · 2024-07-16T10:43:54Z

No, we are looking for Nvidia Backend support only.

paveldyakov · 2024-07-16T11:36:12Z

cuRAND is the Nvidia backend in oneMKL Interfaces.
But cuRAND doesn't support MT2203 engine.

AvijitBag07 · 2024-07-17T09:31:58Z

As oneMKL supports other RNG functions such as philox4x32x10<1> and mcg59<1> added for Nvidia backends, we also want MT2203 to be included

paveldyakov · 2024-07-17T10:11:33Z

@AvijitBag07, do you reference to Device RNG API?
If yes, we have a list of engines that is defined in oneMKL spec - https://github.com/uxlfoundation/oneAPI-spec/blob/main/source/elements/oneMKL/source/domains/rng/device_api/device-engines.rst

If you are interested in adding new engines to oneMKL Interfaces - please propose it for oneMKL spec

AvijitBag07 · 2024-07-19T11:41:44Z

How Intel oneMKL supports MT2203 generator, you can add MT2203 generator too in oneMKL too

paveldyakov · 2024-07-23T08:16:30Z

Intel oneMKL supports MT2203 in Host API only (with x86 CPUs and Intel GPUs as the supported HW).

We can add support for MT2203 Host API into oneMKL Interfaces with Intel oneMKL backend. It means that it will have x86 CPUs and Intel GPUs supported HW. As there is no MT2203 in cuRAND - we cannot enable cuRAND as Nvidia backend for this generator.

AvijitBag07 · 2024-07-24T10:08:02Z

We can link intel oneMKL lib and use it for intel GPU, but for Nvidia there is no option yet. It will be helpful if you can add it

mmeterel assigned aelizaro Feb 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Illegal instruction error with oneMKL mt2203 instruction #441

Illegal instruction error with oneMKL mt2203 instruction #441

ghost commented Jan 23, 2024 •

edited by ghost

Loading

ghost commented Feb 2, 2024

ghost commented Feb 9, 2024

aelizaro commented Feb 9, 2024 •

edited

Loading

ghost commented Feb 12, 2024

aelizaro commented Feb 12, 2024

AvijitBag07 commented Jul 12, 2024

paveldyakov commented Jul 12, 2024

AvijitBag07 commented Jul 15, 2024

paveldyakov commented Jul 16, 2024

AvijitBag07 commented Jul 16, 2024

paveldyakov commented Jul 16, 2024

AvijitBag07 commented Jul 17, 2024

paveldyakov commented Jul 17, 2024

AvijitBag07 commented Jul 19, 2024

paveldyakov commented Jul 23, 2024

AvijitBag07 commented Jul 24, 2024

Illegal instruction error with oneMKL mt2203 instruction #441

Illegal instruction error with oneMKL mt2203 instruction #441

Comments

ghost commented Jan 23, 2024 • edited by ghost Loading

Summary

Version

Environment

Steps to reproduce

Observed behavior

Expected output

ghost commented Feb 2, 2024

ghost commented Feb 9, 2024

aelizaro commented Feb 9, 2024 • edited Loading

ghost commented Feb 12, 2024

aelizaro commented Feb 12, 2024

AvijitBag07 commented Jul 12, 2024

paveldyakov commented Jul 12, 2024

AvijitBag07 commented Jul 15, 2024

paveldyakov commented Jul 16, 2024

AvijitBag07 commented Jul 16, 2024

paveldyakov commented Jul 16, 2024

AvijitBag07 commented Jul 17, 2024

paveldyakov commented Jul 17, 2024

AvijitBag07 commented Jul 19, 2024

paveldyakov commented Jul 23, 2024

AvijitBag07 commented Jul 24, 2024

ghost commented Jan 23, 2024 •

edited by ghost

Loading

aelizaro commented Feb 9, 2024 •

edited

Loading