Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

tensor core example result mismatch with that of cublas #53

Open
jdgh000 opened this issue Feb 5, 2024 · 0 comments
Open

tensor core example result mismatch with that of cublas #53

jdgh000 opened this issue Feb 5, 2024 · 0 comments

Comments

@jdgh000
Copy link

jdgh000 commented Feb 5, 2024

while I learning of tensor core low level code example using wmma::, i found there is a mismatch in a resulting c matrxi where tensor core and cublas sums were compared and large number of mismatches occurred. Because i am learning still tensor core I could not find the error is. Can you investigate?
code-samples/posts/tensor-cores


[root@localhost tensor-cores]# ls -l
total 832
-rw-r--r--. 1 root root   1685 Feb  5 08:01 Makefile
-rw-r--r--. 1 root root    317 Feb  5 08:01 README.md
-rwxr-xr-x. 1 root root 830920 Feb  5 08:01 TCGemm
-rw-r--r--. 1 root root  11380 Feb  5 08:01 simpleTensorCoreGEMM.cu
[root@localhost tensor-cores]# make && ./TCGemm
nvcc -o TCGemm -arch=sm_70 -lcublas -lcurand simpleTensorCoreGEMM.cu

M = 16384, N = 16384, K = 16384. alpha = 2.000000, beta = 2.000000

Running with wmma...
Running with cuBLAS...

Checking results...
8266.587891 8267.766602
8240.230469 8241.420898
8242.393555 8243.574219
8209.478516 8210.649414
8100.519043 8101.664062
8251.499023 8252.675781
8189.156738 8190.297852
8260.410156 8261.580078
8311.802734 8313.015625
WMMA does not agree with cuBLAS! 268435456 errors!
[root@localhost tensor-cores]# git remote -v
origin  https://github.com/NVIDIA-developer-blog/code-samples.git (fetch)
origin  https://github.com/NVIDIA-developer-blog/code-samples.git (push)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant