More sparse operations support #50

alyst · 2024-09-10T18:56:30Z

This PR adds support for more SparseMKL operations:

dense := sparse * sparse (sp2md!())
sparse := sparse * sparse (including the in-place one that can optionally disable checks for the sparsity structure) (sp2m(), sp2m!())
dense := X * A * X^T (syprd!())
dense := X * X^T (syrk!())

Implementation Details

while Intel MKL sp2m() supports
reusing the sparsity structure in principle, it only supports the sparsity structure allocated by Intel MKL.
It is therefore not possible to directly update the existing SparseMatrixCSC object, reusing the colptr, rowval and updating the nzval only.
Instead, each sp2m!() call has to recalculate the sparsity structure, so it is not efficient as it could have been.
Intel MKL syrkd() and
syprd() routines only support the CSR format and only fill the upper triangular part of the matrix.
So, while MKLSparse.jl wrappers (syrkd!() and syprd!()) implement CSC support by treating it as a transpose/adjoint of the CSR matrix, it
does not work for complex numbers and operations like C := a * A^H A + b * C, because transposition trick messes with which half of the hermitian C
is used. So complex support for syrkd!() and syprd!() is disabled
Because of these complications, syrkd!() and syprd!() were not "plugged" into mul!() calls like low rank dense BLAS counterparts.
Also, copytri!() is actually very expensive for large matrices due to cache misses, so doing it by default may result in degraded performance.
For syprd!() there is also no direct LinearAlgebra call to overload.
Potentially, MKLSparse.jl may support PDMats.jl via extension mechanism to implement X_A_Xt!() via syprd!().
syrk() only returns the upper triangular part of the resulting sparse matrix. Since allocating and computing the lower triangular part in that case could be even more expensive than
for the dense case, syrk(..., copytri=true) is not allowed.
spmm() and spmmd!() wrappers were added, but as they are just more limited versions of sp2m!() and sp2md!(), they were not tested

amontoison · 2024-09-16T23:29:31Z

@alyst Can you split the PR to add what is already working?

alyst · 2024-09-17T00:08:14Z

@amontoison I will rebase it on top of the 2 PRs I have submitted today. I think all of the new operations should be working (I use them in my code), but I would need to fix/add the tests.

amontoison · 2024-09-17T00:12:03Z

Perfect, thanks @alyst 👍
What we should also try to interface in the future is the sparse preconditioner.

codecov · 2025-01-03T05:53:23Z

Codecov Report

Attention: Patch coverage is 80.44280% with 53 lines in your changes missing coverage. Please review.

Project coverage is 44.39%. Comparing base (cb0852e) to head (4a7c463).
Report is 3 commits behind head on master.

Files with missing lines	Patch %	Lines
src/generic.jl	79.50%	25 Missing ⚠️
src/types.jl	40.90%	13 Missing ⚠️
src/interface.jl	80.85%	9 Missing ⚠️
src/mklsparsematrix.jl	93.65%	4 Missing ⚠️
src/utils.jl	88.23%	2 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           master      #50       +/-   ##
===========================================
+ Coverage   32.98%   44.39%   +11.40%     
===========================================
  Files           7        8        +1     
  Lines        1258     1489      +231     
===========================================
+ Hits          415      661      +246     
+ Misses        843      828       -15

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

- use randn() instead of rand() to include negative values - variable sparsity rate for random sparse matrices

use + to generate symmetric matrix; the result of matrix multiplication may have different eltype

this is required to support checking dimensions for X*A*X^T

to distringuish from MKLSparseMatrix

copyto!(Matrix, CSR) that works with unordered colval

alyst · 2025-01-03T21:59:47Z

@amontoison Finally I was able to add the tests for this PR, so it should be ready for review. I've also added "Implementation Notes" section to the PR description to provide some context about what was implemented and what are the limitations.

amontoison · 2025-01-03T22:06:28Z

@alyst I will review that :)
Do you need the release v2025.0.0 of MKL_jll.jl?

alyst · 2025-01-03T22:12:28Z

I will review that :) Do you need the release v2025.0.0 of MKL_jll.jl?

Thank you! I think v2025.0 is not required, any v2024 should work as well (v2025 is subject to the same restrictions I've mentioned in the implementation notes).

amontoison · 2025-01-03T22:13:04Z

LGTM, I merged and tagged a new release 👍

alyst force-pushed the more_ops branch from 5f9d291 to 5326777 Compare September 17, 2024 01:05

alyst mentioned this pull request Oct 10, 2024

Add Support for Sparse by Sparse Matrix Multiplication #53

Closed

alyst force-pushed the more_ops branch from 2c9d6d1 to e55c39d Compare January 3, 2025 20:09

alyst and others added 23 commits January 3, 2025 12:24

README: link to most recent Intel MKL docs

4f2a272

add top-level comments to clarify code structure

0b23c3e

tests: ntries constant

98f4a99

tests: rework random matrices generation

20ea8b9

- use randn() instead of rand() to include negative values - variable sparsity rate for random sparse matrices

tests: fix matdescra

588bc01

use + to generate symmetric matrix; the result of matrix multiplication may have different eltype

tests: increase atol to reduce spurious failures

090d82a

check_map_op_sizes(): allow C=nothing

08c64c5

check_map_op_sizes(): allow disabling specific checks

6d502ad

this is required to support checking dimensions for X*A*X^T

matrix_descr(): edit specific fields

bd6fcae

use LazyString for exceptions

7e33a1b

rename typealias MKLSparseMat to SparseMat

881edfa

to distringuish from MKLSparseMatrix

tweak typealiases to improve precompilation times

611b663

fix \ support in v1.9-v1.11

172c706

add check_nzpattern() method

8170704

convert(CSC/CSR, MKLSparseMtx): fix for empty mtx

9b3045e

COO, CSR: overloads necessary for unit tests

3dcebba

copy!(CSC/CSR, MKLSparseMatrix)

686b055

CSR/COO: improve dense conversion

c96f756

copyto!(Matrix, CSR) that works with unordered colval

convert(CSR, a::CSC)

473d683

add fastcopytri!() method

90c05d2

dual_opcode(op)

ebab206

mul!(dense, sparse, sparse) support (sp2md!())

d6cdeef

mul!(sparse, sparse, sparse) support (sp2m())

1589ff1

alyst and others added 5 commits January 3, 2025 12:24

dense := A * A^T support (syrkd!())

3dd1015

sparse := A * A^T support (syrk())

a8e9303

dense := A * B * A^T support (syprd!())

8a3e231

spmm() & spmmd!() (untested)

e878bcd

fixup ws

4a7c463

alyst force-pushed the more_ops branch from e55c39d to 4a7c463 Compare January 3, 2025 20:24

alyst marked this pull request as ready for review January 3, 2025 20:24

amontoison approved these changes Jan 3, 2025

View reviewed changes

amontoison merged commit 18ad424 into master Jan 3, 2025
10 checks passed

amontoison deleted the more_ops branch January 3, 2025 22:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More sparse operations support #50

More sparse operations support #50

alyst commented Sep 10, 2024 •

edited

Loading

amontoison commented Sep 16, 2024

alyst commented Sep 17, 2024

amontoison commented Sep 17, 2024

codecov bot commented Jan 3, 2025 •

edited

Loading

alyst commented Jan 3, 2025

amontoison commented Jan 3, 2025

alyst commented Jan 3, 2025

amontoison commented Jan 3, 2025

More sparse operations support #50

More sparse operations support #50

Conversation

alyst commented Sep 10, 2024 • edited Loading

Implementation Details

amontoison commented Sep 16, 2024

alyst commented Sep 17, 2024

amontoison commented Sep 17, 2024

codecov bot commented Jan 3, 2025 • edited Loading

Codecov Report

alyst commented Jan 3, 2025

amontoison commented Jan 3, 2025

alyst commented Jan 3, 2025

amontoison commented Jan 3, 2025

alyst commented Sep 10, 2024 •

edited

Loading

codecov bot commented Jan 3, 2025 •

edited

Loading