[EXTENDED] PR 26 Parallelized multiple right-hand side support, fully typed `tools` #27

MothNik · 2024-06-09T18:11:43Z

⚠️ This pull request can fully replace #26 which can basically be closed ⚠️

Update June 10, 2024

I temporarily converted this to a draft because the decisions for #28 heavily affect this PR because this might involve another change in the Cython-level API.

Changes

This pull request is an improved version of #26, so please refer to the basic changes in there. On top of these changes, this pull request

fully types the tools-module
parallelizes the Cython-implementation which reqiured
- restricting the Cython level interface in solver.pyxd so that the solvers only accept C-contiguous Arrays. This will not be backwards compatible because it could already be that users pass Fortran-contiguous Arrays (maybe without even knowing).
- refactoring the C-level functions to deal with pointers rather than memoryviews which allows to release the GIL and thus parallelize the right-hand side problems in the first place
- changing the setup.py to include OpenMP
adds the worker-argument to the Python interface

Since all that is a breaking change and the Cython interface is not backwards-compatible, I suggest that this is a major version jump from 1 to 2 (maybe as an $\alpha$-version of it?).
Again, I tried to update the changelog, but it might be that you need to still look over it.

Tests

The installation and parallelisation were tested on Windows 11 and Ubuntu (WSL). ✅
Tests now cover serial and parallelel solves, but they have to run with

pytest --cov=pentapy .\tests --cov-report html -x

now to prevent interfering with the multithreading. ✅
⚠️ Note that the -x cancels the process for the first failure which is mandatory for this delicate topic where we deal with pointers ⚠️
Depending on the OS, it might be that the doctests of tests/util_funcs fail because the Array output includes the dtype on Windows while it does not on Linux. ❌
Given that the C-implementation is already very fast in serial, the parallelization does not cause a quantum leap. With 8 threads on a relatively old laptop (there it is, the grain of salt 🧂), I observe a threefold speedup for huge systems (1,000 x 1,000 with 10,000 right-hand sides):

On massively parallel systems, this will give better speedups.
However, it does not negatively affect the serial solves when workers=1:

… test that test more different cases

…oses

…e severe

…proved import chain; improved code readability

… cover the edge cases

… f-string; fixed typo

…omments

… `uint`

…hon level; fixed typos

…t more extensive

… machine

…utput and warning behaviour; tested it altogether

…tate development without compromising on build

… fucking comment; augmented tests to cover this error

MothNik

Guys, you really declared war on making code maintainable 😅 How should anybody infer that np.array is used for both converting lists to Arrays and making a copy at the same time?
I now added a comment and made the tests cover this.

…olve on C-level

MothNik · 2024-06-10T11:38:57Z

The last commit currently does not change the internal logic at all.
However, it already forms the foundation of future validations, e.g., by means of the condition number of the matrix.
Right now, pentapy does never assess or allow the user to assess the quality of the solves because the result not being all np.nan, this does not mean that the solve was meaningful.
If the user wants to have the results validated, one could introduce validate to the Python high-level interface of solve.
The C-level interface was broken anyway, so I tried to keep it flexible for all given future scenarios. Right now, the validation logic does not trigger anything, so this buys flexibility in the future at no cost today.

The idea was taken from LAPACK. It returns an info-value together with the solution. It is usually 0 to indicate success while all other values encode certain warnings or even errors.

…load

src/pentapy/core.py

…I and II

…ers I and II: - now, they share one common processing logic that - calls the solvers it needs depending on which solver was called Besides, a shape check was introduced and the `info`-variable was extended. All of this is now reflected in `core`, where the solvers and errors are not structured a bit better.

… auxiliary functions in `core`; added NumPy-solver error case test

…pragma)

MothNik · 2024-06-11T21:43:37Z

To be able to cope with #28 and #29 in the future without compromising anything on the solvers side, I rearranged the Cython core (not the interface) as follows:

enums were used to provide additional safety for value comparisons and also to make the code more concise by increasing verbosity
the c_penta_solver1 and c_penta_solver2 interface are now calling two common functions that
- factorize the matrix A with the respective PTRANS-factorization (I for the first, II for the second)
- run the (parallelized) loop to solve the transformed right-hand side with this factorization, again with the respective solver
This allowed the removal of the redundant logic (factorize -> solve) that was implemented in both the c_penta_solver1 and c_penta_solver2 interface, thereby removing one critical potential point of failure.
these two common functions are now responsible for all memory allocation and they return the freshly allocated memoryviews of the factorized matrix and the result, respectively. Before that, memory allocation was handled via np.empty in both the c_penta_solver1 and c_penta_solver2 interface.
This in turn now comes with the added bonus, that other interface functions for
- factorization
- solving a right-hand side given an already factorized matrix
can easily be implemented on top and just accept the fresh memoryviews that are return by the common functions. All that's required now for such an interface function is that it calls the common function and casts the returned memoryview to an Array.
With these changes, the computation
- of the determinant ([Enhancement] Evaluate log-determinant for solve #28 ),
- the inverse's main diagonal ([Enhancement] Enable computation of main diagonal of the inverse from factorizations #29 ),
- or just a solve of another right-hand side
can be triggered from Python and Cython after a factorization was obtained.

This is probably a bit too abstract (I barely know how phrase it), so please let me provide the following
Analogy Example
In the basic principle, this is identical to how the banded Cholesky decomposition of SciPy works. For solving a system, there is scipy.linalg.solveh_banded. However, there is also scipy.linalg.cholesky_banded that can perform a banded Cholesky factorization and scipy.linalg.cho_solve_banded that can take the banded Cholesky factorization to solve one or more systems.

As of now, this PR basically implements the scipy.linalg.solveh_banded-analaogy of pentay (factorize and solve in one go), but on the Cython-side I arranged everything that it's super simple to also add the analogies of scipy.linalg.cholesky_banded (factorize only) and scipy.linalg.cho_solve_banded (solve system using a given factorization). But this would probably explode the frame of this already big PR 🤯

MothNik · 2024-06-11T21:47:32Z

@MuellerSeb Sorry for all these iterations. It tooks me some time to get the right strategy for Cython, the GIL, Memory Allocation, the Parallelization, and a testing framework (they all pass still ✅).
But now I feel that this PR is ready for review and I would highly appreciate your feedback 😄 I really wanted to keep everything flexible for future adjustments. It might be true that pentapy is "only" there for solving pentadiagonal systems, so the number of features will never be as mindblowing as SciPy, but playing with the factorizations (like with the determinant) is a common step in mathematical problems and I really wanted to pave way for that 🛣️

…trix Reason: it is the main diagonal of the matrix L for A = LU just like ps is the main diagonal of U for PTRANS-II. So now, the main diagonals of the non unit triangular factors are always in the central column which makes the most sense.

- avoided performance pitfall of `except * nogil` in Cython implementation of the solvers - replaced this statement by `noexcept nogil`

MuellerSeb

Gave it a first view. Was not to much in depth, since I got no time at this point as mentioned. I really like your work and it seems to be solid and improves the package.

CHANGELOG.md

README.md

docs/source/index.rst

pytest.ini

pyproject.toml

setup.py

src/pentapy/__init__.py

src/pentapy/errors.py

- moved `pytest.ini` to `pyproject.toml` - hard-coded dependencies in `pyproject.toml` again and removed the dynamic link

- made OpenMP-link in `setup.py` optional based on an environment variable - changed thread number evaluation to be OpenMP-based (if at all) - dropped `psutil` dependency - renamed `workers` to `num_threads` and made `None` a possible option and the default - adapted tests to the new `num_threads` - updated documentation - adapted to `pylint` comments

- fixed typo in docstring of `solve` for the number of threads

- made input matrix and right-hand side read-only on Cython-level

MothNik added 30 commits May 25, 2024 17:42

pkg: made .venv in .gitignore more general

96f67f6

wip: [11] refactored algorithm 1 to handle multiple right-hand sides

3deb05c

feat: [11] added doctest runs to pytest

328d9c9

tests: [11] replaced single tools test by scalable parametrized tools…

90bdeb1

… test that test more different cases

tests: [11] removed version from coverage

5080641

lint: [11] fixed block comment lint error

2f57609

style: [11] improved headlines

8db6ba4

wip: [11] formatted

1cdd525

test: [11] created conditioned banded matrix creator for testing purp…

ae656cf

…oses

tests: [11] add doctested reference solver; made ill-conditioning mor…

26752f2

…e severe

feat: [11] made error messages informative

a0c8849

fix: [23] disabled cdivision to fix error handling on Python side

ee6943a

feat/refactor: [11] enabled multipe right-hand sides for solver I; im…

0284843

…proved import chain; improved code readability

tests: [11] added shape check to doctest

c32bec2

tests: [11] added extensive parametrized tests for solver I that also…

e7d92c8

… cover the edge cases

tests: [11] added more intermediate sizes for tests

647012f

feat/fix: [11] added cython annotations to build process; fixed wrong…

8ab514f

… f-string; fixed typo

refactor/doc: fixed missing variable declarations; added clarifying c…

5086959

…omments

style: [11] made - signs better readable

f062b88

doc/refactor: [11] improved docs and comments of algorithm I; removed…

15c787e

… `uint`

wip: [11] restructured chaotic alias comparisons

ef7ec51

feat: [11] finalized multiple right-hand side support (serial) on Cyt…

94d110c

…hon level; fixed typos

tests: [11] unified test for both pentapy algorithms, made tests a lo…

ba5945f

…t more extensive

tests: [11] removed superficial unittest

026e8ea

misc: [11] fixed typos

f6fa4cb

style/refactor: [11] made core code readable to humans and not only a…

9b3e122

… machine

feat/refactor: [11] ensured all solvers behave the same in terms of o…

92abbbd

…utput and warning behaviour; tested it altogether

fix: [11] fixed coverage typo

fb0cf2a

lint: [11] linted cython files

c2e8032

package: [11] made requirements file-based and dynamic to make facili…

b4190f8

…tate development without compromising on build

MothNik changed the title ~~[EXTENDED] 11 Parallelized multiple right-hand side support, fully typed tools~~ [EXTENDED] PR 26 Parallelized multiple right-hand side support, fully typed tools Jun 9, 2024

fix/doc: [11] reverted change that caused overwrite of mat; added a…

88e30e6

… fucking comment; augmented tests to cover this error

MothNik commented Jun 9, 2024

View reviewed changes

MothNik added 3 commits June 9, 2024 21:29

docs: [11] removed doubled defaults from docstrings

b4ce4a2

docs: [11] fixed docstring and spelling inconsistencies

e71ff9d

feat: [11] enabled future possible validation of the quality of the s…

aef7d0f

…olve on C-level

MothNik added 2 commits June 10, 2024 14:03

refactor: [11] made internal workers-handling safer

86ebf88

docs: [11] updated outdated README

b5f8b95

MothNik mentioned this pull request Jun 10, 2024

[Enhancement] Evaluate log-determinant for solve #28

Open

MothNik marked this pull request as draft June 10, 2024 14:21

refactor: [11] removed validate and leveraged info; reduced test …

87264d7

…load

MothNik commented Jun 10, 2024

View reviewed changes

src/pentapy/core.py Outdated Show resolved Hide resolved

MothNik added 6 commits June 10, 2024 18:43

doc: [11] updated outdated index

8db881f

refactor: [11] made Cython structure smarter for algorithm I

37c2fbc

wip: [11] made use of Enums for checks; prepared for unifying solver …

896b1cc

…I and II

refactor/tests: [11] split up cluttered ptrans-solving into dedicated…

549422d

… auxiliary functions in `core`; added NumPy-solver error case test

fix: [11] fixed broken coverage of unknown error (was not skipped by …

a31fe71

…pragma)

MothNik marked this pull request as ready for review June 12, 2024 05:40

fix:

0945501

- avoided performance pitfall of `except * nogil` in Cython implementation of the solvers - replaced this statement by `noexcept nogil`

MuellerSeb reviewed Jul 23, 2024

View reviewed changes

MothNik added 4 commits July 23, 2024 19:35

refactor:

d1433f9

- moved `pytest.ini` to `pyproject.toml` - hard-coded dependencies in `pyproject.toml` again and removed the dynamic link

doc:

5b42428

- fixed typo in docstring of `solve` for the number of threads

refactor:

743549b

- made input matrix and right-hand side read-only on Cython-level

MothNik requested a review from MuellerSeb July 23, 2024 21:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[EXTENDED] PR 26 Parallelized multiple right-hand side support, fully typed `tools` #27

[EXTENDED] PR 26 Parallelized multiple right-hand side support, fully typed `tools` #27

MothNik commented Jun 9, 2024 •

edited

Loading

MothNik left a comment

MothNik commented Jun 10, 2024 •

edited

Loading

MothNik commented Jun 11, 2024 •

edited

Loading

MothNik commented Jun 11, 2024

MuellerSeb left a comment

[EXTENDED] PR 26 Parallelized multiple right-hand side support, fully typed tools #27

Are you sure you want to change the base?

[EXTENDED] PR 26 Parallelized multiple right-hand side support, fully typed tools #27

Conversation

MothNik commented Jun 9, 2024 • edited Loading

Changes

Tests

MothNik left a comment

Choose a reason for hiding this comment

MothNik commented Jun 10, 2024 • edited Loading

MothNik commented Jun 11, 2024 • edited Loading

MothNik commented Jun 11, 2024

MuellerSeb left a comment

Choose a reason for hiding this comment

[EXTENDED] PR 26 Parallelized multiple right-hand side support, fully typed `tools` #27

[EXTENDED] PR 26 Parallelized multiple right-hand side support, fully typed `tools` #27

MothNik commented Jun 9, 2024 •

edited

Loading

MothNik commented Jun 10, 2024 •

edited

Loading

MothNik commented Jun 11, 2024 •

edited

Loading