Generalize arithmetic ops to more combinations of scalars and arrays #782

jturner314 · 2020-02-15T20:48:02Z

This PR generalizes the existing implementations of arithmetic operations between scalars and arrays to more combinations of types. This is especially useful for operations between complex and real types.

A couple of notes:

This removes the special handling of commutative operations. (Before, commutative operations with the scalar on the left side were implemented by calling the operation with the scalar on the right side.) IMO, implementations for more combinations of types are more important than possible differences in compile time due to reusing implementations.
The new &arr (op) scalar implementation brings a performance boost.
An alternative approach for the "scalar on lhs" operations would be to add more implementations for specific combinations of types, e.g. f32 and Complex<f32>. I chose the generic approach instead for its conciseness and flexibility.
This change is backwards compatible, except for possible changes in type inference due to the implementations for more combinations of types.

Fixes #781.

jturner314 · 2020-02-16T20:06:30Z

It appears that Rust 1.37 has a bug that prevents this PR from working properly. Fortunately, this bug isn't present in the latest stable compiler, but we'll need to wait until we bump the minimum required Rust version before merging this PR.

bluss · 2020-04-22T21:55:25Z

Delightful that the .to_owned() was so easy to remove just like that

This change has two benefits: * The new implementation applies to more combinations of types. For example, it now applies to `&Array2<f32>` and `Complex<f32>`. * The new implementation avoids cloning the elements twice, and it avoids iterating over the elements twice. (The old implementation called `.to_owned()` followed by the arithmetic operation, while the new implementation clones the elements and performs the arithmetic operation in the same iteration.) On my machine, this change improves the performance for both contiguous and discontiguous arrays. (`scalar_add_1/2` go from ~530 ns/iter to ~380 ns/iter, and `scalar_add_strided_1/2` go from ~1540 ns/iter to ~1420 ns/iter.)

This doesn't have a noticeable impact on the results of the `scalar_add_2` and `scalar_add_strided_2` benchmarks.

bluss · 2020-12-29T19:00:49Z

Rebased to current master

bluss · 2020-12-29T19:14:29Z

src/impl_ops.rs

+      $scalar: Clone + $trt<A, Output=B>,
+      A: Clone,
+      S: Data<Elem=A>,
+      D: Dimension,


This impl somehow now breaks Rust -- see the failed tests -- and causes a recursion errror - for an expression that has type f32 + f32 which is quite strange/scary(!)

--> tests/oper.rs:159:48 | 159 | .fold(f32::zero(), |acc, (&x, &y)| acc + x * y) | ^ | = help: consider adding a `#![recursion_limit="256"]` attribute to your crate (`oper`) = note: required because of the requirements on the impl of `Add<&ndarray::ArrayBase<_, _>>` for `f32` = note: required because of the requirements on the impl of `Add<&ndarray::ArrayBase<_, _>>` for `f32` = note: required because of the requirements on the impl of `Add<&ndarray::ArrayBase<_, _>>` for `f32` = note: required because of the requirements on the impl of `Add<&ndarray::ArrayBase<_, _>>` for `f32` = note: required because of the requirements on the impl of `Add<&ndarray::ArrayBase<_, _>>` for `f32` = note: required because of the requirements on the impl of `Add<&ndarray::ArrayBase<_, _>>` for `f32` = note: required because of the requirements on the impl of `Add<&ndarray::ArrayBase<_, _>>` for `f32` = note: required because of the requirements on the impl of `Add<&ndarray::ArrayBase<_, _>>` for `f32`

Unsure if this is a Rust bug - for example that the impl is accepted(?), but I think this impl is too general and has infinite descent.

Given the question if f32 implements Add<&ArrayBase<S, D>> look for other impl that has f32: Add<A> where S: Data<Elem=A> which looks recursive, is that it?

It looks like a compiler bug to me. As you point out, the expression involves only f32, but for some reason, the error message indicates that one of the arguments is an array. It's also interesting that on my machine with Rust 1.48.0, the error message is slightly different, saying "impl of Add<ndarray::ArrayBase<_, _>> for f32" instead of the error message in your comment "impl of Add<&ndarray::ArrayBase<_, _>> for f32". (Note the &.)

The function fails to compile (with the same error message) even after adding type annotations:

fn reference_dot<'a, V1, V2>(a: V1, b: V2) -> f32 where V1: AsArray<'a, f32>, V2: AsArray<'a, f32>, { let a: ArrayView1<'a, f32> = a.into(); let b: ArrayView1<'a, f32> = b.into(); a.iter() .zip(b.iter()) .fold(f32::zero(), |acc: f32, (&x, &y): (&f32, &f32)| acc + x * y) }

but if I remove the + x * y, it compiles successfully:

fn reference_dot<'a, V1, V2>(a: V1, b: V2) -> f32 where V1: AsArray<'a, f32>, V2: AsArray<'a, f32>, { let a: ArrayView1<'a, f32> = a.into(); let b: ArrayView1<'a, f32> = b.into(); a.iter() .zip(b.iter()) .fold(f32::zero(), |acc: f32, (&x, &y): (&f32, &f32)| acc) }

I don't see any reason other than a compiler bug for the first function to fail to compile when the second one compiles without errors, since the type annotations confirm that the closure is operating only on f32 values.

This also compiles successfully:

fn reference_dot2<'a>(a: ArrayView1<'a, f32>, b: ArrayView1<'a, f32>) -> f32 { a.iter() .zip(b.iter()) .fold(f32::zero(), |acc: f32, (&x, &y): (&f32, &f32)| acc + x * y) }

so the bug involves the .into() calls in some way. It's surprising that adding explicit type annotations for the results of the .into() calls, as in the first example, doesn't work around the bug.

Fwiw, I don't think impl<'a, A, S, D, B> $trt<&'a ArrayBase<S, D>> for $scalar is infinitely recursive, since AFAIK it's not possible to have an array of (arrays of (arrays of (arrays of ... [infinite depth]))). The innermost array type can only have an element type that's not an array. You're right that there is recursion if you're dealing with arrays of arrays, but that's the correct behavior, and the recursion is not infinite.

For the particular function we're looking at, the impl doesn't apply, and I don't think the compiler should be trying to apply it. (I think it should only apply the impl if it knows the RHS has some type &ArrayBase<?S, ?D>, where ?S and ?D are inference variables.)

Interesting, the test runners for cross_test, stable, mips vs i686 disagree with each other about the error too, in the same way, even if they both use Rust 1.48

I reported the issue (with a simplified example) at rust-lang/rust#80542.

bluss · 2020-12-30T13:55:15Z

I have considered the question of deprecating scalars as left hand side (LHS) operands. The reason would be because their implementation does not fit well with how trait impls are normally written, and the inevitable asymmetry between array + scalar and scalar + array in terms of which types are accepted.

jturner314 · 2020-12-31T01:28:51Z

I have considered the question of deprecating scalars as left hand side (LHS) operands.

I agree that the implementations we have are somewhat unsatisfying, but IMO they're useful enough to keep. I would guess that the vast majority of users are dealing with the element types we implement the operators for, probably mostly f32/f64, and the impls are useful because subtraction and division aren't commutative. (To perform subtraction/division with a scalar on the left side without these impls, you'd have to use mapv or azip, which are much more verbose.)

I suppose an alternative option to the existing impls would be a Scalar wrapper type so that you could write expressions like this:

Scalar(2.) / array

which would work with any element type but would be less intuitive and would make expressions more verbose. I'm not sure a Scalar wrapper type is much better than using mapv.

bluss · 2021-01-10T15:26:22Z

I think I have found out that if this (ugly) workaround is applied, the ScalarOperand trait is not needed anymore - meaning an unrestricted Array1<A> + A would be allowed (without A: ScalarOperand). However, I'm unsure if it can be extended to Array1<A> + B - probably not.

I think that Scalar is a lot better than mapv, just more work for us to introduce it with all the right impls.

bluss · 2021-01-10T16:05:46Z

Benchmark and performance improvements are being included by using PR #890, that supersedes just the first commit and the map changes from this PR.

jturner314 added the enhancement label Feb 15, 2020

jturner314 mentioned this pull request Feb 15, 2020

Scalar operations with complex array and complex scalars #781

Open

jturner314 added the postponed label Feb 16, 2020

bluss added this to the 0.14.0 milestone Apr 22, 2020

bluss self-requested a review December 9, 2020 15:53

bluss removed the postponed label Dec 9, 2020

jturner314 added 3 commits December 29, 2020 19:59

Add benches for op with scalar and strided array

52ca234

Generalize lhs scalar ops to more combos of types

b2a7d0b

This doesn't have a noticeable impact on the results of the `scalar_add_2` and `scalar_add_strided_2` benchmarks.

bluss force-pushed the complex-real-ops branch from 19e35d3 to b2a7d0b Compare December 29, 2020 19:00

bluss reviewed Dec 29, 2020

View reviewed changes

jturner314 mentioned this pull request Dec 31, 2020

Operator impl causes compilation error for an expression involving the operator with a different pair of types rust-lang/rust#80542

Open

bluss mentioned this pull request Jan 10, 2021

Scalar + &array and &array + scalar performance improvements #890

Merged

bluss removed this from the 0.15.0 milestone Jan 10, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generalize arithmetic ops to more combinations of scalars and arrays #782

Generalize arithmetic ops to more combinations of scalars and arrays #782

jturner314 commented Feb 15, 2020 •

edited

Loading

jturner314 commented Feb 16, 2020

bluss commented Apr 22, 2020

bluss commented Dec 29, 2020

bluss Dec 29, 2020 •

edited

Loading

jturner314 Dec 30, 2020

bluss Dec 30, 2020 •

edited

Loading

jturner314 Dec 31, 2020

bluss commented Dec 30, 2020 •

edited

Loading

jturner314 commented Dec 31, 2020 •

edited

Loading

bluss commented Jan 10, 2021 •

edited

Loading

bluss commented Jan 10, 2021

Generalize arithmetic ops to more combinations of scalars and arrays #782

Are you sure you want to change the base?

Generalize arithmetic ops to more combinations of scalars and arrays #782

Conversation

jturner314 commented Feb 15, 2020 • edited Loading

jturner314 commented Feb 16, 2020

bluss commented Apr 22, 2020

bluss commented Dec 29, 2020

bluss Dec 29, 2020 • edited Loading

Choose a reason for hiding this comment

jturner314 Dec 30, 2020

Choose a reason for hiding this comment

bluss Dec 30, 2020 • edited Loading

Choose a reason for hiding this comment

jturner314 Dec 31, 2020

Choose a reason for hiding this comment

bluss commented Dec 30, 2020 • edited Loading

jturner314 commented Dec 31, 2020 • edited Loading

bluss commented Jan 10, 2021 • edited Loading

bluss commented Jan 10, 2021

jturner314 commented Feb 15, 2020 •

edited

Loading

bluss Dec 29, 2020 •

edited

Loading

bluss Dec 30, 2020 •

edited

Loading

bluss commented Dec 30, 2020 •

edited

Loading

jturner314 commented Dec 31, 2020 •

edited

Loading

bluss commented Jan 10, 2021 •

edited

Loading