A_mul_B and all that jazz #57

jiahao · 2014-01-08T19:21:28Z

The purpose of this issue is to discuss the future of the dozens of functions in Julia that are slight variations of matrix multiplication, matrix division and backslash. I presume that the point was to have fine-grained control over the use of matrix variables and temporaries when needed. However, I don't understand why the availability of these functions is so inconsistent: is this intentional design or a consequence of organic growth with methods being implemented as needed?

List of all the functions (existing and plausibly existing in the future)

Here are the 3x3x3x2=54 possible functions that can be constructed out of A, its transpose At and its Hermitian conjugate Ac; and similarly for B, Bt and Bc; mul, rdiv and ldiv; and mutating and non-mutating variants. (### means that this function does not exist.)

A_mul_B #deprecated synonym of A*B
A_mul_B!
A_mul_Bt
A_mul_Bt!
A_mul_Bc
A_mul_Bc!

At_mul_B
At_mul_B!
At_mul_Bt
At_mul_Bt!
At_mul_Bc   ###
At_mul_Bc!  ###

Ac_mul_B
Ac_mul_B!
Ac_mul_Bt #deprecated
Ac_mul_Bt!  ###
Ac_mul_Bc
Ac_mul_Bc!


A_ldiv_B    ### synonym of A\B
A_ldiv_B!
A_ldiv_Bt
A_ldiv_Bt!  ###
A_ldiv_Bc
A_ldiv_Bc!  ###

At_ldiv_B
At_ldiv_B!  ###
At_ldiv_Bt
At_ldiv_Bt! ###
At_ldiv_Bc  ###
At_ldiv_Bc! ###

Ac_ldiv_B
Ac_ldiv_B!  ###
Ac_ldiv_Bt  ###
Ac_ldiv_Bt! ###
Ac_ldiv_Bc
Ac_ldiv_Bc! ###


A_rdiv_B    ### synonym of A/B
A_rdiv_B!   ###
A_rdiv_Bt
A_rdiv_Bt!  ###
A_rdiv_Bc
A_rdiv_Bc!  ###

At_rdiv_B
At_rdiv_B!  ###
At_rdiv_Bt
At_rdiv_Bt! ###
At_rdiv_Bc  ###
At_rdiv_Bc! ###

Ac_rdiv_B
Ac_rdiv_B!  ###
Ac_rdiv_Bt  ###
Ac_rdiv_Bt! ###
Ac_rdiv_Bc
Ac_rdiv_Bc! ###

I'm rocking this boat because it's come up as a natural consequence of systematically making methods available for special matrix types.

Specific issues

Documentation. The existence, motivation and use of these functions is sparsely documented. Not all the functions even have defined help strings.

2 Growth and proliferation. Do we foresee an eventual future where all 54-3=51 functions (omitting the synonyms for *, / and \) become defined and have methods implemented? There would a great many methods that would have to be implemented to cover all the possible types of matrices that currently exist.

Can we eliminate any of these functions on the basis of being useless? There are no functions defined that mix Ac
and Bt, for example; however, this is a somewhat rare but entirely plausible combination to operate on. Do we care?
How many of these functions can be implemented with just a fallback method?
Having more things in Base without reason is not ideal, but conversely these functions form a coherent grouping for which it doesn't make sense to have some but not others.
How many of these functions can be defined trivially in terms of lazy array views (#5003) and rewriting rules (Label QRCompactWY.T as Triangular #122)?

cc: @JeffBezanson @StefanKarpinski @andreasnoackjensen @dmbates @lindahua

The text was updated successfully, but these errors were encountered:

johnmyleswhite · 2014-01-08T19:27:14Z

If we get rid of these functions, am I right to assume we'll be pushing information into the type system using transpose types and conjugate types?

jiahao · 2014-01-08T19:32:56Z

I think that would depend quite strongly on how #42 is resolved.

dmbates · 2014-01-08T20:40:24Z

Because I develop iterative methods that require a lot of numerical linear algebra within each iteration, I want to be careful about reusing existing storage when possible. Hence I find these mutating functions to be worthwhile. I could always write them as direct calls to functions in the BLAS or LAPACK modules but this makes the code harder to read for those who didn't grow up needing to know Lapack calling sequences as I did.

However, I would be amenable to hiding all these functions in the Base namespace and only exporting the non-mutating versions that are called when expressions like C = A'*B are parsed. The use of mutating functions to save memory allocations is a "consenting adults" activity in my opinion. If you are going to engage in it you should be aware of the potential problems.

ViralBShah · 2014-01-09T02:20:57Z

I think only export non-mutating versions is the right choice. What about the transposed multiply operators? Should those also not be exported by default?

I would be ok with removing the c and t combinations too. It wasn't fun dealing with all of those in a recent PR.

For the stuff with lazy array views, we should wait until we get there, or else we will see a performance degradation. At that point, I would expect that we can get rid of the transposed multiply and divide operators.

johnmyleswhite · 2014-01-09T02:22:49Z

To me, it seems like it would be better to wait until we have a general tool for expressing mutating operations before removing these functions.

ViralBShah · 2014-01-09T02:26:22Z

I am only giving a +1 to the proposal to remove the Ac_mul_Bt where you are mixing complex and real transposed matrices, and not exporting all these in Base by default.

johnmyleswhite · 2014-01-09T02:45:40Z

Ok. That makes sense to me, although I'd think there's something to be said for keeping an eyesore around as a reminder that we need a cleaner solution.

ViralBShah · 2014-01-09T02:47:39Z

There is some value to eyesore, but keeping it around longer means more people use it thinking it is a supported API and makes it difficult to remove in the future.

johnmyleswhite · 2014-01-09T02:48:20Z

Very true.

jiahao · 2014-01-09T03:13:51Z

Time for an @Embarrassing macro?

andreasnoack · 2014-03-25T14:48:39Z

Yesterday I found out that

julia> n=1000;
julia> A=randn(n,n);A=A+A';Aeig=eigfact(Symmetric(A));
julia> V=Aeig[:vectors];D=diagm(Aeig[:values]);
julia> @time V*D*V';
elapsed time: 0.263635665 seconds (24000512 bytes allocated)
julia> @time V*(D*V');
elapsed time: 0.235679586 seconds (16000272 bytes allocated)
julia> @time (V*D)*V';
elapsed time: 0.237066579 seconds (16000272 bytes allocated)

because

julia> @which V*D*V'
*(a,b,c) at operators.jl:71

causing the transpose to be calculated before the multiplication.

I think that @johnmyleswhite's proposal sounds like the right one. It could eliminate all the A_x_B functions and only require mul!,*,ldiv!,rdiv!,\ and /. The rest is dispatch. I think all the function bodies would be unaffected. Should we try this change? I can do the coding, but I would like to know it could fly before spending the time.

johnmyleswhite · 2014-03-25T15:20:47Z

I believe the problem with the transpose type (which somebody else proposed before me and I just brought up again) is that we're not sure how to implement it for higher-order tensors.

Jutho · 2014-04-20T18:12:24Z

Another suggestion would be to follow more closely the BLAS approach and have a single function with additional arguments 'N', 'T' and 'C' specifying the multiplication pattern. This of course only shifts the problem to branching within the method, and probably calling all of the above functions anyway, but the advantage is that only a single function needs to be exported and documented.

Occasionally I regret that the mutating multiplication functions in Julia do not more closely resemble the BLAS design, i.e. not only allow to store the result of the multiplication in a given array, but actually add the result to it. I guess there has been some thought in the BLAS design, and it would be great if Julia could generalise this functionality (i.e. matrices with no stride 1, different types of dense and sparse matrices, ...) without sacrificing the existing functionality.

I know that gemm and gemv are available within Base.LinAlg.BLAS , but if you define a new type hierarchy, say AbstractLinearOperator, for sparse linear operators for use in an iterative linear solver or eigensolver, then you want to implement a method A_mul_B! for its concrete subtypes. If you would allow to actually add the result of the matrix vector multiplication to a given vector in this method, then you could also define a SumOfLinearOperators <: AbstractLinearOperator type, define + for any AbstractLinearOperator to return a SumOfLinearOperators which just keeps track of the different AbstractLinearOperator terms, and implements A_mul_B! for an input vector by calling A_mul_B! sequentially on all the terms, thereby adding the result and not having to create any copies.

JeffBezanson · 2015-03-17T21:20:08Z

Getting rid of these is one of my top wishlist items. We can even have mul!, ldiv!, and rdiv!, just no underscores :) Let's move ahead with this.

johnmyleswhite · 2015-03-18T02:17:00Z

I just telling someone today how these methods are one of the main weak points of Julia's otherwise amazing design for linear algebra.

tkelman · 2015-03-18T12:05:30Z

JuliaLang/julia#6837 is step one, right?

JeffBezanson · 2016-04-21T21:02:54Z

@andreasnoack Any update here?

andreasnoack · 2016-04-21T22:07:34Z

Not yet. I'll give an update on this after the weekend.

andreasnoack · 2016-04-29T03:50:32Z

I've pushed a version that eliminates all the Ax_mul_Bx functions to JuliaLang/julia#6837. I'm using a Transpose type with a conjugated field. I think it works fairly well. It is also possible to avoid quite a few explicit transposes.

However, this hurts compilation time significantly. With the new version, the triangular.jl tests take more than 15 minutes on my machine. The total number of allocations is 3x the version on master and total amount of memory allocation is more than 2x. Running the tests a second time is still as fast as master so this is about compilation.

kmsquire · 2016-04-29T13:06:47Z

Anything you could add to base/precompile.jl that might help?

andreasnoack · 2016-04-29T13:36:16Z

My impression is that precompilation is actaully slower that "normal" compilation so I'd expect that the total compile time would be longer although the time in the tests might be shorter.

ViralBShah · 2017-11-18T04:52:10Z

It would certainly also be helpful to understand the needs of the ML folks before taking a final call on many of the open linalg issues. That would put these improvements squarely past 1.0, and that may be ok.

cc @MikeInnes

MikeInnes · 2017-11-20T13:46:22Z

I will link the CUBLAS wrappers. You can see that there's a fair bit of redundancy with Base; given that CUBLAS has an identical api to blas, ideally we'd just swap out the final gemm! calls without ever touching the higher-level API.

That said it does work fine at present, and I expect that things will be improved with whatever changes are made – reducing the redundancy would just be icing.

I think it's actually a bigger deal for AD, where we have to code a derivative for every Ax_mul_Bx combination. Having a single mul! would be a huge improvement there.

StefanKarpinski · 2017-11-20T18:59:27Z

Is there some way we can isolate Base from this change? The special lowering to these function names is an issue, but it would be possible to provide the fallbacks and then override them in the future, which would leave the unfortunate lowering hack but would allow this to be fixed in the future.

StefanKarpinski · 2017-11-20T19:53:34Z

Related: JuliaLang/julia#23424.

StefanKarpinski · 2017-11-20T20:15:23Z

Our options at this point are:

Keep A_mul_B and all that jazz and live with it until 2.0.
Move a' etc. out of Base into LinAlg and fix this post 1.0, this means that you'd have to do using LinAlg before you can do x'y for a dot product.
Hassle @andyferris about finishing the lazy transpose and conjugate work that this requires.

andyferris · 2017-11-20T23:26:14Z

Hassle @andyferris about finishing the lazy transpose and conjugate work that this requires.

Noted. :)

StefanKarpinski · 2017-11-21T14:52:46Z

For what it's worth, I think this is more crucial than getting the indexing with associatives stuff sorted. The main action item there would simply be to iterate dictionaries by value. The most conservative potential change for 0.7 would be to just deprecate dict iteration altogether and force explicit choice of iteration by pairs(d), keys(d) or values(d). That leaves us in a position to choose any means of iteration we want in the future, including going back to pair iteration.

JeffBezanson · 2017-11-21T17:03:14Z

The more I think about it, the more I like the idea of requiring pairs(d), keys(d), or values(d) to iterate over dicts. Anything else just leads to surprises --- other popular languages do keys by default, and on the other hand many operations make the most sense on values or pairs. There just isn't one obvious right way to do it.

vtjnash · 2017-11-21T21:04:02Z

What does that have to do with A_mul_b and all that jazz?

quinnj · 2017-11-21T21:10:38Z

What does that have to do with A_mul_b and all that jazz?

Because we're collectively prioritizing @andyferris's discretionary time and open-source efforts 😆

andyferris · 2017-11-22T04:35:51Z

@StefanKarpinski I came to the same conclusion - if someone else wants to volunteer for a ValueIterator for dicts and deprecating next(::Associative), I'd love to work on this stuff ;)

@JeffBezanson Getting even more off topic, I still feel the answer lies in what you want map(f, dict) and reduce(op, dict) to do (my answer: the most useful behavior for map (for manipulating data, especially after a group/groupby function) is to preserve keys and map values, just like for arrays (including offset arrays), and to reduce over values). These functions seem subservient to the iteration API - especially reduce.

@vtjnash Yes, it's a bit convoluted how this conversation ended up here! 😄

vtjnash · 2017-11-22T13:47:02Z

Please take off-topic remarks elsewhere. This thread is about making implementations of A_mul_b better, not about making Associative worse.

StefanKarpinski · 2017-11-22T16:39:11Z

Please take off-topic remarks elsewhere.

As explained, these are not off-topic and have to do with work prioritization which affects this issue.

Sacha0 · 2017-11-28T00:05:53Z

Linking this analysis of paths to addressing this issue by 1.0. Best!

JeffBezanson · 2017-12-31T01:18:21Z

These functions have been removed 🎉 so I think we can close this.

jiahao mentioned this issue Mar 10, 2014

Fix A_mul_B! for y inputs with NaNs by setting elements to zero when β is zero. JuliaLang/julia#6096

Merged

andreasnoack mentioned this issue May 14, 2014

WIP:Add Transpose immutable JuliaLang/julia#6837

Closed

jrevels mentioned this issue Dec 14, 2014

Choosing an initial target version of Julia JuliaAttic/QuBase.jl#1

Closed

jiahao mentioned this issue Jan 2, 2015

document A_mul_B! JuliaLang/julia#5862

Closed

jiahao mentioned this issue Jan 26, 2015

At_mul_B! has different methods for sparse and dense matrices #160

Closed

StefanKarpinski mentioned this issue Jun 16, 2016

Arraypocalypse Now and Then #255

Closed

27 tasks

Sacha0 mentioned this issue Dec 19, 2015

methods for left-division ops involving triangular matrices and sparse vectors JuliaLang/julia#14447

Merged

StefanKarpinski assigned andreasnoack Apr 28, 2016

tkelman mentioned this issue Nov 26, 2024

document that 2-arg mul! methods trashes all arguments #338

Closed

Sacha0 mentioned this issue Nov 23, 2017

WIP: make transpose non-recursive JuliaLang/julia#23424

Closed

This was referenced Dec 6, 2017

Extending in-place broadcast (broadcast!) for user types JuliaLang/julia#24914

Closed

lazier, less-jazzy linalg internals JuliaLang/julia#24969

Merged

This was referenced Dec 25, 2017

optimize and fix map/broadcast over Adjoint/Transpose vectors, take 2 JuliaLang/julia#25238

Merged

cumsum inconvenient for RowVector JuliaLang/julia#20041

Open

JeffBezanson closed this as completed Dec 31, 2017

This was referenced Jan 3, 2018

adjoint/transpose transcend material concerns JuliaLang/julia#25364

Merged

make Adjoint/Transpose behave like typical constructors JuliaLang/julia#25461

Merged

abelsiqueira mentioned this issue Sep 28, 2018

An implementation of DIOM JuliaSmoothOptimizers/Krylov.jl#63

Merged

KristofferC transferred this issue from JuliaLang/julia Nov 26, 2024

Sacha0 mentioned this issue Nov 23, 2017

Taking matrix transposes seriously #408

Closed

A_mul_B and all that jazz #57

A_mul_B and all that jazz #57

Comments

jiahao commented Jan 8, 2014

List of all the functions (existing and plausibly existing in the future)

Specific issues

johnmyleswhite commented Jan 8, 2014

jiahao commented Jan 8, 2014

dmbates commented Jan 8, 2014

ViralBShah commented Jan 9, 2014

johnmyleswhite commented Jan 9, 2014

ViralBShah commented Jan 9, 2014

johnmyleswhite commented Jan 9, 2014

ViralBShah commented Jan 9, 2014

johnmyleswhite commented Jan 9, 2014

jiahao commented Jan 9, 2014

andreasnoack commented Mar 25, 2014

johnmyleswhite commented Mar 25, 2014

Jutho commented Apr 20, 2014

JeffBezanson commented Mar 17, 2015

johnmyleswhite commented Mar 18, 2015

tkelman commented Mar 18, 2015

JeffBezanson commented Apr 21, 2016

andreasnoack commented Apr 21, 2016

andreasnoack commented Apr 29, 2016 • edited Loading

kmsquire commented Apr 29, 2016

andreasnoack commented Apr 29, 2016 • edited Loading

ViralBShah commented Nov 18, 2017

MikeInnes commented Nov 20, 2017

StefanKarpinski commented Nov 20, 2017

StefanKarpinski commented Nov 20, 2017

StefanKarpinski commented Nov 20, 2017

andyferris commented Nov 20, 2017

StefanKarpinski commented Nov 21, 2017

JeffBezanson commented Nov 21, 2017

vtjnash commented Nov 21, 2017

quinnj commented Nov 21, 2017

andyferris commented Nov 22, 2017

vtjnash commented Nov 22, 2017

StefanKarpinski commented Nov 22, 2017 • edited Loading

Sacha0 commented Nov 28, 2017

JeffBezanson commented Dec 31, 2017

andreasnoack commented Apr 29, 2016 •

edited

Loading

andreasnoack commented Apr 29, 2016 •

edited

Loading

StefanKarpinski commented Nov 22, 2017 •

edited

Loading