Fix docs explaining broadcasting, or change behaviour of `min.(1,f)` #23445

dlfivefifty · 2017-08-25T13:34:59Z

The docs state unequivocally that:

More generally, f.(args…) is actually equivalent to broadcast(f, args…)

But, as discussed in

https://discourse.julialang.org/t/should-min-1-f-be-broadcasting-as-broadcast-x-min-1-x-f/5554/14

min.(1,f) is actually equivalent to broadcast(x->min(1,x), f). So either this behaviour should change, or the docs need to be rephrased to, say,

More generally, f.(args…) is typically equivalent to broadcast(f, args…)

The text was updated successfully, but these errors were encountered:

stevengj · 2017-08-25T14:57:48Z

See also #22053 and #18202.

dlfivefifty · 2017-08-25T17:40:09Z

#22053 seems to hit the screw on its head: for non-Base types, one often wants to use dot-notation to indicate a functional calculus operation deriving from Hadamard .* multiplication (instead of / in addition to standard * multiplication... in the case of ApproxFun, Hadamard and standard * are the same operation), and these type of optimizations make this impossible when it's only implementable for a subset of functions.

My personal suggested solution: only apply these optimizations to Base types, so that other types can still override dot-notation as they desire too. How to handle fusing for other types would be a separate issue (personally, I'd be fine to always auto-fuse and leave it to the user to be careful if they want the special overrides).

stevengj · 2017-08-25T20:43:25Z

Being restricted to Base types is precisely what we wanted to avoid.

dlfivefifty · 2017-08-25T20:44:56Z

I'm talking about the treatment of constant literals, not fusing in general

stevengj · 2017-08-25T20:48:12Z

In what use case would the treatment of constant literals be a problem but general fusing wouldn't? (In you ApproxFun example, where you want to have special-case code for the min of a Fun and a constant, general fusing would defeat your approach also.)

The general philosophy is that . means fusing semantics, and if you don't want this then you should use some other operator. There are plenty to choose from. (Nevertheless, we are planning a way to optionally disable fusing etc. for user-defined types.)

dlfivefifty · 2017-08-25T20:52:57Z

The issue is not with fusing, its with fusing where the semantics suggest that there should be no fusing at all. It's secretly and misleadingly creating a new function and compiling even though the code as written min.(1,f) is asking it to call `broadcast(min,1,f)

stevengj · 2017-08-26T02:57:35Z

If you write sin.(min.(α,f)) your specialized method won't be called either. So what practical problem does eliminating the literal-inlining optimization actually solve?

(Sure, the documentation could be clarified.)

dlfivefifty · 2017-08-26T06:47:17Z

You do if you call

sin(min.(a, g))

The issue it solves is that min(a, g) should not act Point wise according to the docs.

StefanKarpinski · 2017-08-26T15:26:57Z

It does trouble me that the lowering of broadcast is so seemingly ad hoc. It certainly makes it very hard to explain and I'm a bit worried that it's not terribly future proof. The ideal way to do this is to have broadcast's lowering be fairly simple and uniform and let constant propagation and specialization take care of the rest. However, we don't currently have that level of optimization technology here. Of course, it's not wrong to have some complicated lowering rule as long as we document it and are consistent – it's just kind of messy and hard to reason about.

yuyichao · 2017-08-26T15:29:03Z

future proof

I think the future proof way is to document clearly that any transformation allowed for Base broadcast are allowed. People that somehow really want to use broadcast as the API can always, well, call broadcast directly.

StefanKarpinski · 2017-08-26T15:31:40Z

See my last sentence which addressed exactly that – you're not wrong, it's just nasty. It's not generally a terribly good argument for a language design issue since you can use it to justify any design good or bad by saying "well, that's what it's documented to do".

yuyichao · 2017-08-26T15:42:08Z

I know. I just want to present a slightly different kind of "future proof".

To expand a bit more, I see the dot syntax as an syntax sugar for (de)vectorization rather than a strict mapping to broadcast (after all, it originated from the vectorization issue). Since for calling broadcast you can already just, well call broadcast, I think we don't have to tie another syntax too strictly to boardcast and it is reasonable that we leave more room for future optimization to the dot syntax that won't be possible with broadcast itself.

One example of such an optimization would be for, e.g. sin.(array_1xn) .* cos(array_nx1) one can get (almost) n times speedup by reducing the number of sin/cos evaluation. Of course such optimization is hard and I'm not sure how that can be implemented now. However, I think it's possible to do if we can make use of syntactic information and will be much harder if we have to optimize it as a single boardcast call. It'll be nice if we don't make such an optimization impossible.

dlfivefifty · 2017-09-23T09:30:17Z

Here's another more serious inconsistency due to this:

julia> (1:3) .+ 1
3-element Array{Int64,1}:
 2
 3
 4

julia> x = 1; (1:3) .+ x
2:4

dlfivefifty · 2017-09-23T09:31:40Z

Perhaps this could be done in Julia code instead of when parsing? Something like:

@inline broadcast(f, v::Array, k::Number) = broadcast(x->f(x,k), v)

though I don't know if that would inline the constants correctly.

@lower

This patch represents the combined efforts of four individuals, over 60 commits, and an iterated design over (at least) three pull requests that spanned nearly an entire year (closes #22063, #23692, #25377 by superceding them). This introduces a pure Julia data structure that represents a fused broadcast expression. For example, the expression `2 .* (x .+ 1)` lowers to: ```julia julia> Meta.@lower 2 .* (x .+ 1) :($(Expr(:thunk, CodeInfo(:(begin Core.SSAValue(0) = (Base.getproperty)(Base.Broadcast, :materialize) Core.SSAValue(1) = (Base.getproperty)(Base.Broadcast, :make) Core.SSAValue(2) = (Base.getproperty)(Base.Broadcast, :make) Core.SSAValue(3) = (Core.SSAValue(2))(+, x, 1) Core.SSAValue(4) = (Core.SSAValue(1))(*, 2, Core.SSAValue(3)) Core.SSAValue(5) = (Core.SSAValue(0))(Core.SSAValue(4)) return Core.SSAValue(5) end))))) ``` Or, slightly more readably as: ```julia using .Broadcast: materialize, make materialize(make(*, 2, make(+, x, 1))) ``` The `Broadcast.make` function serves two purposes. Its primary purpose is to construct the `Broadcast.Broadcasted` objects that hold onto the function, the tuple of arguments (potentially including nested `Broadcasted` arguments), and sometimes a set of `axes` to include knowledge of the outer shape. The secondary purpose, however, is to allow an "out" for objects that _don't_ want to participate in fusion. For example, if `x` is a range in the above `2 .* (x .+ 1)` expression, it needn't allocate an array and operate elementwise — it can just compute and return a new range. Thus custom structures are able to specialize `Broadcast.make(f, args...)` just as they'd specialize on `f` normally to return an immediate result. `Broadcast.materialize` is identity for everything _except_ `Broadcasted` objects for which it allocates an appropriate result and computes the broadcast. It does two things: it `initialize`s the outermost `Broadcasted` object to compute its axes and then `copy`s it. Similarly, an in-place fused broadcast like `y .= 2 .* (x .+ 1)` uses the exact same expression tree to compute the right-hand side of the expression as above, and then uses `materialize!(y, make(*, 2, make(+, x, 1)))` to `instantiate` the `Broadcasted` expression tree and then `copyto!` it into the given destination. All-together, this forms a complete API for custom types to extend and customize the behavior of broadcast (fixes #22060). It uses the existing `BroadcastStyle`s throughout to simplify dispatch on many arguments: * Custom types can opt-out of broadcast fusion by specializing `Broadcast.make(f, args...)` or `Broadcast.make(::BroadcastStyle, f, args...)`. * The `Broadcasted` object computes and stores the type of the combined `BroadcastStyle` of its arguments as its first type parameter, allowing for easy dispatch and specialization. * Custom Broadcast storage is still allocated via `broadcast_similar`, however instead of passing just a function as a first argument, the entire `Broadcasted` object is passed as a final argument. This potentially allows for much more runtime specialization dependent upon the exact expression given. * Custom broadcast implmentations for a `CustomStyle` are defined by specializing `copy(bc::Broadcasted{CustomStyle})` or `copyto!(dest::AbstractArray, bc::Broadcasted{CustomStyle})`. * Fallback broadcast specializations for a given output object of type `Dest` (for the `DefaultArrayStyle` or another such style that hasn't implemented assignments into such an object) are defined by specializing `copyto(dest::Dest, bc::Broadcasted{Nothing})`. As it fully supports range broadcasting, this now deprecates `(1:5) + 2` to `.+`, just as had been done for all `AbstractArray`s in general. As a first-mover proof of concept, LinearAlgebra uses this new system to improve broadcasting over structured arrays. Before, broadcasting over a structured matrix would result in a sparse array. Now, broadcasting over a structured matrix will _either_ return an appropriately structured matrix _or_ a dense array. This does incur a type instability (in the form of a discriminated union) in some situations, but thanks to type-based introspection of the `Broadcasted` wrapper commonly used functions can be special cased to be type stable. For example: ```julia julia> f(d) = round.(Int, d) f (generic function with 1 method) julia> @inferred f(Diagonal(rand(3))) 3×3 Diagonal{Int64,Array{Int64,1}}: 0 ⋅ ⋅ ⋅ 0 ⋅ ⋅ ⋅ 1 julia> @inferred Diagonal(rand(3)) .* 3 ERROR: return type Diagonal{Float64,Array{Float64,1}} does not match inferred return type Union{Array{Float64,2}, Diagonal{Float64,Array{Float64,1}}} Stacktrace: [1] error(::String) at ./error.jl:33 [2] top-level scope julia> @inferred Diagonal(1:4) .+ Bidiagonal(rand(4), rand(3), 'U') .* Tridiagonal(1:3, 1:4, 1:3) 4×4 Tridiagonal{Float64,Array{Float64,1}}: 1.30771 0.838589 ⋅ ⋅ 0.0 3.89109 0.0459757 ⋅ ⋅ 0.0 4.48033 2.51508 ⋅ ⋅ 0.0 6.23739 ``` In addition to the issues referenced above, it fixes: * Fixes #19313, #22053, #23445, and #24586: Literals are no longer treated specially in a fused broadcast; they're just arguments in a `Broadcasted` object like everything else. * Fixes #21094: Since broadcasting is now represented by a pure Julia datastructure it can be created within `@generated` functions and serialized. * Fixes #26097: The fallback destination-array specialization method of `copyto!` is specifically implemented as `Broadcasted{Nothing}` and will not be confused by `nothing` arguments. * Fixes the broadcast-specific element of #25499: The default base broadcast implementation no longer depends upon `Base._return_type` to allocate its array (except in the empty or concretely-type cases). Note that the sparse implementation (#19595) is still dependent upon inference and is _not_ fixed. * Fixes #25340: Functions are treated like normal values just like arguments and only evaluated once. * Fixes #22255, and is performant with 12+ fused broadcasts. Okay, that one was fixed on master already, but this fixes it now, too. * Fixes #25521. * The performance of this patch has been thoroughly tested through its iterative development process in #25377. There remain [two classes of performance regressions](#25377) that Nanosoldier flagged. * #25691: Propagation of constant literals sill lose their constant-ness upon going through the broadcast machinery. I believe quite a large number of functions would need to be marked as `@pure` to support this -- including functions that are intended to be specialized. (For bookkeeping, this is the squashed version of the [teh-jn/lazydotfuse](#25377) branch as of a1d4e7e. Squashed and separated out to make it easier to review and commit) Co-authored-by: Tim Holy <[email protected]> Co-authored-by: Jameson Nash <[email protected]> Co-authored-by: Andrew Keller <[email protected]>

@lower

This patch represents the combined efforts of four individuals, over 60 commits, and an iterated design over (at least) three pull requests that spanned nearly an entire year (closes #22063, #23692, #25377 by superceding them). This introduces a pure Julia data structure that represents a fused broadcast expression. For example, the expression `2 .* (x .+ 1)` lowers to: ```julia julia> Meta.@lower 2 .* (x .+ 1) :($(Expr(:thunk, CodeInfo(:(begin Core.SSAValue(0) = (Base.getproperty)(Base.Broadcast, :materialize) Core.SSAValue(1) = (Base.getproperty)(Base.Broadcast, :make) Core.SSAValue(2) = (Base.getproperty)(Base.Broadcast, :make) Core.SSAValue(3) = (Core.SSAValue(2))(+, x, 1) Core.SSAValue(4) = (Core.SSAValue(1))(*, 2, Core.SSAValue(3)) Core.SSAValue(5) = (Core.SSAValue(0))(Core.SSAValue(4)) return Core.SSAValue(5) end))))) ``` Or, slightly more readably as: ```julia using .Broadcast: materialize, make materialize(make(*, 2, make(+, x, 1))) ``` The `Broadcast.make` function serves two purposes. Its primary purpose is to construct the `Broadcast.Broadcasted` objects that hold onto the function, the tuple of arguments (potentially including nested `Broadcasted` arguments), and sometimes a set of `axes` to include knowledge of the outer shape. The secondary purpose, however, is to allow an "out" for objects that _don't_ want to participate in fusion. For example, if `x` is a range in the above `2 .* (x .+ 1)` expression, it needn't allocate an array and operate elementwise — it can just compute and return a new range. Thus custom structures are able to specialize `Broadcast.make(f, args...)` just as they'd specialize on `f` normally to return an immediate result. `Broadcast.materialize` is identity for everything _except_ `Broadcasted` objects for which it allocates an appropriate result and computes the broadcast. It does two things: it `initialize`s the outermost `Broadcasted` object to compute its axes and then `copy`s it. Similarly, an in-place fused broadcast like `y .= 2 .* (x .+ 1)` uses the exact same expression tree to compute the right-hand side of the expression as above, and then uses `materialize!(y, make(*, 2, make(+, x, 1)))` to `instantiate` the `Broadcasted` expression tree and then `copyto!` it into the given destination. All-together, this forms a complete API for custom types to extend and customize the behavior of broadcast (fixes #22060). It uses the existing `BroadcastStyle`s throughout to simplify dispatch on many arguments: * Custom types can opt-out of broadcast fusion by specializing `Broadcast.make(f, args...)` or `Broadcast.make(::BroadcastStyle, f, args...)`. * The `Broadcasted` object computes and stores the type of the combined `BroadcastStyle` of its arguments as its first type parameter, allowing for easy dispatch and specialization. * Custom Broadcast storage is still allocated via `broadcast_similar`, however instead of passing just a function as a first argument, the entire `Broadcasted` object is passed as a final argument. This potentially allows for much more runtime specialization dependent upon the exact expression given. * Custom broadcast implmentations for a `CustomStyle` are defined by specializing `copy(bc::Broadcasted{CustomStyle})` or `copyto!(dest::AbstractArray, bc::Broadcasted{CustomStyle})`. * Fallback broadcast specializations for a given output object of type `Dest` (for the `DefaultArrayStyle` or another such style that hasn't implemented assignments into such an object) are defined by specializing `copyto(dest::Dest, bc::Broadcasted{Nothing})`. As it fully supports range broadcasting, this now deprecates `(1:5) + 2` to `.+`, just as had been done for all `AbstractArray`s in general. As a first-mover proof of concept, LinearAlgebra uses this new system to improve broadcasting over structured arrays. Before, broadcasting over a structured matrix would result in a sparse array. Now, broadcasting over a structured matrix will _either_ return an appropriately structured matrix _or_ a dense array. This does incur a type instability (in the form of a discriminated union) in some situations, but thanks to type-based introspection of the `Broadcasted` wrapper commonly used functions can be special cased to be type stable. For example: ```julia julia> f(d) = round.(Int, d) f (generic function with 1 method) julia> @inferred f(Diagonal(rand(3))) 3×3 Diagonal{Int64,Array{Int64,1}}: 0 ⋅ ⋅ ⋅ 0 ⋅ ⋅ ⋅ 1 julia> @inferred Diagonal(rand(3)) .* 3 ERROR: return type Diagonal{Float64,Array{Float64,1}} does not match inferred return type Union{Array{Float64,2}, Diagonal{Float64,Array{Float64,1}}} Stacktrace: [1] error(::String) at ./error.jl:33 [2] top-level scope julia> @inferred Diagonal(1:4) .+ Bidiagonal(rand(4), rand(3), 'U') .* Tridiagonal(1:3, 1:4, 1:3) 4×4 Tridiagonal{Float64,Array{Float64,1}}: 1.30771 0.838589 ⋅ ⋅ 0.0 3.89109 0.0459757 ⋅ ⋅ 0.0 4.48033 2.51508 ⋅ ⋅ 0.0 6.23739 ``` In addition to the issues referenced above, it fixes: * Fixes #19313, #22053, #23445, and #24586: Literals are no longer treated specially in a fused broadcast; they're just arguments in a `Broadcasted` object like everything else. * Fixes #21094: Since broadcasting is now represented by a pure Julia datastructure it can be created within `@generated` functions and serialized. * Fixes #26097: The fallback destination-array specialization method of `copyto!` is specifically implemented as `Broadcasted{Nothing}` and will not be confused by `nothing` arguments. * Fixes the broadcast-specific element of #25499: The default base broadcast implementation no longer depends upon `Base._return_type` to allocate its array (except in the empty or concretely-type cases). Note that the sparse implementation (#19595) is still dependent upon inference and is _not_ fixed. * Fixes #25340: Functions are treated like normal values just like arguments and only evaluated once. * Fixes #22255, and is performant with 12+ fused broadcasts. Okay, that one was fixed on master already, but this fixes it now, too. * Fixes #25521. * The performance of this patch has been thoroughly tested through its iterative development process in #25377. There remain [two classes of performance regressions](#25377) that Nanosoldier flagged. * #25691: Propagation of constant literals sill lose their constant-ness upon going through the broadcast machinery. I believe quite a large number of functions would need to be marked as `@pure` to support this -- including functions that are intended to be specialized. (For bookkeeping, this is the squashed version of the [teh-jn/lazydotfuse](#25377) branch as of a1d4e7e. Squashed and separated out to make it easier to review and commit) Co-authored-by: Tim Holy <[email protected]> Co-authored-by: Jameson Nash <[email protected]> Co-authored-by: Andrew Keller <[email protected]>

stevengj added the broadcast Applying a function over a collection label Aug 25, 2017

JeffBezanson added the docs This change adds or pertains to documentation label Aug 25, 2017

fredrikekre mentioned this issue Nov 12, 2017

UnitRange broacasting with literals gives a Vector #24586

Closed

mbauman mentioned this issue Apr 24, 2018

Customizable lazy fused broadcasting in pure Julia #26891

Merged

mbauman closed this as completed in #26891 Apr 26, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix docs explaining broadcasting, or change behaviour of `min.(1,f)` #23445

Fix docs explaining broadcasting, or change behaviour of `min.(1,f)` #23445

dlfivefifty commented Aug 25, 2017 •

edited

Loading

stevengj commented Aug 25, 2017

dlfivefifty commented Aug 25, 2017

stevengj commented Aug 25, 2017

dlfivefifty commented Aug 25, 2017

stevengj commented Aug 25, 2017 •

edited

Loading

dlfivefifty commented Aug 25, 2017

stevengj commented Aug 26, 2017 •

edited

Loading

dlfivefifty commented Aug 26, 2017

StefanKarpinski commented Aug 26, 2017

yuyichao commented Aug 26, 2017

StefanKarpinski commented Aug 26, 2017 •

edited

Loading

yuyichao commented Aug 26, 2017 •

edited

Loading

dlfivefifty commented Sep 23, 2017

dlfivefifty commented Sep 23, 2017

Fix docs explaining broadcasting, or change behaviour of min.(1,f) #23445

Fix docs explaining broadcasting, or change behaviour of min.(1,f) #23445

Comments

dlfivefifty commented Aug 25, 2017 • edited Loading

stevengj commented Aug 25, 2017

dlfivefifty commented Aug 25, 2017

stevengj commented Aug 25, 2017

dlfivefifty commented Aug 25, 2017

stevengj commented Aug 25, 2017 • edited Loading

dlfivefifty commented Aug 25, 2017

stevengj commented Aug 26, 2017 • edited Loading

dlfivefifty commented Aug 26, 2017

StefanKarpinski commented Aug 26, 2017

yuyichao commented Aug 26, 2017

StefanKarpinski commented Aug 26, 2017 • edited Loading

yuyichao commented Aug 26, 2017 • edited Loading

dlfivefifty commented Sep 23, 2017

dlfivefifty commented Sep 23, 2017

Fix docs explaining broadcasting, or change behaviour of `min.(1,f)` #23445

Fix docs explaining broadcasting, or change behaviour of `min.(1,f)` #23445

dlfivefifty commented Aug 25, 2017 •

edited

Loading

stevengj commented Aug 25, 2017 •

edited

Loading

stevengj commented Aug 26, 2017 •

edited

Loading

StefanKarpinski commented Aug 26, 2017 •

edited

Loading

yuyichao commented Aug 26, 2017 •

edited

Loading