better reducedim type inference #6994

stevengj · 2014-05-27T15:40:11Z

timholy · 2014-05-27T17:40:10Z

#6672 was filed for reductions other than those in reducedim, so we probably wouldn't want to close #6672 just yet. I also don't think this will work for an example like sum(Vector{Int}[[1,2],[4,3]], 1) because, IIUC, it still relies on having zero or one defined for the element type.

On my todo-list is to completely eliminate the need to initialize the output. Basically generalize reductions like

julia> function mysum(x)
           xs = x[1]
           for i = 2:length(x)
               xs += x[i]
           end
           xs
       end

to more than one dimension. The logic here is a little tricky, but I hope it's doable (though it might possibly hurt efficiency). If we can do that, then reducedim will be more flexible than it ever has been.
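To make the idea concrete, here is a rough two-dimensional sketch of that generalization, written in current Julia syntax rather than the 0.3-era syntax used in this thread. The name `colsum_noinit` is hypothetical and this is not the code in the patch; it only illustrates seeding each output entry from the data instead of from `zero`.

```julia
# Hypothetical sketch: sum over the first dimension without calling zero().
# Each output entry is seeded with the first element of its column.
function colsum_noinit(A::AbstractMatrix)
    m, n = size(A)
    m >= 1 || throw(ArgumentError("need at least one row to seed the output"))
    out = [A[1, j] for j in 1:n]      # element type comes from the first row
    for j in 1:n, i in 2:m
        out[j] += A[i, j]             # out[j] = out[j] + A[i, j]; A is not mutated
    end
    return reshape(out, 1, n)
end

# Works for element types with no zero(T) method, e.g. Vector{Int}:
V = reshape([[1, 2], [4, 3]], 2, 1)
colsum_noinit(V)[1, 1] == [5, 5]
```

The output element type is fixed by the first row, so a `Bool` matrix would fail as soon as a partial sum no longer fits in `Bool`. That is the type question debated in the rest of this thread.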

But if we need a stopgap, one good option would be to at least allow the user to initialize the output manually and be able to run sum! or prod! without triggering an error. I believe that's where we were before the init keyword got added.

stevengj · 2014-05-27T18:25:54Z

@timholy, the problems in #6672 boil down to the limitations of reducedim, and are indeed fixed by this patch. For example, std(FloatingPoint[1,2,3], 1) now gives [1.0] and Base.sumabs2(FloatingPoint[1,2,3],1) now gives [14.0].

stevengj · 2014-05-27T18:29:51Z

sum(Vector{Int}[[1,2],[4,3]], 1) should work but doesn't here because I erroneously checked for method_exists(zero, (T,)) rather than method_exists(zero, (Type{T},)). I'll post an updated patch shortly.
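The distinction between the two checks is easy to miss: `zero` has a method for array values but not for array types. A small sketch in current Julia, where `hasmethod` plays the role of the older `method_exists` and takes a `Tuple` type (an assumption relative to the 0.3-era code in this thread):

```julia
T = Vector{Int}

# zero(x::AbstractArray) exists, so checking the instance signature succeeds...
instance_check = hasmethod(zero, Tuple{T})        # true
# ...but no method accepts the type Vector{Int} itself.
type_check = hasmethod(zero, Tuple{Type{T}})      # false

zero([1, 2])        # == [0, 0]
# zero(T) would throw a MethodError, which is why the check must use Type{T}.
```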

stevengj · 2014-05-27T18:32:41Z

Unfortunately, your mysum function is not type-stable; see #6069 and #6116. You need to handle three cases separately: empty sums (return zero if possible, otherwise throw an error), 1-element sums (return x[1]+zero(x[1]) if possible, otherwise return x[1]), and sums of two or more elements (initialize to x[1] + x[2]).

Doing this for reducedim-type reductions is straightforward in principle, but a little bit hairy in practice, which is why I didn't attempt it in #6116.
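For the one-dimensional case, the three cases listed above might look roughly like the following. This is a sketch in current Julia syntax, not the implementation in Base; the name `mysum3` is hypothetical, `hasmethod` stands in for `method_exists`, and returning `zero(T) + zero(T)` for empty input is an assumption made so that all three branches agree on the result type.

```julia
# Hypothetical three-case sum; not the implementation in Base.
function mysum3(x::AbstractVector{T}) where {T}
    n = length(x)
    if n == 0
        # Empty: use zero(T) if it exists, otherwise there is nothing to return.
        hasmethod(zero, Tuple{Type{T}}) ||
            throw(ArgumentError("cannot sum an empty collection of $T"))
        return zero(T) + zero(T)
    elseif n == 1
        # One element: add a zero so the result type matches the n >= 2 case.
        return hasmethod(zero, Tuple{T}) ? x[1] + zero(x[1]) : x[1]
    else
        # Two or more: seed with x[1] + x[2], which already has the summed type.
        s = x[1] + x[2]
        for i in 3:n
            s += x[i]
        end
        return s
    end
end

mysum3(Bool[])              # 0
mysum3([true])              # 1
mysum3([true, true, true])  # 3
```

`Bool` is used in the usage lines because `true + true` is an `Int` in current Julia, so the element type and the sum type differ and the effect of each branch is visible.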

timholy · 2014-05-27T19:20:53Z

> @timholy, the problems in #6672 boil down to the limitations of reducedim

I hadn't caught that back when it was filed & discussed. Rats.

> Unfortunately, your mysum function is not type-stable

Right, but isn't that what you give up with type-flexibility? For example, for

A = FloatingPoint[1.0f0 2.0f0; 3.0 4.0]

shouldn't we really have sum(A, 1) == FloatingPoint[4.0 6.0] and

sum(A, 2) == reshape(FloatingPoint[3.0f0;  7.0], 2, 1)

?

Anyway, the revised patch looks better than where we are now (I didn't even realize zero([1,2]) == [0,0], that's quite handy). Fine with me if this gets merged, even if there may be yet another iteration someday.

stevengj · 2014-05-27T21:04:44Z

@timholy, obviously if you have an Array{Real} where every element is a different real type, then you aren't going to get any benefits of type stability. However, in the special case where all your elements have the same type, you want the sum function to be fast and type-stable. Your implementation does not have the second property.

timholy · 2014-05-27T22:10:30Z

It sounds great to get type-stability in cases where the type of the container is less specific than it could have been, but I'm being a little slow in understanding how you achieve it. For example, arguably we shouldn't have zero(FloatingPoint) defined at all. If you don't buy that, then apply everything I'm about to say to Vector{SIQuantity} (from the SIUnits package), for which it's pretty clear that you can't define zero(SIQuantity) (is it 0Meter? 0KiloGram?).

In such cases, IIUC with this patch the element type of sum(a, 1) is determined from sum(a). But

julia> Base.return_types(sum, (Vector{FloatingPoint},))
1-element Array{Any,1}:
 Any

and type-stability of Any is not, as far as I am aware, something that helps very much. Now, it might so happen that you get a Float64 back, and then you can do your reducedim using Float64 as the element type, and everything in the reduction is type-stable. But in the meantime, didn't you have to call sum and have it operate in what I assume must be a type-unstable fashion? Both sum(a) and sum(a, 1) are O(N), where N is the number of elements of a, so I'm not yet seeing what this really buys you.

But you seem to have thought about this, so I bet I'm just not getting it.

stevengj · 2014-05-27T22:36:57Z

@timholy, sorry I wasn't clear. sum will never be type-stable in a useful way for Vector{FloatingPoint}. However, you do want it to be type-stable in cases where it can be usefully type-stable.

For example, your mysum function is not type-stable for Vector{Int8}, where we certainly should be able to get type-stability.

timholy · 2014-05-28T00:21:06Z

Got it. Of course, that's fixed simply by changing the function to

function mysum(x)
    xs = x[1] + x[2]
    for i = 3:length(x)
        xs += x[i]
    end
    xs
end

and has the advantage of doing its job in a single pass.

(I also think it's basically inevitable that someday, Stefan will agree that Int8 + Int8 = Int8 😄.)
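The difference between the two versions of mysum can be checked directly. The sketch below uses current Julia syntax and `Bool` in place of `Int8`, on the assumption that it runs on a recent Julia where `Int8 + Int8` stays `Int8` but `true + true` is an `Int`. The names `mysum_v1` and `mysum_v2` are hypothetical labels for the two functions in this thread.

```julia
function mysum_v1(x)          # seeded with x[1]
    xs = x[1]
    for i = 2:length(x)
        xs += x[i]
    end
    xs
end

function mysum_v2(x)          # seeded with x[1] + x[2]
    xs = x[1] + x[2]
    for i = 3:length(x)
        xs += x[i]
    end
    xs
end

# v1: the result type depends on the length of the input.
t1 = (typeof(mysum_v1([true])), typeof(mysum_v1([true, true])))              # (Bool, Int)
# v2: the result type is the same for every length >= 2.
t2 = (typeof(mysum_v2([true, true])), typeof(mysum_v2([true, true, true])))  # (Int, Int)
```

`mysum_v2` indexes `x[2]` unconditionally, so it throws a `BoundsError` for inputs with fewer than two elements.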

timholy · 2014-05-28T00:26:58Z

(But I'll add that this is all vaporware right now, and I have no objections to your version being merged, particularly if there is not a better way of getting the same effect.)

stevengj · 2014-05-28T02:44:24Z

@timholy, right, but you also have to special-case 0 and 1 elements as I mentioned above. This is what sum does now; it is just a pain to generalize to reducedim.

timholy · 2014-05-28T08:50:59Z

Agreed, the logic for avoiding double-counting in reductions that involve more than one dimension simultaneously is not going to be entirely trivial. Adding the possibility of 0- and unit-length arrays on top of that makes it a bit worse, although I suspect that's going to be relatively minor ("do it on a separate code path") compared to the first.

lindahua · 2014-05-28T12:45:35Z

base/reducedim.jl

+    @eval function $f{T}(A::AbstractArray{T}, region)
+        if method_exists($init, (Type{T},))
+            z = $op($init(T), $init(T))
+            Tr = typeof(z) == typeof($init(T)) ? T : typeof(z)


Why not simply set Tr = typeof(z)?

stevengj · 2014-05-28

If T is Real, then typeof(z) will be Int, and then you will get errors if there are non-integer values in the array.

On the other hand, if T is Int8, then z will have type Int, and I think we want Tr to be Int too. Hence the conditional.
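The conditional can be read as a small type-selection rule. Here is a sketch of it as a standalone helper in current Julia syntax; the name `reduced_eltype` is hypothetical, and `Bool` stands in for `Int8` on the assumption of a recent Julia, where `Int8 + Int8` no longer widens to `Int`.

```julia
# Hypothetical helper mirroring the conditional in the diff above.
function reduced_eltype(op, init, ::Type{T}) where {T}
    z = op(init(T), init(T))
    # If combining two init values does not change their type, trust T
    # (this keeps an abstract T such as Real); otherwise use the widened type.
    return typeof(z) == typeof(init(T)) ? T : typeof(z)
end

reduced_eltype(+, zero, Real)     # Real: zero(Real) is an Int, and Int + Int is Int
reduced_eltype(+, zero, Bool)     # Int:  false + false is an Int
reduced_eltype(+, zero, Float64)  # Float64
```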

JeffBezanson · 2014-05-28T17:45:27Z

I get the sense we should merge this?

lindahua · 2014-05-28T18:50:59Z

I think it is good to merge.

better reducedim type inference

IainNZ · 2014-05-28T19:07:29Z

Was the perf test suite run before-after?

stevengj · 2014-05-28T20:25:54Z

@IainNZ, I didn't run the perf-test suite. This patch shouldn't really impact performance in the common case of sum for Array{T} where T is a concrete Number type, though.

fix JuliaLang#6672 (better reducedim type inference)

74ac755

lindahua reviewed May 28, 2014

JeffBezanson added a commit that referenced this pull request May 28, 2014

Merge pull request #6994 from stevengj/reducedim_type

916b2f5

better reducedim type inference

JeffBezanson merged commit 916b2f5 into JuliaLang:master May 28, 2014

stevengj mentioned this pull request May 28, 2014

Tweaks of sum functions #7013

Merged

stevengj mentioned this pull request Aug 17, 2016

any() and all() functions broken for empty collections #18073

Closed

stevengj mentioned this pull request Sep 6, 2016

Introduce UInt1 type to replace misuse of Bool #18367

Closed
