isqrt is a mess #4884

stevengj · 2013-11-21T19:08:25Z

The isqrt function is poorly documented, but a reasonable definition of isqrt(n) would be the largest integer m such that m*m <= n. This coincides with the definition:

isqrt(x::Integer) = oftype(x, trunc(sqrt(x)))

that we are using, more or less.

However, for Int128 values, the sqrt function converts to a Float64, which discards too many bits for the result to be accurate. e.g.

z = int128(typemax(Int64))
isqrt(z*z) - z

gives 1 rather than 0 (and there are other numbers where the difference is larger). A simple fix would be to use BigFloat with sufficient precision for isqrt(x::Int128), although I'm not sure if there is a more efficient solution (or whether we care).

The text was updated successfully, but these errors were encountered:

StefanKarpinski · 2013-11-21T19:19:05Z

This feels like one of those cases where you can use Float64 to approximate and then patch up the answer quickly if it's not quite correct.

StefanKarpinski · 2013-11-21T19:44:36Z

The computation n>>((128-leading_zeros(n))>>1) gets you into the range [sqrt(n),sqrt(2n)].

stevengj · 2013-11-21T22:04:19Z

I wonder if you can use Newton's method, operating on Int128, to get within ±n for some small n, and then just exhaustively search.

stevengj · 2013-11-21T22:14:40Z

The following seems to work, though it may be suboptimal:

function isqrt2(x::Int128)
    s = convert(Int128, trunc(sqrt(x)))
    s = div(s + div(x),s), 2) # Newton step
    while s*s > x
        s -= 1
    end
    while (s+1)*(s+1) <= x
        s += 1
    end
    return s
end

StefanKarpinski · 2013-11-21T22:15:14Z

I can't prove that it's correct, but a single iteration of Newton's method with integer ops works in all cases I've tried:

function isqrt(x::Int128)
    s = convert(Int128,trunc(sqrt(x)))
    (s + div(x,s)) >> 1
end

stevengj · 2013-11-21T22:17:59Z

You're right, the "fixup" loops in my version never seem to be needed. I wonder why it's not off by one occasionally?

StefanKarpinski · 2013-11-21T22:19:55Z

I think you're always close enough from the floating-point approximation that a single Newton step gets you there.

StefanKarpinski · 2013-11-21T22:50:27Z

Ok, this is incredibly hand-wavy at best, but Newton's method for sqrt converges quadratically, so every iteration should double the number of accurate digits. Since the floating-point computation gives 52 correct bits, that means that one more iteration should give 104 correct bits. Since the largest possible sqrt for a 128-bit integer only requires 64 bits, this suffices. Of course, that's some pretty vigorous handwaving.

RauliRuohonen · 2013-11-21T22:53:34Z

I was also about to handwave the same way :) Anyhow, loops or assert would be nice anyway to ensure that wrong results are never returned no matter what (<- simple paranoia).

BTW, the "(s+1)(s+1) <= x" check won't work near typemax(Int128), because (s+1)(s+1) wraps around to negatives. "(s+1)*(s+1)-x <= 0" is better.

stevengj · 2013-11-21T22:56:28Z

I know an exact Newton iteration would suffice. I was worried more about the round off errors in the integer divisions, but upon reflection I think those can be bounded to show that the final answer can't be off by more than one.

stevengj · 2013-11-21T22:57:30Z

Or rather, that the final answer is off by less than 1.

StefanKarpinski · 2013-11-21T23:04:41Z

Btw, isqrt is wrong for Int64 as well:

julia> isqrt(9223372030926249000)^2
9223372030926249001

The same trick fixes it:

julia> function Base.isqrt(x::Int64)
           s = convert(Int64,trunc(sqrt(x)))
           (s + div(x,s)) >> 1
       end
isqrt (generic function with 4 methods)

julia> isqrt(9223372030926249000)^2
9223372024852248004

julia> isqrt(9223372030926249000)
3037000498

stevengj · 2013-11-22T00:44:15Z

I would think we can do something cheaper for Int64...

StefanKarpinski · 2013-11-22T03:30:43Z

Probably, although if it's more than adding the result of a comparison, it's probably not much cheaper.

GunnarFarneback · 2013-11-26T22:42:56Z

Here's an off by one example.

julia> a = int128(typemax(Int64))
9223372036854775807

julia> isqrt(a*a-1)
9223372036854775807

In general x*x-y for large x and small y are susceptible to having isqrt off by one by the current implementation.

StefanKarpinski · 2013-11-26T23:07:17Z

Yup, good find. That's definitely a problem. Looks like 128-bit at least may need two Newton iterations.

stevengj · 2013-11-27T02:09:27Z

You don't need another Newton iteration to correct an off-by-one error.

StefanKarpinski · 2013-11-27T02:30:44Z

Perhaps not, but I'm finding all sorts of problems with these now that I'm looking harder.

jiahao · 2013-11-27T02:59:53Z

Perhaps you can get a more systematic sense of what's needed by taking a random sample and looking at how many Newton iterations it takes for the solution to change by less than (say) 0.1.

GunnarFarneback · 2013-11-27T06:50:32Z

Throwing Newton iterations at this problem does not solve anything since they won't converge for x^2-1 numbers. The off by one solution x is moved to x-1, but another Newton iteration takes you back to x.

To make matters worse isqrt is currently rather ugly in the small end:

julia> isqrt(3)                                                                 
2

julia> isqrt(0)
ERROR: integer division error
 in isqrt at intfuncs.jl:317

StefanKarpinski · 2013-11-27T07:25:31Z

Yes, the former isqrt behavior was arguably better, even though it gave numbers larger than desired sometimes. What we have now is rather a mess. This is why I didn't want to just "throw Newton iterations" at it, as you put it.

GunnarFarneback · 2013-11-27T10:11:21Z

It's not that far off. Obviously 0 needs special casing and negative values could do with a more specific domain error than the one thrown from the sqrt call. Otherwise it's just an off by one correction that's needed.

Some of the handwaving above can be tightened up by the following observation:

If y1 = sqrt(x)*(1+e), one exact Newton iteration will give
y2 = 0.5 * (sqrt(x)*(1+e)+sqrt(x)/(1+e))
Using the identity 1/(1+e) = 1-e+e^2/(1+e) gives
y2 = sqrt(x)*0.5*(1+e+1-e+e^2/(1+e)) = sqrt(x)*(1+e^2/(2*(1+e)))
Assuming e < 1 we have
sqrt(x) <= y2 <= sqrt(x) + e^2/(2*(1+e))*sqrt(x)

Even quite crude bounds on the relative error e from the trunc(sqrt(x)) computation in double precision are sufficient to prove that sqrt(x) <= y2 <= sqrt(x) + 1 in exact mathematics and taking the integer divisions into account that sqrt(x) - 1 <= y2 <= sqrt(x) + 1.

GunnarFarneback · 2013-11-27T10:18:27Z

The assumption should be e>-1, obviously, so that 1+e doesn't switch sign.

* upstream/master: (89 commits) fix JuliaLang#5225 update pcre fix off-by-1 in isqrt. closes JuliaLang#4884 Add more keywords to ctags regex, plus README annotate the types of arguments for derived trigonometric & hyperbolic functions fix doc for && and || and update helpdb only show ccall literal address warning in imaging mode. closes JuliaLang#5215 minor update of hypot to ensure consistency of output types Fix JuliaLang#5217 silence compiler warning hopefully more robust way of getting github URL (don't assume module name is Pkg name) add text/html writemime for MethodList and Method (fix JuliaLang#4952) update NEWS doc: `import M: single,name` syntax, close JuliaLang#5214 clean up native finalizers code specialized abs2 for bool remove use of callback API in REPL Some error message cleanup to fix segfault when transposing sparse vector with illegal values. test/git*.jl: don't use `echo` to read-and-write from processes. test/git*.jl: don't use `echo` to read-and-write from processes. ...

JeffBezanson closed this as completed in ed0374b Nov 22, 2013

StefanKarpinski reopened this Nov 27, 2013

JeffBezanson closed this as completed in 2e9cccb Dec 22, 2013

devmotion mentioned this issue Jun 30, 2023

Fix MultinomialSampler overflow JuliaStats/Distributions.jl#1744

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

isqrt is a mess #4884

isqrt is a mess #4884

stevengj commented Nov 21, 2013 •

edited by andreasnoack

Loading

StefanKarpinski commented Nov 21, 2013

StefanKarpinski commented Nov 21, 2013

stevengj commented Nov 21, 2013

stevengj commented Nov 21, 2013

StefanKarpinski commented Nov 21, 2013

stevengj commented Nov 21, 2013

StefanKarpinski commented Nov 21, 2013

StefanKarpinski commented Nov 21, 2013

RauliRuohonen commented Nov 21, 2013

stevengj commented Nov 21, 2013

stevengj commented Nov 21, 2013

StefanKarpinski commented Nov 21, 2013

stevengj commented Nov 22, 2013

StefanKarpinski commented Nov 22, 2013

GunnarFarneback commented Nov 26, 2013

StefanKarpinski commented Nov 26, 2013

stevengj commented Nov 27, 2013

StefanKarpinski commented Nov 27, 2013

jiahao commented Nov 27, 2013

GunnarFarneback commented Nov 27, 2013

StefanKarpinski commented Nov 27, 2013

GunnarFarneback commented Nov 27, 2013

GunnarFarneback commented Nov 27, 2013

isqrt is a mess #4884

isqrt is a mess #4884

Comments

stevengj commented Nov 21, 2013 • edited by andreasnoack Loading

StefanKarpinski commented Nov 21, 2013

StefanKarpinski commented Nov 21, 2013

stevengj commented Nov 21, 2013

stevengj commented Nov 21, 2013

StefanKarpinski commented Nov 21, 2013

stevengj commented Nov 21, 2013

StefanKarpinski commented Nov 21, 2013

StefanKarpinski commented Nov 21, 2013

RauliRuohonen commented Nov 21, 2013

stevengj commented Nov 21, 2013

stevengj commented Nov 21, 2013

StefanKarpinski commented Nov 21, 2013

stevengj commented Nov 22, 2013

StefanKarpinski commented Nov 22, 2013

GunnarFarneback commented Nov 26, 2013

StefanKarpinski commented Nov 26, 2013

stevengj commented Nov 27, 2013

StefanKarpinski commented Nov 27, 2013

jiahao commented Nov 27, 2013

GunnarFarneback commented Nov 27, 2013

StefanKarpinski commented Nov 27, 2013

GunnarFarneback commented Nov 27, 2013

GunnarFarneback commented Nov 27, 2013

stevengj commented Nov 21, 2013 •

edited by andreasnoack

Loading