
Provide AD gradient for MLE/MAP #1369
Merged · 9 commits merged into master, Aug 20, 2020

Conversation

@cpfiffer (Member) commented Aug 4, 2020

Currently, MLE/MAP is not using the AD gradients calculated with gradient_logp, as noted in #1365. This PR modifies the MLE/MAP code to use the AD-generated gradient.

I incremented the version number to 0.13.1, but this might have to be 0.14.0 since we are technically dropping support for second-order methods with this PR. I intend to follow up in a separate PR with code that adds a hessian_logp method so people can use Optim.Newton or whatever -- it'll also make the information matrix calculations much better, since they can use the AD Hessian.
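
For illustration only -- a minimal, self-contained sketch of the mechanism, not this PR's actual code (neg_logp is a stand-in for the negative log density Turing builds from a model, and ForwardDiff stands in for whatever AD backend is configured). Optim's only_fg! interface takes one callback that returns the objective and fills the gradient in place, which is where an AD gradient can be plugged in instead of Optim's finite-difference default:

using Optim, ForwardDiff

# Stand-in for -logp(θ); the real code evaluates the model's log density.
neg_logp(x) = 0.5 * sum(abs2, x .- 3)

# fg! serves objective and gradient requests through a single callback.
function fg!(F, G, x)
    if G !== nothing
        # Fill the gradient via AD instead of finite differences.
        ForwardDiff.gradient!(G, neg_logp, x)
    end
    if F !== nothing
        return neg_logp(x)
    end
    return nothing
end

result = Optim.optimize(Optim.only_fg!(fg!), zeros(2), Optim.LBFGS())

With a first-order optimizer such as LBFGS, every gradient evaluation then goes through the AD backend rather than through numerical differentiation.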

src/modes/ModeEstimation.jl (resolved)
@@ -369,6 +395,11 @@ function _optimize(
     args...;
     kwargs...
 )
+    # Throw an error if we received a second-order optimizer.

Member:
Do we have to do that? Doesn't Optim just use ForwardDiff (or FD?) to compute the Hessian in this case? If that's the case, then we shouldn't throw an error IMO. It might not be the most efficient approach and would not adhere to the user-provided AD settings but as long as it works we could only print a warning.
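
(For reference, a rough sketch of that fallback -- not Turing code, and assuming Optim's documented behaviour: without user-supplied derivative functions, Newton falls back to finite differences, and autodiff = :forward switches it to ForwardDiff.)

using Optim

rosenbrock(x) = (1.0 - x[1])^2 + 100.0 * (x[2] - x[1]^2)^2

# No gradient or Hessian is supplied; with autodiff = :forward, Optim
# generates both with ForwardDiff instead of finite differences.
result = Optim.optimize(rosenbrock, zeros(2), Optim.Newton(); autodiff = :forward)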

Contributor:
In FD, yes; but (for example, in the project I am working on) it could be that users only define custom adjoints for the gradients, not the Hessian. So even if the user provides an AD backend, it might not be a great idea to use it for the Hessian by default.

Member:
You mean we shouldn't even print a warning? Would be fine with me as well.

Contributor:
Oh, I think throwing an error when a Hessian-requiring optimizer is received is a great idea, just like what Cameron did here.

Member:
If we want to throw an error when the Hessian is evaluated, I suggest using only_fgh!(f) and implementing f(F, G, H, x) so that it contains the check

if H !== nothing
    error("second order methods are not supported at the moment")
end

In general, this approach is more flexible, avoids baking a hardcoded check for a special type from a different package into our implementation, and avoids incorrect and unexpected behaviour for second-order optimization algorithms that don't subtype this specific type (since multiple inheritance is not possible in Julia, that's not an impossible scenario per se).
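
A sketch of what that could look like -- illustrative only: neg_logp stands in for the negative log density, ForwardDiff for the configured AD backend, and it assumes an Optim version in which only_fgh! works with first-order optimizers (see the Optim.jl issue discussed further down in this thread):

using Optim, ForwardDiff

neg_logp(x) = 0.5 * sum(abs2, x)   # stand-in for the negative log density

# One callback serves objective, gradient, and Hessian requests; a Hessian
# request errors out regardless of which optimizer type asked for it.
function fgh!(F, G, H, x)
    H === nothing || error("second order methods are not supported at the moment")
    G === nothing || ForwardDiff.gradient!(G, neg_logp, x)
    F === nothing || return neg_logp(x)
    return nothing
end

# First-order methods never request H, so this runs; Optim.Newton() would
# hit the error branch on its first Hessian evaluation.
Optim.optimize(Optim.only_fgh!(fgh!), randn(3), Optim.LBFGS())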

Comment:
It's a bug. I think there might be an issue for it.

Member:
I just found JuliaNLSolvers/Optim.jl#718, I guess that's the related issue.

Comment:
Yeah, I have a fix. Sorry to cossio for waiting a year and a half 😬

Comment:
I mean, I will tag a fix in an hour or so, so please don't special case with a branch.

@pkofod commented Aug 7, 2020

Project.toml (outdated)
@@ -1,6 +1,6 @@
 name = "Turing"
 uuid = "fce5fe82-541a-59a6-adf8-730c64b5f9a0"
-version = "0.13.0"
+version = "0.13.1"

Member:
We should make sure that we haven't introduced any breaking changes since 0.13.0. (IMO we should adopt the ColPrac practice of making patch releases for every PR).

Member Author:
I'll just bump it up to 0.14.0. Honestly at this point we should consider moving to 1.0 as well.

Contributor:
More importantly, the current 0.13.0 fails by default, so ] add Turing followed by using Turing will fail. I think you may want to bump the version really soon...

Member:
We first have to fix the bug introduced by the changes in PDMats 0.10 on master before releasing 0.14.0. What was your package setup that failed, i.e. can you post the output of ] st?

Contributor:
Oh, I have mine on Turing#master, but more than one person on Slack is facing an issue: ] add Turing installs an older version and using Turing somehow fails.

Member:
Usually these problems are caused by unbounded compatibilities of old Turing versions (there are many closed issues about this in the repo). These issues should be fixed by running ] add [email protected] and possibly adjusting conflicting packages (by users) and adding correct bounds in the registry (by us). I fixed some bounds a while ago, but it seems the old versions are still missing some compatibility bounds.

@@ -147,6 +147,50 @@ function (f::OptimLogDensity)(z)
     return -DynamicPPL.getlogp(varinfo)
 end
 
+function (f::OptimLogDensity)(F, G, z)

Member:
I'm not sure if it's useful to keep this separate definition? It seems we only need f(F, G, H, z), so the implementation could just be included there directly.

@wupeifan (Contributor):
It seems that there is nothing blocking this PR, I think?

@devmotion closed this Aug 19, 2020
@devmotion reopened this Aug 19, 2020

@devmotion (Member):
Tests fail currently.

@cpfiffer (Member, Author):
I'll take a look when I'm done teaching today; looks like I've done something strange to the methods.

codecov bot commented Aug 20, 2020

Codecov Report

Merging #1369 into master will increase coverage by 0.16%.
The diff coverage is 88.88%.


@@            Coverage Diff             @@
##           master    #1369      +/-   ##
==========================================
+ Coverage   66.79%   66.95%   +0.16%     
==========================================
  Files          25       25              
  Lines        1605     1619      +14     
==========================================
+ Hits         1072     1084      +12     
- Misses        533      535       +2     
Impacted Files                    Coverage Δ
src/core/compat/reversediff.jl    90.47% <ø> (ø)
src/core/compat/zygote.jl         100.00% <ø> (ø)
src/modes/ModeEstimation.jl       66.95% <85.71%> (+2.25%) ⬆️
src/core/ad.jl                    74.24% <100.00%> (+0.39%) ⬆️

Continue to review full report at Codecov.

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 43e2f20...fa7c59f.

@devmotion (Member) left a review comment:
LGTM.

@cpfiffer merged commit f2f6665 into master Aug 20, 2020
@devmotion deleted the csp/explicit-gradient branch August 20, 2020 14:02