Which experimental configuration yields the largest expected gain in information? The utility can be expressed as the exected reduction in entropy between the prior and the posterior, i.e.,
This quantity is especially challenging because the posterior is intractable in most practical applications. We can however, draw samples from the likelihood model (our simulator). We seek to obtain the experimental configuration which maximizes the utility:
Problem: for every evaluation of the utility, the simulator needs to be called because the expectation depends on . Slow and cumbersome!
Proposal: Reweigh the samples from the joint to approximate with several ratio estimators that can be trained on samples from the joint alone! In doing so, we can estimate the EIG for every experimental configuration by reusing presimulated samples for specific experimental configurations. In addition, this specification allows us to maximize the expected information gain through gradient ascent by simply backpropagating through the neural networks. For more details check the notebook!
See LICENSE file.