RenyiELBO/IWAE fail to converge on AIR example #3289

horizon-blue · 2023-10-31T17:38:39Z

Issue Description

Hello Pyro folks. I was trying to use IWAE in the AIR example to see if the stricter lower bound yields better performance than the standard ELBO. However, after swapping out the elbo method with RenyiELBO(alpha=0), the model fails to converge completely. As a diagnostic, I also tried setting num_particles=1 to see if it at least falls back to the standard ELBO behavior, but the accuracy of the AIR model still does not improve at all. After reading #2220, I also tried reducing batch_size=1, yet there's no change in the performance of the model either.

I'm wondering if any of you might have some insights on what could cause the performance discrepancy between RenyiELBO vs TraceGraph_ELBO? Thank you very much :)

Environment

For any bugs, please provide the following:

Platform: MacOS 14.0, Python 3.11.5
Pyro 1.8.6
PyTorch 2.1.0

Code Snippet

The issue could be reproduced by running the AIR Example in Pyro's codebase and replacing this elbo setting with RenyiELBO().

The text was updated successfully, but these errors were encountered:

martinjankowiak · 2023-10-31T18:02:37Z

we would need to get something like #3123 merged.

what happens when you use TraceGraphELBO with multiple particles?

horizon-blue · 2023-11-01T21:23:18Z

Thanks for the reply @martinjankowiak .

I pull the changes from #3123, but sadly, it doesn't seem to fix the issue (and the ELBO is even worse). I also include the result of TraceGraph_ELBO with two particles:

martinjankowiak · 2023-11-01T22:51:45Z

oh sorry i wasn't thinking clearly when i first read this. AIR has discrete latent variables. to deal with that you can either sum them out (not really viable here) or use a stochastic gradient estimator. TraceGraphELBO uses a fancier and thus much lower variance gradient estimator that makes use of the fine-grained conditional independent structure of the model. RenyiELBO cannot do this and so results in a much higher variance gradient estimator---actually so much higher that it's evidently not usable. so this is expected

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RenyiELBO/IWAE fail to converge on AIR example #3289

RenyiELBO/IWAE fail to converge on AIR example #3289

horizon-blue commented Oct 31, 2023

martinjankowiak commented Oct 31, 2023

horizon-blue commented Nov 1, 2023

martinjankowiak commented Nov 1, 2023

RenyiELBO/IWAE fail to converge on AIR example #3289

RenyiELBO/IWAE fail to converge on AIR example #3289

Comments

horizon-blue commented Oct 31, 2023

Issue Description

Environment

Code Snippet

martinjankowiak commented Oct 31, 2023

horizon-blue commented Nov 1, 2023

martinjankowiak commented Nov 1, 2023