Confused by λ(i) #26

robz · 2021-05-17T01:44:04Z

In the "Black-Box Variational Inference" section of the VAE notes:

We first do per-sample optimization of q by iteratively applying the update
λ(i) ← λ(i) + ∇̃ λ ELBO(x(i); θ, λ(i))
We then perform a single update step based on the mini-batch
θ ← θ + ∇̃ θ ∑i ELBO(x(i); θ, λ(i))

If I understood correctly, x(i) is the ith sample from a batch B of the dataset D, and λ is a vector of parameters of the distribution "qλ(z)". What is λ(i)?

Is it the ith parameter of λ? That would imply that the length of B is equal to the dimension of the λ--if so, it's unclear to me why they would be equal.

Another possibility is that λ(i) is the ith update to λ. If so, perhaps it would be better rewritten like this:

λ(i+1) ← λ(i) + ∇̃ λ ELBO(x(i); θ, λ(i))

But if that's the case, then it's unclear to me why it appears in the θ update:

θ ← θ + ∇̃ θ ∑i ELBO(x(i); θ, λ(i))

Apologies if I've missed something obvious here. Also, thanks for notes--they've been very helpful!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Confused by λ(i) #26

Confused by λ(i) #26

robz commented May 17, 2021

Confused by λ(i) #26

Confused by λ(i) #26

Comments

robz commented May 17, 2021