wrong code to calculate the expected utility function with respect to the initial state distribution #1

Open
siyaoZHAO opened this issue Nov 27, 2024 · 0 comments

Comments

@siyaoZHAO

Hi, great work, but there seems to be a problem with the code in dilo.py.
Lines 137~143 calculate the expected utility function with respect to the initial state distribution. In the code, the coefficient in front of the expectation is (1 - lambda), whereas in the paper's formula it is (1 - discount)*beta. The code also should not need a conditional check on the divergence type at this point.
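
For concreteness, here is a minimal sketch of the two scalings I am comparing. All names and numeric values are hypothetical placeholders; nothing here is copied from dilo.py or from the paper:

```python
def initial_state_term(v0_values, coeff):
    """Scale the mean utility over sampled initial states by a coefficient."""
    return coeff * sum(v0_values) / len(v0_values)


# Hypothetical hyperparameters and utility values, chosen only for illustration.
discount, beta, lam = 0.99, 0.5, 0.8
v0 = [1.0, 2.0, 3.0]

# Coefficient as I read it in the code: (1 - lambda)
code_term = initial_state_term(v0, 1.0 - lam)

# Coefficient as I read it in the paper: (1 - discount) * beta
paper_term = initial_state_term(v0, (1.0 - discount) * beta)

print(code_term, paper_term)
```

With placeholder values like these, the two coefficients scale the initial-state term very differently, so I assume the choice affects the loss unless lambda is defined in a way that makes them equal.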

Is the code wrong here?
