Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Vec discount fix #27

Merged
merged 4 commits into from
Feb 27, 2020
Merged

Conversation

galatolofederico
Copy link
Contributor

Sorry if I am opening so many PRs 😅
There was i bug in the td.discount function when using it in a vectorized environment. I fixed it and wrote a test that compute the discounted rewards of a replay coming from a non vectorized environment and the discounted rewards of a replay coming from a vectorized environment and check if they are the same.
This PR is needed to make #26 work because my A2C implementation uses a vectorized environment

@seba-1511
Copy link
Member

This looks great @galatolofederico, thanks a lot for catching this bug!

I'll ask for a small change: could you add one line about this PR in CHANGELOG.md explaining the fix and ending with your GitHub handle ? This makes it easy to keep track of what changed and when it did.

Thanks again!

@seba-1511 seba-1511 merged commit d44cfe1 into learnables:master Feb 27, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants