add test to enforce infinite buffer size for all applicable datapipes #5707

Merged: pmeier merged 4 commits into pytorch:main from datasets-infinite-buffer on Mar 30, 2022

Conversation

@pmeier (Collaborator) commented Mar 30, 2022

Given that it is not acceptable for our datasets to drop any samples, we need to use an infinite buffer size in all datapipes that have a variable-sized buffer. This PR adds a test that enforces this.

Currently we use

# pseudo-infinite until a true infinite buffer is supported by all datapipes
INFINITE_BUFFER_SIZE = 1_000_000_000

but that hopefully changes after pytorch/data#335 is resolved.
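A minimal sketch of the kind of check described above, assuming a simplified pipeline in which each pipe exposes its upstream pipe as `.source`. The classes `Source`, `Buffered`, and `Mapped` and the helpers `iter_pipes` and `find_finite_buffers` are hypothetical stand-ins for illustration; they are not the datapipes or the test added in this PR. The check looks for a `buffer_size` attribute rather than for specific types.

```python
# Illustrative sketch only: the classes below are hypothetical stand-ins,
# not real datapipes and not the test code from this PR.
INFINITE_BUFFER_SIZE = 1_000_000_000


class Source:
    """Hypothetical leaf pipe without a buffer."""

    def __init__(self, items):
        self.items = list(items)


class Buffered:
    """Hypothetical pipe with a variable-sized buffer, exposed as `buffer_size`."""

    def __init__(self, source, buffer_size):
        self.source = source
        self.buffer_size = buffer_size


class Mapped:
    """Hypothetical pipe without a buffer that wraps another pipe."""

    def __init__(self, source, fn):
        self.source = source
        self.fn = fn


def iter_pipes(pipe):
    """Yield `pipe` and every upstream pipe reachable through `.source`."""
    while pipe is not None:
        yield pipe
        pipe = getattr(pipe, "source", None)


def find_finite_buffers(pipe):
    """Return the pipes whose `buffer_size` is not INFINITE_BUFFER_SIZE."""
    return [
        p
        for p in iter_pipes(pipe)
        if hasattr(p, "buffer_size") and p.buffer_size != INFINITE_BUFFER_SIZE
    ]


good = Mapped(Buffered(Source(range(3)), INFINITE_BUFFER_SIZE), str)
bad = Mapped(Buffered(Source(range(3)), 10_000), str)

# A test in this spirit would fail whenever the list of offenders is non-empty.
assert find_finite_buffers(good) == []
assert [type(p).__name__ for p in find_finite_buffers(bad)] == ["Buffered"]
```

Checking for the attribute instead of a fixed list of types means that any newly introduced buffered pipe is covered without updating the check.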


facebook-github-bot commented Mar 30, 2022

💊 CI failures summary and remediations

As of commit 7e36fd9 (more details on the Dr. CI page):


💚 💚 Looks good so far! There are no failures yet. 💚 💚



@NicolasHug (Member) left a comment


Thanks @pmeier, approving to unblock, but I have a few minor comments.

test/test_prototype_builtin_datasets.py (2 review threads, outdated and resolved)
@pmeier merged commit 93104c1 into pytorch:main Mar 30, 2022
@pmeier deleted the datasets-infinite-buffer branch March 30, 2022 14:31
facebook-github-bot pushed a commit that referenced this pull request Apr 6, 2022
…datapipes (#5707)

Summary:
* add test to enforce infinite buffer size for all applicable datapipes

* use utility function to extract datapipes

* check for buffer_size attr rather than type

* simplify

Reviewed By: NicolasHug

Differential Revision: D35393173

fbshipit-source-id: d6e1e28169abec0f206db4f6e8db3d1cd7b91edd