Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix ignoring sessions that have no data #1035

Open
wants to merge 13 commits into
base: main
Choose a base branch
from

Conversation

drammock
Copy link
Member

@drammock drammock commented Jan 7, 2025

The addition of allow_missing_sessions in #1000 had some problems:

  1. it was storing [None] in the subjs_sessions dict in cases where a subject had no valid sessions; this was not behaving as expected
  2. the use of mne_bids.get_entity_vals was not working as expected, because even if a subject had no data for a session for a given task, the presence of coordsystem.json or scans.tsv sidecars was causing it to report that the subject had some files for that session.

Item (2) was fixed in mne-tools/mne-bids#1362; item (1) is fixed here. Works on my data, but will self-review to add some questions about implementation/testing.

Before merging …

  • Changelog has been updated (docs/source/dev.md.inc)

Copy link
Member Author

@drammock drammock left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

regarding testing, @larsoner and I came up with the plan to use a small dataset we have from UW (after BIDSifying it and uploading to OSF). It could make sense to add that in this PR, or a separate one

@@ -159,9 +172,9 @@ def get_subjects_sessions(config: SimpleNamespace) -> dict[str, list[str] | list
f"{tuple(sorted(missing_sessions))}, and "
"`config.allow_missing_sessions` is False"
)
subj_sessions[subject] = sorted(set(cfg_sessions) & set(valid_sessions_subj))
if subj_sessions[subject] == []:
subj_sessions[subject] = [None]
Copy link
Member Author

@drammock drammock Jan 7, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this wasn't doing what was intended. We want [None] (or (None,)) for datasets where sessions aren't tracked (ie there's just one session). But for datasets where sessions are tracked, we should omit an entry from this subj_sessions dict if there aren't any sessions found for that subj.

Comment on lines +164 to +165
ignore_acquisitions=("calibration", "crosstalk"),
ignore_suffixes=("scans", "coordsystem"),
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These two lines are the important bit, for getting MNE-BIDS to only tell us if data files are present.
This ignore_suffixes line requires upstream mne-tools/mne-bids@41ce36a to work, so we'll need to pin MNE-BIDS to that commit, or wait for the next MNE-BIDS release before merging here.

I'm also not really sure if coordsystems.json and scans.tsv are the only possible sidecars that could have the undesirous effect (of having get_entity_vals return a false positive when no data files are actually present for the requested session --- cf. discussion at mne-tools/mne-bids#1351). If you can think of other sidecar suffixes to exclude, speak now.

@drammock drammock marked this pull request as ready for review January 10, 2025 16:00
@@ -43,16 +43,18 @@ def _download_from_web(*, ds_name: str, ds_path: Path) -> None:

ds_path.mkdir(parents=True, exist_ok=True)
path = ds_path.parent.resolve(strict=True)
fname = f"{ds_name}.zip"
ext = "tar.gz" if options.get("processor") == "untar" else "zip"
processor = pooch.Untar if options.get("processor") == "untar" else pooch.Unzip
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this sets up a problem that is fixed by fatiando/pooch#458, but I wonder if it's better to set up a warning filter here rather than pin to a pooch fork / wait for pooch PR merge or even pooch release... WDYT?

Comment on lines +9 to +18
l_freq = None
h_freq = 40.0
use_maxwell_filter: bool = True


# Epochs
epochs_tmin = -0.08
epochs_tmax = 0.18
epochs_decim = 10 # 2000->200 Hz
baseline = (None, 0)
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this is all copied from a different dataset's config, may not be most suitable, TBD. LMK if you have suggestions here.

Comment on lines +290 to +291
# TODO edit this test to monkeypatch the dataset, making it seem like
# there's only one subject who has different MRIs for different sessions
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

just calling out this TODO; you can mostly ignore this test function for now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant