Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add padding while saving ivf.pid.pt and release GIL #336

Merged
merged 1 commit into from
May 28, 2024

Conversation

VThejas
Copy link
Contributor

@VThejas VThejas commented Apr 19, 2024

We append the padding before saving ivf.pid.pt. This is so that when we load this back, we can avoid the memory spike caused while appending the padding after loading. Functionally, there is no difference.

Also GIL is released for more parallelism.

@VThejas VThejas requested review from santhnm2 and okhat April 19, 2024 20:42
@VThejas
Copy link
Contributor Author

VThejas commented May 28, 2024

@santhnm2 @okhat I tested this out and it works, will merge this.

@VThejas VThejas merged commit 862edcf into main May 28, 2024
@erjieyong
Copy link

erjieyong commented Dec 2, 2024

The padding sems to interfere when utilising IndexUpdater.

Specifically, there's an assert statement in the function that checks the length and the zero paddings results in an error.

assert ivf_runner == len(old_ivf)

submitted #380 to handle the padded zeros.

Also mentioned in this #369 (comment)

erjieyong added a commit to erjieyong/ColBERT that referenced this pull request Dec 2, 2024
to account for padded zeros added by stanford-futuredata#336
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants