
fix: Switch to sequential processing in batch_process to resolve thread-safety issues #169

Merged · 4 commits · Jan 10, 2025

Conversation

fg-nava
Contributor

@fg-nava fg-nava commented Jan 9, 2025


Ticket

https://navalabs.atlassian.net/browse/DST-688

Changes

  • Modified batch_process.py to use sequential processing instead of parallel processing
  • Added explanatory comments about the thread-safety concerns with LiteLLM

Context for reviewers

This PR addresses the Docker deployment crashes (CPU >1400%) seen when processing CSVs with multiple rows. Investigation revealed that the high CPU usage was caused by thread-safety issues in the underlying LiteLLM client libraries when running in parallel.

The fix is simple but effective: we've removed the parallel processing implementation and switched to sequential processing. This change resolves the CPU spike issues we were seeing in Docker deployments.
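The diff itself is not reproduced here, but the shape of the change can be sketched as follows. This is an illustrative sketch only: `_process_question` below is a hypothetical stand-in for the real per-row worker that calls the LiteLLM-backed chat engine.

```python
from concurrent.futures import ThreadPoolExecutor


def _process_question(row: dict) -> str:
    # Hypothetical per-row worker; the real one calls the LiteLLM-backed
    # chat engine, whose client libraries are not thread-safe.
    return f"answer for {row['question']}"


def batch_process_parallel(rows: list[dict]) -> list[str]:
    # Before: rows fanned out across a thread pool. Concurrent calls into
    # the non-thread-safe client drove CPU usage past 1400% in Docker.
    with ThreadPoolExecutor() as pool:
        return list(pool.map(_process_question, rows))


def batch_process_sequential(rows: list[dict]) -> list[str]:
    # After: one row at a time. Slower in principle, but CPU usage stays
    # stable and results come back in input order.
    return [_process_question(row) for row in rows]
```

Both functions return results in input order; the sequential version simply trades wall-clock parallelism for stability.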

Follow-up Items (to be tracked in separate tickets):

  1. UI feedback issue: "File processed, results attached" message appears in logs but not in UI
  2. Assessment needed: Determine if parallel processing is necessary for performance and if so, investigate thread-safe alternatives
  3. Test coverage: Add integration tests for batch runs of _process_question (currently only have mock calls to chat engine)
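For follow-up item 2, one candidate worth assessing (an assumption for illustration, not part of this PR) is keeping the thread pool but giving each worker thread its own client instance via `threading.local`, so no non-thread-safe state is shared. `FakeClient` below is a hypothetical stand-in for the LiteLLM-backed chat engine:

```python
import threading
from concurrent.futures import ThreadPoolExecutor


class FakeClient:
    # Hypothetical stand-in for the LiteLLM-backed chat engine client.
    def ask(self, question: str) -> str:
        return f"answer for {question}"


_local = threading.local()


def _get_client() -> FakeClient:
    # Lazily create one client per worker thread, so threads never
    # share the (assumed) non-thread-safe client state.
    if not hasattr(_local, "client"):
        _local.client = FakeClient()
    return _local.client


def batch_process_per_thread_client(rows: list[dict]) -> list[str]:
    with ThreadPoolExecutor(max_workers=4) as pool:
        return list(pool.map(lambda r: _get_client().ask(r["question"]), rows))
```

Whether this actually resolves the CPU spikes depends on where the thread-safety problem lives in the client libraries, which is exactly what the follow-up assessment would need to determine.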

Testing

Tested locally by:

  • Ran batch processing with multi-row CSVs
  • Confirmed CPU usage remains stable
  • Verified all rows are processed successfully in sequential order

The change eliminates the >1400% CPU spikes previously observed in Docker deployments while maintaining functionality.

@fg-nava fg-nava requested a review from a team January 9, 2025 19:50

github-actions bot commented Jan 9, 2025

☂️ Python Coverage

current status: ✅

Overall Coverage

Lines  Covered  Coverage  Threshold  Status
2845   2543     89%       80%        🟢

New Files

No new covered files...

Modified Files

File                      Coverage  Status
app/src/batch_process.py  100%      🟢
TOTAL                     100%      🟢

updated for commit: 025092d by action🐍

Review thread on app/src/batch_process.py (outdated, resolved)
@yoomlam yoomlam changed the title DST-688: Switch to sequential processing in batch_process to resolve thread-safety issues fix: Switch to sequential processing in batch_process to resolve thread-safety issues Jan 9, 2025
@fg-nava
Contributor Author

fg-nava commented Jan 10, 2025

This can be closed without merging as it is upstream of #170

@fg-nava fg-nava closed this Jan 10, 2025
@yoomlam
Contributor

yoomlam commented Jan 10, 2025

@fg-nava You should have merged this PR so that the title "fix: Switch to sequential processing in batch_process" is included as a separate commit message in main. Then merge #170, which will have a separate commit message that says "fix: Ensure UI feedback message for batch processing completion"

@fg-nava fg-nava reopened this Jan 10, 2025
@fg-nava fg-nava merged commit 9596f57 into main Jan 10, 2025
8 checks passed
@fg-nava fg-nava deleted the DST-688-batch-processing-limitations branch January 10, 2025 18:46