[PY] fix: #1705 - Output of multiple calls to same tool are overwritten #2228

andres-swax · 2024-12-10T21:13:07Z

The current implementation of the planner does not account for multiple calls to the same tool, because it does not keep track of the call id (call.id), which is different for each call regardless of the tool.

As a result, the list of tool calls is indexed only by tool_id.

Subsequent calls to the same tool within the same run therefore overwrite the result of the previous call, and at the end of the run some results are missing, which triggers an Exception.

The issue is fixed by adding a sublevel to the list of tools called as part of a run (this list is stored in assistants_planner.py as tool_map: dict). The sublevel keeps track of the unique call_id, so that result values are not overwritten by subsequent calls to the same tool within the same run.

Linked issues

closes: #1705

Details

To trigger this issue, submit a prompt that causes multiple calls to the same tool in the same run (e.g. "which city is warmer today, LA or Chicago?"). The same tool (e.g. get-weather) is called once for each city, but the data returned contains results for only one city. This results in an Exception being raised. Steps to reproduce can be found in #1705.
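The overwrite can be illustrated with a minimal, self-contained sketch. It does not use the library's code: `ToolCall`, `run_tool`, and `collect_outputs_flat` are hypothetical names, and the tool output is a stub. It only assumes, as described above, that results are stored in a dict keyed by the tool name alone.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class ToolCall:
    """Hypothetical stand-in for one tool call requested during a run."""
    id: str        # unique per call
    tool_id: str   # tool name, shared by every call to that tool
    argument: str


def run_tool(call: ToolCall) -> str:
    # Illustrative stub, not a real weather lookup.
    return f"weather for {call.argument}"


def collect_outputs_flat(calls: List[ToolCall]) -> Dict[str, str]:
    """Store results keyed by tool_id only (the behaviour described as buggy)."""
    outputs: Dict[str, str] = {}
    for call in calls:
        # A second call to the same tool replaces the first result.
        outputs[call.tool_id] = run_tool(call)
    return outputs


calls = [
    ToolCall(id="call_1", tool_id="get-weather", argument="LA"),
    ToolCall(id="call_2", tool_id="get-weather", argument="Chicago"),
]
flat = collect_outputs_flat(calls)

# Ids of calls whose result is no longer present in the collected outputs.
missing = [c.id for c in calls if run_tool(c) not in flat.values()]
```

With two calls to the same tool, only the last result survives, so one call is left without an output when the run completes.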

Change details

The fix consists of adding a second level to the dictionary of tool calls (which contains the return values), so instead of being (pseudocode) tools[tool_id] it becomes tools[tool_id][call_id]. This way the return value of every call is retained.

The key change is that state.temp.action_outputs[command.action_id] becomes state.temp.action_outputs[command.action][command.action_id] in teams/ai/ai.py.
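The two-level indexing described above can be sketched with plain dictionaries. This is an illustration under assumptions, not the code in the diff: `collect_outputs_nested` and the sample ids and values are hypothetical, and the real change operates on state.temp.action_outputs rather than a standalone dict.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# (call_id, tool_id, output) triples; the values are illustrative only.
CallResult = Tuple[str, str, str]


def collect_outputs_nested(results: List[CallResult]) -> Dict[str, Dict[str, str]]:
    """Store results as outputs[tool_id][call_id] so no call overwrites another."""
    outputs: Dict[str, Dict[str, str]] = defaultdict(dict)
    for call_id, tool_id, output in results:
        outputs[tool_id][call_id] = output
    return dict(outputs)


results = [
    ("call_1", "get-weather", "result for LA"),
    ("call_2", "get-weather", "result for Chicago"),
]
nested = collect_outputs_nested(results)

# Every call id can now be matched to its own output.
all_call_ids = sorted(
    call_id for per_tool in nested.values() for call_id in per_tool
)
```

Because call ids are unique within a run, the inner dictionary never has a key collision, and each call keeps its own output even when the tool name repeats.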

Attestation Checklist

My code follows the style guidelines of this project
I have checked for/fixed spelling, linting, and other errors
I have commented my code for clarity
I have made corresponding changes to the documentation (updating the doc strings in the code is sufficient)
My changes generate no new warnings
I have added tests that validate my changes and provide sufficient test coverage. I have tested with:
- Local testing
- E2E testing in Teams
New and existing unit tests pass locally with my changes

Add a sublevel to the list of tools called as part of a run (this list is stored in assistants_planner.py as `tool_map: dict`), so that result values are not overwritten by subsequent calls to the same tool within the same run. The current implementation did not account for multiple calls to the same tool, because it did not keep track of the call id (`call.id`), which is different for each call regardless of the tool. The list was indexed only by `tool_id`, so subsequent calls to the same tool overwrote the result of the previous call. The fix consists of adding a second level to the dictionary of tool calls (which contains the return values), so instead of being (pseudocode) `tools[tool_id]` it becomes `tools[tool_id][call_id]`. This way the return value of every call is retained.

andres-swax · 2024-12-10T21:15:24Z

@microsoft-github-policy-service agree

corinagum · 2024-12-11T03:16:11Z

@andres-swax thanks for the contribution! Could you please fix the failing tests and add a new one for the multiple tool calls scenario?

andres-swax · 2024-12-11T17:57:32Z

> @andres-swax thanks for the contribution! Could you please fix the failing tests and add a new one for the multiple tool calls scenario?

Hi, I'll do my best. I saw those failures yesterday and, honestly, I could not understand them or what I could do about them.

I felt the failures were in the Python libraries and was hoping the messages were just noise.

My dev-ops software and workflow knowledge is still at the Visual SourceSafe level or so. If I cannot make it work, I'll ask for the PR to be closed. In the meantime I'll give it a try over the next couple of days.

corinagum · 2024-12-11T18:01:48Z

@andres-swax No rush, and we really appreciate your contribution! We may be able to keep the PR open and have someone help out with the tests, but I can't guarantee how soon that will be. Our bandwidth is pretty tight right now. Please add the tests if you can and feel free to ask questions!

andres-swax · 2024-12-26T23:02:10Z

> ...and we really appreciate your contribution! We may be able to keep the PR open and have someone help out with the tests, but I can't guarant...

I tried to see how I could update the tests but, to be honest, I couldn't figure it out. If necessary, please do close the PR so the bug can be taken care of by someone else; it's kind of important to fix it soon.

andres-swax requested review from aacebo, corinagum, lilyydu, singhk97 and rajan-chari as code owners December 10, 2024 21:13

Merge branch 'main' into 1705-py-assistants-planner-concurrent-calls

5fa98f7

andres-swax requested a review from heyitsaamir as a code owner January 6, 2025 16:17

andres-swax commented Dec 10, 2024 • edited by corinagum