[#432] Groq Provider tool call tweaks #811

aidando73 · 2025-01-17T22:08:15Z

What does this PR do?

Follow up for @ashwinb's comments in #630

Contributes to issue (Add Remote Inference Adapter for Groq #432)

Test Plan

Environment

export GROQ_API_KEY=<api-key>

# Create environment if not already
conda create --name llamastack-groq python=3.10
conda activate llamastack-groq

wget https://raw.githubusercontent.com/aidando73/llama-stack/9165502582cd7cb178bc1dcf89955b45768ab6c1/build.yaml
wget https://raw.githubusercontent.com/meta-llama/llama-stack/918172c7fa92522c9ebc586bdb4f386b1d9ea224/run.yaml

# Build
pip install -e . && llama stack build --config ./build.yaml --image-type conda

# Activate built environment
conda activate llamastack-groq

# Test deps
pip install pytest pytest_html pytest_asyncio

Unit tests

# Setup
conda activate llamastack-groq
pytest llama_stack/providers/tests/inference/groq/test_groq_utils.py -vv -k groq -s

# Result
llama_stack/providers/tests/inference/groq/test_groq_utils.py .......................

========================================= 23 passed, 11 warnings in 0.06s =========================================

Integration tests

# Tests
 pytest llama_stack/providers/tests/inference/test_text_inference.py -k groq -s

# Results
___________________________ TestInference.test_chat_completion_with_tool_calling[-groq] ___________________________
llama_stack/providers/tests/inference/test_text_inference.py:403: in test_chat_completion_with_tool_calling
    assert len(message.tool_calls) > 0
E   assert 0 > 0
E    +  where 0 = len([])
E    +    where [] = CompletionMessage(role='assistant', content='<function=get_weather>{"location": "San Francisco, CA"}', stop_reason=<StopReason.end_of_turn: 'end_of_turn'>, tool_calls=[]).tool_calls
============================================= short test summary info =============================================
FAILED llama_stack/providers/tests/inference/test_text_inference.py::TestInference::test_chat_completion_with_tool_calling[-groq] - assert 0 > 0
======================== 1 failed, 3 passed, 5 skipped, 99 deselected, 7 warnings in 2.13s ========================

(One failure as expected from 3.2 3B - re: #630 (comment))

Sources

Please link relevant resources if necessary.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Ran pre-commit to handle lint / formatting issues.
Read the contributor guideline,
Pull Request section?
Updated relevant documentation.
Wrote necessary unit or integration tests.

aidando73 · 2025-01-17T23:18:14Z

llama_stack/providers/remote/inference/groq/groq_utils.py

+
+    call_id: str
+    tool_name: str
+    arguments: str


ToolCall.arguments must be a Dict, so we can't keep it within ToolCall

@json_schema_type class ToolCall(BaseModel): call_id: str tool_name: Union[BuiltinTool, str] arguments: Dict[str, RecursiveType]

aidando73 · 2025-01-17T23:21:59Z

llama_stack/providers/tests/inference/test_text_inference.py

-            and "Llama-3.2" in inference_model
-        ):
-            # TODO(aidand): Remove this skip once Groq's tool calling for Llama3.2 works better
-            pytest.skip("Groq's tool calling for Llama3.2 doesn't work very well")


RE:#630 (comment)

aidando73 requested review from ashwinb, yanxi0830, hardikjshah, dltn, raghotham, dineshyv, vladimirivic and sixianyi0721 as code owners January 17, 2025 22:08

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jan 17, 2025

aidando73 marked this pull request as draft January 17, 2025 22:48

aidando73 force-pushed the aidand-groq-tool-call-tweaks branch from 3ceb004 to 0c4d4a4 Compare January 17, 2025 23:16

PR tool call followups

76e08cf

aidando73 force-pushed the aidand-groq-tool-call-tweaks branch from 0c4d4a4 to 76e08cf Compare January 17, 2025 23:17

aidando73 commented Jan 17, 2025

View reviewed changes

aidando73 marked this pull request as ready for review January 17, 2025 23:20

aidando73 commented Jan 17, 2025

View reviewed changes

aidando73 mentioned this pull request Jan 17, 2025

[#432] Add Groq Provider - tool calls #630

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[#432] Groq Provider tool call tweaks #811

[#432] Groq Provider tool call tweaks #811

aidando73 commented Jan 17, 2025 •

edited

Loading

aidando73 Jan 17, 2025 •

edited

Loading

aidando73 Jan 17, 2025

[#432] Groq Provider tool call tweaks #811

Are you sure you want to change the base?

[#432] Groq Provider tool call tweaks #811

Conversation

aidando73 commented Jan 17, 2025 • edited Loading

What does this PR do?

Test Plan

Sources

Before submitting

aidando73 Jan 17, 2025 • edited Loading

Choose a reason for hiding this comment

aidando73 Jan 17, 2025

Choose a reason for hiding this comment

aidando73 commented Jan 17, 2025 •

edited

Loading

aidando73 Jan 17, 2025 •

edited

Loading