[DOCS] Adds note about differences between chat completion and stream API (elastic#120636)
szabosteve committed Jan 23, 2025
1 parent 229a4ae commit 70f0961
Showing 2 changed files with 9 additions and 1 deletion.
6 changes: 5 additions & 1 deletion docs/reference/inference/chat-completion-inference.asciidoc
@@ -34,9 +34,13 @@ However, if you do not plan to use the {infer} APIs to use these models or if yo
The chat completion {infer} API enables real-time responses for chat completion tasks by delivering answers incrementally, reducing response times during computation.
It only works with the `chat_completion` task type for `openai` and `elastic` {infer} services.


[NOTE]
====
-The `chat_completion` task type is only available within the _unified API and only supports streaming.
+* The `chat_completion` task type is only available within the _unified API and only supports streaming.
+* The Chat completion {infer} API and the Stream {infer} API differ in their response structure and capabilities.
+The Chat completion {infer} API provides more comprehensive customization options through more fields and function calling support.
+If you use the `openai` service or the `elastic` service, use the Chat completion {infer} API.
====

[discrete]
4 changes: 4 additions & 0 deletions docs/reference/inference/stream-inference.asciidoc
@@ -40,6 +40,10 @@ However, if you do not plan to use the {infer} APIs to use these models or if yo
The stream {infer} API enables real-time responses for completion tasks by delivering answers incrementally, reducing response times during computation.
It only works with the `completion` and `chat_completion` task types.

+The Chat completion {infer} API and the Stream {infer} API differ in their response structure and capabilities.
+The Chat completion {infer} API provides more comprehensive customization options through more fields and function calling support.
+If you use the `openai` service or the `elastic` service, use the Chat completion {infer} API.

[NOTE]
====
include::inference-shared.asciidoc[tag=chat-completion-docs]