✨ [Widgets] Enable streaming in the conversational widget #486

Merged: 22 commits into main, Feb 22, 2024

Conversation

@SBrandeis (Contributor) commented on Feb 16, 2024:

Linked to #360 and #410.

Should unlock the conversational widget on Mistral if I'm not mistaken?

TL;DR

  • Leverage the inference types from @huggingface/tasks to type the input and output of the inference client
  • Use the inference client to call the serverless Inference API
  • Use the streaming API when the model supports it (see the sketch below)

@SBrandeis SBrandeis requested a review from coyotte508 February 16, 2024 16:24
@SBrandeis (Contributor Author) commented:
Screen recording showcase (first answer probably cached)

Screen.Recording.2024-02-16.at.17.30.31.mov

@julien-c (Member) left a comment:

wow, thanks for working on this.

This is starting to all fit together very nicely!

Base automatically changed from 8578-switch-conversational-to-text-generation to main February 20, 2024 09:31
@SBrandeis SBrandeis force-pushed the chat-widget-streaming branch from bfc7fbe to 79fcfa3 on February 20, 2024 10:17
@SBrandeis SBrandeis changed the title (wip) Enable streaming in the conversational widget ✨ [Widgets] Enable streaming in the conversational widget Feb 20, 2024
@SBrandeis SBrandeis marked this pull request as ready for review February 20, 2024 14:10
</a>
{#if $tgiSupportedModels?.has(model.id)}
<p class="text-xs text-gray-400">
Streaming with <a href="https://huggingface.co/docs/text-generation-inference" class="underline">TGI</a>
A Collaborator commented:

Suggested change
Streaming with <a href="https://huggingface.co/docs/text-generation-inference" class="underline">TGI</a>
Streaming with <a href="https://huggingface.co/docs/text-generation-inference" class="hover:underline">TGI</a>

Changed it to hover:underline.

@@ -85,7 +86,7 @@ export interface TextGenerationStreamOutput {
  * Use to continue text from a prompt. Same as `textGeneration` but returns generator that can be read one token at a time
  */
 export async function* textGenerationStream(
-	args: TextGenerationArgs,
+	args: BaseArgs & TextGenerationInput,
 	options?: Options
 ): AsyncGenerator<TextGenerationStreamOutput> {
@mishig25 (Collaborator) commented on Feb 21, 2024:
Should TextGenerationStreamOutput live in @huggingface/tasks/src/tasks/text-generation/inference just like TextGenerationInput & TextGenerationOutput?

@SBrandeis (Contributor Author) replied:

Good question - cc @Wauplin with whom we discussed that previously

The current philosophy is to not type the streaming mode, because it's transfer-specific, not inference-specific.

@Wauplin (Contributor) replied:

Indeed, we mentioned it here once: #468 (comment).

As a matter of fact, I came to the conclusion today that we should specify the stream parameter and the streamed output in our JS specs. I am currently starting to use the generated types in Python (see the ongoing PR), and for now I've kept text_generation apart since I'm missing TextGenerationStreamResponse (defined here, but I don't want to mix the auto-generated types with the previous definitions). I agree it's more "transfer-specific" than "inference-specific", but setting stream=True modifies the output format, so we need to document that somewhere.

…getWrapper/WidgetWrapper.svelte

Co-authored-by: Mishig <[email protected]>
@gary149 gary149 requested a review from krampstudio February 21, 2024 17:38
@gary149 (Collaborator) left a comment:

Really nice!

@krampstudio (Collaborator) left a comment:

Nice. Just one optional comment.

Comment on lines 39 to 41
if (!$tgiSupportedModels) {
$tgiSupportedModels = await getTgiSupportedModels(apiUrl);
}
@krampstudio (Collaborator) commented on Feb 21, 2024:

Do we need it for every widget?

If yes, it's OK to initialize it here; otherwise, you can do something like this:

in store.js

import { get, writable } from "svelte/store";

const tgiSupportedModels = writable<Set<string> | undefined>(undefined);

export async function getTgiSupportedModels(apiUrl: string) {
	if (!get(tgiSupportedModels)) {
		const response = await fetch(`${apiUrl}/framework/text-generation-inference`);
		const output = await response.json();
		if (response.ok) {
			tgiSupportedModels.set(
				new Set(
					(output as { model_id: string; task: string }[])
						.filter(({ task }) => task === "text-generation")
						.map(({ model_id }) => model_id)
				)
			);
		}
	}
	return tgiSupportedModels;
}

So inside a widget you always get the store with const tgiSupportedModels = await getTgiSupportedModels(apiUrl), and the data is fetched only when needed.

@SBrandeis (Contributor Author) replied:

so nice! thank you!

@SBrandeis SBrandeis requested a review from mishig25 February 22, 2024 14:46
</a>
<div class="flex gap-4 items-center mb-1.5">
<a
class={TASKS_DATA[task] ? "hover:underline" : undefined}
A Collaborator commented:

Suggested change
class={TASKS_DATA[task] ? "hover:underline" : undefined}
class:hover:underline={TASKS_DATA[task]}

Didn't test. Can we use the new Svelte syntax?

@mishig25 (Collaborator) left a comment:

lgtm!

@SBrandeis SBrandeis force-pushed the chat-widget-streaming branch from ae2b965 to 78196e5 on February 22, 2024 16:05
@SBrandeis SBrandeis merged commit bea807a into main Feb 22, 2024
2 checks passed
@SBrandeis SBrandeis deleted the chat-widget-streaming branch February 22, 2024 16:14
mishig25 pushed a commit that referenced this pull request May 16, 2024
### Fix `Model Loading` behaviour in Conversational Widget.

Follow-up to #486.
Due to #486, the Conversational widget differs from the other widgets in how it calls the HF Inference API: the Conversational widget uses the `@huggingface/inference` client while the other widgets use a regular `fetch`.

The equivalent of the lines below was missing in the Conversational widget:
https://github.com/huggingface/huggingface.js/blob/f2e9ce3c11822910d293ae3455e22bad093026a3/packages/widgets/src/lib/components/InferenceWidget/widgets/TextGenerationWidget/TextGenerationWidget.svelte#L144-L149

This PR adds that missing function.

#### Screencast


https://github.com/huggingface/huggingface.js/assets/11827707/2201c471-964f-4943-8455-8a51800ade12

Note: in the demo above, `mrfakename/refusal-old` is not supported by TGI, so the output is not streamed.