How best to animate/display streaming LLM messages?

High level:

I am trying to display LLM-generated messages as they are generated. LLM generation tends to be slow, so revealing the message incrementally, line by line as it arrives, is a UI trick most sites use so that the user doesn’t feel like they are waiting forever.

I am using a custom model, not OpenAI, so I can’t use the WeWeb OpenAI plugin.

What is the best way to approach displaying this streaming text in WeWeb?

My stack: Supabase collections in WeWeb, plus API calls to my own coded server (I have full flexibility there).

Some things I’ve been trying:

[Trying to see if real streaming could work]

  1. Supabase realtime tables + dynamic collection: if I manually update the collection data, the change doesn’t appear in my WeWeb app until I manually refresh the page or collection. Is this how it is supposed to work, or should the data refresh itself with the change? (See the sketch below this item.)
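
(For reference, this is what I understand a realtime push to look like with plain supabase-js v2 outside WeWeb: changes only arrive if something subscribes to a channel, so a one-time collection fetch would not see them. The table name below is just an example.)

```js
// Realtime push with supabase-js v2; changes arrive via a channel
// subscription, not via the one-time fetch a collection does.
// The URL, key, and table name are placeholders.
import { createClient } from '@supabase/supabase-js'

const supabase = createClient('https://your-project.supabase.co', 'your-anon-key')

supabase
  .channel('messages-stream')
  .on(
    'postgres_changes',
    { event: 'UPDATE', schema: 'public', table: 'messages' },
    (payload) => {
      // payload.new holds the fresh row; push it into whatever the UI binds to.
      console.log('row updated:', payload.new)
    }
  )
  .subscribe()
```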

[Trying a simpler task; how would I animate lines appearing one by one with a delay?]

  1. Conditional rendering: I can get a simple conditional render to work, e.g. return false and the element doesn’t display. However, it doesn’t render with a delay when a simple timeout is put in the JS function (see the sketch after this list).

  2. Is there an easy way to animate display from none → block? It isn’t listed in transitions, and custom CSS put in the box gets removed when I try it in preview mode. (Also covered in the sketch after this list.)
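
(For reference, I suspect the timeout fails because the binding formula returns synchronously, before the setTimeout callback ever runs. And display itself isn’t an animatable CSS property, so I assume any solution fades opacity instead. In plain JS, the effect I’m after would look something like this, with a hypothetical selector:)

```js
// Sketch: reveal an element after a delay by animating opacity, since the
// CSS `display` property itself cannot be transitioned. The selector is
// hypothetical; adapt it to however you target the element in WeWeb.
function revealAfterDelay(selector, delayMs = 500) {
  setTimeout(() => {
    const el = document.querySelector(selector)
    if (!el) return
    el.style.display = 'block' // make it take up space first
    el.animate(
      [{ opacity: 0 }, { opacity: 1 }], // fade in via the Web Animations API
      { duration: 300, easing: 'ease-out', fill: 'forwards' }
    )
  }, delayMs)
}

revealAfterDelay('#next-line', 800)
```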

Thank you in advance for the help! :pray::sparkles::sparkles:

@khara I am currently trying to figure this out as well, as we are running the OpenAI Assistants API, all models from AWS Bedrock, and now also Gemini Pro from Google Cloud.

@weweb-team Is the stream=true option that is available for most chat endpoints (GPT-4, Claude, Cohere, Jurassic, Titan, Llama) possible to use in WeWeb natively, or does this streaming functionality need to be implemented manually through an NPM package, etc.?
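
For context, here is roughly what implementing it manually would involve: with stream=true these endpoints return Server-Sent Events, which can be read chunk by chunk via fetch in the browser or in a custom action. A minimal sketch, where the URL, key, and body shape are placeholders:

```js
// Minimal sketch of manually consuming a stream=true chat endpoint.
// The URL, API key, model, and body shape are placeholders; adapt them
// to your provider.
async function streamChat(prompt, onChunk) {
  const res = await fetch('https://api.example.com/v1/chat/completions', {
    method: 'POST',
    headers: {
      'Content-Type': 'application/json',
      Authorization: 'Bearer YOUR_API_KEY',
    },
    body: JSON.stringify({
      model: 'your-model',
      stream: true,
      messages: [{ role: 'user', content: prompt }],
    }),
  })

  const reader = res.body.getReader()
  const decoder = new TextDecoder()
  while (true) {
    const { done, value } = await reader.read()
    if (done) break
    onChunk(decoder.decode(value, { stream: true })) // raw SSE chunk ("data: {...}")
  }
}
```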

Right! It seems like something a lot of people probably need right now? :crossed_fingers: Fingers crossed that there’s already a fast way to do this in WeWeb, but I’m wondering if it’s necessary to roll a custom component.

Yeah, looking for the same solution. It would be nice to have it natively in WeWeb.

Make a feature request! Public Roadmap | WeWeb

https://platform.openai.com/docs/api-reference/streaming

@carrano_dot_dev thanks! I’m actually wondering about the frontend display for dynamically updating the streamed data, though; I already have the server part.

I haven’t been able to get WeWeb to automatically update Supabase collections when the data changes, and I’m wondering if there is a way to do that? Or, alternatively, if there is a way to stream directly to a WeWeb component.

@weweb-team couple of questions:

  1. Is there a way to do this already in WeWeb that I’ve missed? (Dynamically display text/data as it is streamed in)

  2. Or if not, is this on the roadmap already for the next month? If not, I’ll look into building a custom component, but I was hoping to avoid learning Vue!

I’d imagine you would just create a text variable and bind it to a text element on the frontend. Then you could run a workflow that updates that variable as the data from the API is streamed in.
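
A minimal sketch of what that custom JS step could look like, assuming your server exposes the stream over fetch. wwLib.wwVariable.updateValue is WeWeb’s helper for setting a variable from custom JS (worth double-checking against the current docs), and the uid is a placeholder:

```js
// Sketch: append streamed chunks from the server to a WeWeb text variable.
// The uid is a placeholder; copy the real one from the editor.
const TEXT_VARIABLE_UID = 'your-text-variable-uid'

async function streamToVariable(url) {
  const res = await fetch(url) // your server endpoint that streams the LLM output
  const reader = res.body.getReader()
  const decoder = new TextDecoder()
  let text = ''
  while (true) {
    const { done, value } = await reader.read()
    if (done) break
    text += decoder.decode(value, { stream: true })
    wwLib.wwVariable.updateValue(TEXT_VARIABLE_UID, text) // bound element re-renders
  }
}
```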

Ah, rad :zap: Much gratitude, many thanks! I’ve been overly focused on updating the collection itself.

So something like:

  1. Fake a while loop:
    Loop until condition is met - #4 by clncsports
  2. Insert a temporary text element and update it as the API streams
  3. When the API is done streaming, hide the temporary text element + add the contents to the collection, then refresh the collection (see the sketch after this list)
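
(Rough sketch of step 3 on the server, assuming a `messages` table and placeholder credentials:)

```js
// Once the stream finishes, persist the full message and let the frontend
// refresh. Table/column names are assumptions.
import { createClient } from '@supabase/supabase-js'

const supabase = createClient('https://your-project.supabase.co', 'your-service-key')

async function finishStream(fullText, conversationId) {
  await supabase
    .from('messages')
    .insert({ conversation_id: conversationId, content: fullText })
  // ...then hide the temporary text element and refresh the collection in WeWeb.
}
```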

(For future reference, I’m still curious if there is a way to get collections to update in realtime that I might have missed!)

Hmm ok - so far as I can tell, the WeWeb text won’t update correctly with the streaming text; it sort of updates all at once when the API call finishes.

There might be a way to read the response stream directly from WeWeb’s workflow output object using JS, but I couldn’t find it on a first pass.

lol, just realized Supabase was not updating because I didn’t have realtime enabled. Realtime updates are working there, so I’m going to just try streaming directly to Supabase for now.

I also looked into creating a coded component to handle the streaming. Setup was pretty smooth, but then I realized you need to be on a higher-tier plan to actually use coded components.

Ok, final update from me. For anyone else who needs a quick solution here: streaming directly to a Supabase table with realtime updates works decently well in practice.

This does use more of the realtime message quota on the free tier, but Supabase is quite generous with that allowance. I ended up chunking together some of the streamed bits to save on DB resources, so it isn’t hitting the database constantly (see the sketch below).
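
The chunking amounts to buffering tokens server-side and flushing to the row on an interval instead of on every token. A sketch, where the table/column names and the 300 ms interval are assumptions:

```js
// Buffer streamed tokens and flush to the row periodically, so each flush
// costs one realtime message instead of one per token.
import { createClient } from '@supabase/supabase-js'

const supabase = createClient('https://your-project.supabase.co', 'your-service-key')

let buffer = ''
let lastFlush = 0

async function onToken(token, rowId) {
  buffer += token
  const now = Date.now()
  if (now - lastFlush > 300) {
    lastFlush = now
    await supabase.from('messages').update({ content: buffer }).eq('id', rowId)
  }
}

async function onDone(rowId) {
  // Final flush so the row always ends with the complete message.
  await supabase.from('messages').update({ content: buffer, done: true }).eq('id', rowId)
}
```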

It would be nice to stream directly to the client, so I’m curious to know whether this is on the roadmap at all, if anyone from @weweb-team sees this.

I am doing this in my app :slight_smile: What I did was create a text variable, then use WeWeb Copilot to generate JavaScript that iterates over the value I want displayed one word at a time, with a 200 ms delay, appending each word to the text variable.

It works really well; I’ve built a form app that feels like a conversation as you respond to the questions.
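
Roughly, the generated script boils down to something like this (the variable uid is a placeholder):

```js
// Word-by-word reveal: append one word every 200 ms to a WeWeb text
// variable that is bound to a text element.
async function typewriter(fullText, variableUid, delayMs = 200) {
  const words = fullText.split(' ')
  let shown = ''
  for (const word of words) {
    shown += (shown ? ' ' : '') + word
    wwLib.wwVariable.updateValue(variableUid, shown) // bound element re-renders
    await new Promise((resolve) => setTimeout(resolve, delayMs))
  }
}
```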

Oh wild, thanks for sharing! Great to know that it is possible to stream directly to the client instead of the DB :slight_smile:

Will have to try that and see if performance is better than what I have. Happy new year :sparkles:

Are you using the WeWeb OpenAI plugin? How are you getting the OpenAI output to show up in the text variable?

I use the OpenAI plugin to generate the response and then update a variable directly from the OpenAI output using a workflow. I update the database later, based on another trigger, so that the user doesn’t have to wait for the form to retrieve the OpenAI response, store the data, retrieve the value, etc.
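
In sketch form, with placeholder names (and `supabase` standing in for however you write to the database):

```js
// "Display now, persist later" split: saving never blocks the conversation.
function showResponse(text) {
  wwLib.wwVariable.updateValue('answer-variable-uid', text) // user sees it immediately
}

async function persistResponse(text, formId) {
  // Runs on a separate trigger (e.g. when the user moves to the next question).
  await supabase.from('responses').insert({ form_id: formId, answer: text })
}
```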