Enum LlmStreamEvent

pub enum LlmStreamEvent {
    TextDelta {
        content: String,
    },
    ReasoningDelta {
        content: String,
    },
    ToolCallDelta {
        index: usize,
        id: Option<String>,
        name: Option<String>,
        arguments: Option<String>,
    },
    PromptProgress {
        processed: u32,
        total: u32,
        cached: u32,
        time_ms: u64,
    },
    Done {
        finish_reason: String,
    },
}

A single event produced by a streaming LLM response.

These low-level events are the currency of crate::ports::LlmCompletionPort; they are parsed by adapter crates from raw SSE frames and handed to gglib-agent’s stream collector, which:

Forwards TextDelta items directly to the caller’s AgentEvent channel so text appears in real time.
Accumulates ToolCallDelta fragments until the stream ends, then assembles them into ToolCall values.
Waits for Done before triggering tool execution.
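The three collector responsibilities above can be sketched roughly as follows. This is an illustrative outline, not gglib-agent's actual collector: the enum is a local mirror of the one documented on this page so the sketch compiles on its own, `Collected` and `drain_stream` are hypothetical names, and a plain `std::sync::mpsc::Sender<String>` stands in for the caller's AgentEvent channel.

```rust
use std::sync::mpsc::Sender;

// Local mirror of the enum documented on this page (assumption: kept in sync by hand).
#[derive(Clone, Debug, PartialEq, Eq)]
pub enum LlmStreamEvent {
    TextDelta { content: String },
    ReasoningDelta { content: String },
    ToolCallDelta {
        index: usize,
        id: Option<String>,
        name: Option<String>,
        arguments: Option<String>,
    },
    PromptProgress { processed: u32, total: u32, cached: u32, time_ms: u64 },
    Done { finish_reason: String },
}

/// Hypothetical result of draining one stream.
#[derive(Debug, PartialEq, Eq)]
pub struct Collected {
    /// Tool-call fragments, buffered in arrival order.
    pub tool_deltas: Vec<LlmStreamEvent>,
    /// `None` if the stream ended without a `Done` item.
    pub finish_reason: Option<String>,
}

/// Hypothetical collector loop: forwards text immediately, buffers tool-call
/// fragments, and returns as soon as `Done` arrives.
pub fn drain_stream<I>(events: I, text_out: &Sender<String>) -> Collected
where
    I: IntoIterator<Item = LlmStreamEvent>,
{
    let mut tool_deltas = Vec::new();
    for event in events {
        match event {
            LlmStreamEvent::TextDelta { content } => {
                // Forward right away; a dropped receiver is ignored in this sketch.
                let _ = text_out.send(content);
            }
            delta @ LlmStreamEvent::ToolCallDelta { .. } => tool_deltas.push(delta),
            LlmStreamEvent::Done { finish_reason } => {
                return Collected { tool_deltas, finish_reason: Some(finish_reason) };
            }
            // Reasoning and progress events are out of scope for this sketch.
            LlmStreamEvent::ReasoningDelta { .. } | LlmStreamEvent::PromptProgress { .. } => {}
        }
    }
    Collected { tool_deltas, finish_reason: None }
}
```

Returning only when `Done` arrives keeps tool execution from starting on a partially streamed call; a stream that ends without `Done` is reported with `finish_reason: None` so the caller can treat it as non-conforming.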

Variants

TextDelta

An incremental text fragment from the model’s response.

Fields

content: String

The new text fragment (append to the running content buffer).

ReasoningDelta

An incremental reasoning/thinking fragment (CoT tokens).

Produced by reasoning-capable models (e.g. DeepSeek R1, QwQ) when llama-server is started with --reasoning-format deepseek. The runtime adapter maps delta["reasoning_content"] frames to this variant; the stream collector forwards them as AgentEvent::ReasoningDelta and accumulates them in a separate buffer that is never sent back to the LLM as context.

Fields

content: String

The new reasoning fragment (append to the current reasoning buffer).
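One way to picture the separate reasoning buffer is sketched below. `TurnBuffers` and its methods are hypothetical names chosen for illustration; the real collector's internal layout is not documented here.

```rust
/// Hypothetical per-turn accumulator: visible text and reasoning never share a buffer.
#[derive(Debug, Default)]
pub struct TurnBuffers {
    /// Accumulated `TextDelta` fragments.
    pub content: String,
    /// Accumulated `ReasoningDelta` fragments.
    pub reasoning: String,
}

impl TurnBuffers {
    /// Append a `TextDelta` fragment.
    pub fn push_text(&mut self, fragment: &str) {
        self.content.push_str(fragment);
    }

    /// Append a `ReasoningDelta` fragment.
    pub fn push_reasoning(&mut self, fragment: &str) {
        self.reasoning.push_str(fragment);
    }

    /// Text that may be sent back to the model as context.
    /// Reasoning is deliberately left out.
    pub fn context_text(&self) -> &str {
        &self.content
    }
}
```

Keeping two buffers means the reasoning can still be shown or logged, while the string used to build the next request never contains it.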

ToolCallDelta

An incremental fragment of a tool-call request.

The adapter crate streams these before the model has finished generating the full arguments JSON. The stream collector accumulates all deltas for a given index into a single ToolCall.

Fields

index: usize

Zero-based index of the tool call within the current response.

id: Option<String>

Call identifier (only present in the first delta for this index).

name: Option<String>

Tool name (only present in the first delta for this index).

arguments: Option<String>

Partial arguments JSON string fragment (accumulate with push_str).
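The per-index accumulation described for these fields might look like the following sketch. `ToolCallFragment` simply mirrors the fields of the ToolCallDelta variant, and `AssembledToolCall` and `assemble` are hypothetical; the crate's real ToolCall type may have a different shape.

```rust
use std::collections::BTreeMap;

/// Mirrors the fields of the `ToolCallDelta` variant (local stand-in for this sketch).
#[derive(Clone, Debug)]
pub struct ToolCallFragment {
    pub index: usize,
    pub id: Option<String>,
    pub name: Option<String>,
    pub arguments: Option<String>,
}

/// Hypothetical assembled call; the real `ToolCall` type may differ.
#[derive(Clone, Debug, Default, PartialEq, Eq)]
pub struct AssembledToolCall {
    pub id: String,
    pub name: String,
    /// Concatenated JSON text; only meaningful once the stream has ended.
    pub arguments: String,
}

/// Merge fragments by `index`, returning the calls in ascending index order.
pub fn assemble(
    fragments: impl IntoIterator<Item = ToolCallFragment>,
) -> Vec<AssembledToolCall> {
    let mut by_index: BTreeMap<usize, AssembledToolCall> = BTreeMap::new();
    for frag in fragments {
        let call = by_index.entry(frag.index).or_default();
        // `id` and `name` normally arrive once, in the first delta for an index.
        if let Some(id) = frag.id {
            call.id = id;
        }
        if let Some(name) = frag.name {
            call.name = name;
        }
        // Argument text arrives in pieces and is concatenated in arrival order.
        if let Some(args) = frag.arguments {
            call.arguments.push_str(&args);
        }
    }
    by_index.into_values().collect()
}
```

A `BTreeMap` keyed by `index` tolerates interleaved fragments from parallel tool calls and yields them back in index order. The concatenated `arguments` string should only be parsed as JSON after the stream has finished, since any intermediate state is usually an incomplete document.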

PromptProgress

Prompt-processing progress from llama-server.

Emitted when the request includes return_progress: true. These frames arrive during the pre-fill phase (before any TextDelta), giving real-time visibility into how far along token ingestion is.

Fields

processed: u32

Number of tokens processed so far.

total: u32

Total number of tokens in the prompt.

cached: u32

Number of tokens served from KV cache (already processed).

time_ms: u64

Elapsed wall-clock time in milliseconds since processing began.
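As a sketch of how a caller might turn these four fields into a progress indicator, the helpers below compute a completion fraction and a rough pre-fill throughput. Both function names are hypothetical. The throughput calculation assumes that `processed` includes the `cached` tokens, so only `processed - cached` tokens were actually evaluated in `time_ms`; that reading is an assumption and should be checked against the llama-server version in use.

```rust
/// Fraction of the prompt ingested so far, clamped to at most 1.0.
/// Returns `None` when `total` is zero, since the ratio is undefined.
pub fn progress_fraction(processed: u32, total: u32) -> Option<f64> {
    if total == 0 {
        return None;
    }
    Some((f64::from(processed) / f64::from(total)).min(1.0))
}

/// Rough pre-fill throughput in tokens per second.
/// Assumption: `processed` counts cached tokens too, so they are subtracted
/// before dividing by the elapsed time. Returns `None` when no time has elapsed.
pub fn tokens_per_second(processed: u32, cached: u32, time_ms: u64) -> Option<f64> {
    if time_ms == 0 {
        return None;
    }
    let freshly_evaluated = processed.saturating_sub(cached);
    Some(f64::from(freshly_evaluated) * 1000.0 / time_ms as f64)
}
```

Both helpers return `Option` so that an early frame with `total == 0` or `time_ms == 0` does not produce a division by zero or a misleading infinite rate.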

Done

Signals the end of the stream.

Every conforming stream must end with exactly one Done item.

Fields

finish_reason: String

The OpenAI-compatible finish reason (e.g. "stop", "tool_calls", "length").
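Because `finish_reason` is a free-form string, a consumer may prefer to map it onto a small enum before branching on it. The sketch below is one possible approach; `FinishReason` and its methods are hypothetical and not part of this crate, and only the three example values listed above are recognised.

```rust
/// Hypothetical typed view of the `finish_reason` string.
#[derive(Clone, Debug, PartialEq, Eq)]
pub enum FinishReason {
    Stop,
    ToolCalls,
    Length,
    /// Any value this sketch does not recognise, preserved verbatim.
    Other(String),
}

impl FinishReason {
    /// Map the raw string onto a variant; unknown values are kept, not rejected.
    pub fn parse(raw: &str) -> Self {
        match raw {
            "stop" => Self::Stop,
            "tool_calls" => Self::ToolCalls,
            "length" => Self::Length,
            other => Self::Other(other.to_owned()),
        }
    }

    /// True when the model ended its turn by requesting tool calls.
    pub fn requests_tools(&self) -> bool {
        matches!(self, Self::ToolCalls)
    }
}
```

Keeping an `Other` variant avoids failing on finish reasons that a particular backend adds beyond the three shown here.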

Trait Implementations

impl Clone for LlmStreamEvent

fn clone(&self) -> LlmStreamEvent

fn clone_from(&mut self, source: &Self)

impl Debug for LlmStreamEvent

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl PartialEq for LlmStreamEvent

fn eq(&self, other: &LlmStreamEvent) -> bool

fn ne(&self, other: &Rhs) -> bool

impl Eq for LlmStreamEvent

impl StructuralPartialEq for LlmStreamEvent

Auto Trait Implementations

impl Freeze for LlmStreamEvent

impl RefUnwindSafe for LlmStreamEvent

impl Send for LlmStreamEvent

impl Sync for LlmStreamEvent

impl Unpin for LlmStreamEvent

impl UnwindSafe for LlmStreamEvent

Blanket Implementations

impl<T> Any for T where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for T where T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for T where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> CloneToUninit for T where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

impl<T> From<T> for T

fn from(t: T) -> T

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

fn in_current_span(self) -> Instrumented<Self>

impl<T, U> Into<U> for T where U: From<T>,

fn into(self) -> U

impl<T> Same for T

type Output = T

impl<T> ToOwned for T where T: Clone,

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T, U> TryFrom<U> for T where U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for T where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self> where S: Into<Dispatch>,

fn with_current_subscriber(self) -> WithDispatch<Self>
