Table of Contents

Class MultimodalOutput<T>

Namespace
AiDotNet.Interfaces
Assembly
AiDotNet.dll

Represents an output from unified multimodal models.

public class MultimodalOutput<T>

Type Parameters

T

The numeric type used for calculations.

Inheritance
MultimodalOutput<T>
Inherited Members

Properties

Confidence

Confidence score for the output.

public T? Confidence { get; set; }

Property Value

T

Metadata

Optional metadata about the output.

public Dictionary<string, object>? Metadata { get; set; }

Property Value

Dictionary<string, object>

Modality

The modality type of this output.

public ModalityType Modality { get; set; }

Property Value

ModalityType

TextContent

Text content (for text outputs).

public string? TextContent { get; set; }

Property Value

string

Methods

GetAudioSamples()

Gets audio data as waveform samples for audio outputs.

public Vector<T>? GetAudioSamples()

Returns

Vector<T>

GetImageDimensions()

Gets the dimensions of image output data.

public (int Channels, int Height, int Width)? GetImageDimensions()

Returns

(int Channels, int Height, int Width)?

GetImagePixels()

Gets image data as pixel values for image outputs.

public Vector<T>? GetImagePixels()

Returns

Vector<T>

GetVideoDimensions()

Gets the dimensions of video output data.

public (int Frames, int Channels, int Height, int Width)? GetVideoDimensions()

Returns

(int X, int Y, int Width, int Height)?

GetVideoFrames()

Gets video frame data for video outputs.

public Vector<T>? GetVideoFrames()

Returns

Vector<T>