Interface ILanguageModel<T>

Namespace
AiDotNet.Interfaces
Assembly
AiDotNet.dll

Defines the base contract for language models that can generate text responses. This interface unifies both synchronous and asynchronous text generation capabilities.

public interface ILanguageModel<T>

Type Parameters

T

The numeric type used for model parameters and operations (e.g., double, float).

Remarks

For Beginners: A language model is an AI that understands and generates human-like text. Think of it as a very sophisticated autocomplete that can:

  • Answer questions
  • Write essays or code
  • Translate languages
  • Summarize documents
  • Have conversations

This interface is the foundation for all language models in AiDotNet, whether they:

  • Run in the cloud (OpenAI, Anthropic, Azure)
  • Run locally on your machine (Ollama, ONNX)
  • Are used for chat applications (IChatModel)
  • Are used for RAG systems (IGenerator)

The interface provides both synchronous and asynchronous methods:

  • Async methods (GenerateAsync): Better for web apps, don't block the UI
  • Sync methods (Generate): Simpler for scripts and batch processing

Example usage:

ILanguageModel<double> model = new OpenAIChatModel<double>("your-api-key");

// Async usage (recommended for most applications)
string response = await model.GenerateAsync("Explain quantum computing");

// Sync usage (for simple scripts)
string syncResponse = model.Generate("Explain quantum computing");

Properties

MaxContextTokens

Gets the maximum number of tokens this model can process in a single request (context window).

int MaxContextTokens { get; }

Property Value

int

The context window size in tokens.

Remarks

For Beginners: This is how much text the model can "remember" or process at once. Think of it as the model's working memory.

Token counts (approximate):

  • 1 token ≈ 0.75 words
  • 100 tokens ≈ 75 words ≈ 1 paragraph
  • 1000 tokens ≈ 750 words ≈ 1 page
  • 8000 tokens ≈ 6000 words ≈ 8-10 pages

Common context window sizes:

  • GPT-3.5-turbo: 4,096 tokens (~4 pages)
  • GPT-4: 8,192 tokens (~8 pages) or 32,768 tokens (~32 pages)
  • Claude 3: 200,000 tokens (~200 pages)
  • Gemini 1.5: 1,000,000 tokens (~1,000 pages!)

Why it matters:

  • If your prompt + desired response exceeds this, the request will fail
  • Larger contexts let you provide more information but may be slower/costlier
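
For example, you can do a rough pre-flight check before sending a request. This is a minimal sketch: the EstimateTokens helper is hypothetical, based on the rule of thumb that one token is roughly four characters of English text.

// Rough pre-flight check against the context window.
// EstimateTokens is a hypothetical helper; real implementations
// should use the model's actual tokenizer for accurate counts.
static int EstimateTokens(string text) => text.Length / 4;

string prompt = "..."; // your (possibly long) input
int promptTokens = EstimateTokens(prompt);
int responseBudget = 1024; // tokens reserved for the model's answer

if (promptTokens + responseBudget > model.MaxContextTokens)
{
    // Trim, chunk, or summarize the input before sending it.
    throw new ArgumentException(
        $"Prompt (~{promptTokens} tokens) will not fit in the " +
        $"{model.MaxContextTokens}-token context window.");
}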

MaxGenerationTokens

Gets the maximum number of tokens this model can generate in a single response.

int MaxGenerationTokens { get; }

Property Value

int

The maximum generation length in tokens.

Remarks

For Beginners: This limits how long the model's response can be. It's usually smaller than MaxContextTokens because you need room for your input prompt too.

Typical values:

  • 512 tokens ≈ 384 words ≈ short answer (1-2 paragraphs)
  • 2048 tokens ≈ 1536 words ≈ medium answer (~2 pages)
  • 4096 tokens ≈ 3072 words ≈ long answer (~4 pages)

You can usually configure this when creating the model to balance:

  • Shorter responses: Faster and cheaper
  • Longer responses: More detailed but slower and more expensive
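
A small sketch of respecting this limit when a caller asks for a long answer (desiredTokens is illustrative; how you pass the cap to the model depends on the concrete implementation):

// Clamp a requested response length to what the model supports.
int desiredTokens = 8000;
int maxTokens = Math.Min(desiredTokens, model.MaxGenerationTokens);
// Pass maxTokens to the implementation's configuration, if it
// exposes a per-request setting; the interface itself does not.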

ModelName

Gets the name or identifier of the language model.

string ModelName { get; }

Property Value

string

A string representing the model's name (e.g., "gpt-4", "claude-3-opus", "llama-2-7b").

Remarks

For Beginners: This identifies which specific model you're using. Different models have different:

  • Capabilities (some are better at code, others at creative writing)
  • Costs (GPT-4 is more expensive than GPT-3.5)
  • Speed (smaller models are faster)
  • Context windows (how much text they can process at once)

Examples:

  • "gpt-4" or "gpt-3.5-turbo" (OpenAI)
  • "claude-3-opus-20240229" or "claude-3-sonnet-20240229" (Anthropic)
  • "llama-2-7b" or "mixtral-8x7b" (Open source models)
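
For example, ModelName is useful for logging and cost attribution (a minimal sketch):

// Record which model produced each response so costs and
// quality issues can be traced back later.
string answer = await model.GenerateAsync("Summarize this report");
Console.WriteLine($"[{model.ModelName}] generated {answer.Length} characters");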

Methods

Generate(string)

Generates a text response to the given prompt synchronously.

string Generate(string prompt)

Parameters

prompt string

The input text prompt to send to the language model.

Returns

string

The model's generated response as a string.

Remarks

For Beginners: This is a synchronous version of GenerateAsync - it blocks until the response is ready.

When to use this:

  • Simple command-line scripts
  • Batch processing where you process one request at a time
  • When you can't use async/await for some reason

When NOT to use this:

  • Web applications (will block request threads)
  • UI applications (will freeze the interface)
  • When processing multiple requests (use GenerateAsync and Task.WhenAll, as shown in the sketch below)

Example:

// Simple script usage
string response = model.Generate("What is 2 + 2?");
Console.WriteLine(response); // "2 + 2 equals 4."

Note: Many implementations just call GenerateAsync().GetAwaiter().GetResult() internally, so the async version is usually the "real" implementation.
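
A minimal sketch of the fan-out pattern mentioned above (requires using System.Linq and System.Threading.Tasks):

// Process several prompts concurrently with the async API
// instead of looping over the blocking Generate method.
string[] prompts = { "Define recursion", "Define iteration", "Define memoization" };
Task<string>[] tasks = prompts.Select(p => model.GenerateAsync(p)).ToArray();
string[] responses = await Task.WhenAll(tasks);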

GenerateAsync(string, CancellationToken)

Generates a text response to the given prompt asynchronously.

Task<string> GenerateAsync(string prompt, CancellationToken cancellationToken = default)

Parameters

prompt string

The input text prompt to send to the language model. This can be a question, instruction, or any text that requires a response.

cancellationToken CancellationToken

Optional cancellation token to cancel the generation operation. Use this to implement timeouts or allow users to cancel long-running requests.

Returns

Task<string>

A task that represents the asynchronous operation. The task result contains the model's generated response as a string.

Remarks

For Beginners: This is the main method for getting responses from the language model. It's asynchronous (uses async/await) which means your application won't freeze while waiting for the response.

The method is asynchronous because:

  • API calls to cloud models can take 1-10 seconds
  • Local models might need time to process
  • Your UI stays responsive while waiting

Example:

string prompt = "Write a haiku about programming";
string response = await model.GenerateAsync(prompt);
Console.WriteLine(response);
// Output: "Code flows like water
//          Bugs emerge then disappear
//          Peace in the logic"

Best practices:

  • Always use try-catch to handle errors (API failures, rate limits, etc.)
  • Consider retry logic for transient failures
  • Monitor token usage to control costs
  • Use CancellationToken to implement timeouts and allow cancellation (e.g., var cts = new CancellationTokenSource(TimeSpan.FromSeconds(30)))
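
Putting those practices together (a minimal sketch; the concrete exception types thrown depend on the implementation):

// Time out after 30 seconds and handle failures cleanly.
using var cts = new CancellationTokenSource(TimeSpan.FromSeconds(30));
try
{
    string answer = await model.GenerateAsync(prompt, cts.Token);
    Console.WriteLine(answer);
}
catch (OperationCanceledException)
{
    Console.WriteLine("The request was cancelled or timed out.");
}
catch (Exception ex) // e.g., rate limits or network failures; narrow this in real code
{
    Console.WriteLine($"Generation failed: {ex.Message}");
}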