Class AdaptiveDistillationStrategyBase<T>

Namespace: AiDotNet.KnowledgeDistillation.Strategies

Assembly: AiDotNet.dll

Abstract base class for adaptive distillation strategies with performance tracking.

public abstract class AdaptiveDistillationStrategyBase<T> : DistillationStrategyBase<T>, IDistillationStrategy<T>, IAdaptiveDistillationStrategy<T>

Type Parameters

T: The numeric type for calculations (e.g., double, float).

Inheritance: object

DistillationStrategyBase<T>

AdaptiveDistillationStrategyBase<T>

Implements: IDistillationStrategy<T>

IAdaptiveDistillationStrategy<T>

Derived: AccuracyBasedAdaptiveStrategy<T>

ConfidenceBasedAdaptiveStrategy<T>

EntropyBasedAdaptiveStrategy<T>

Inherited Members: DistillationStrategyBase<T>.NumOps

DistillationStrategyBase<T>.Temperature

DistillationStrategyBase<T>.Alpha

DistillationStrategyBase<T>.ValidateOutputDimensions(Matrix<T>, Matrix<T>)

DistillationStrategyBase<T>.ValidateLabelDimensions(Matrix<T>, Matrix<T>)

DistillationStrategyBase<T>.Epsilon

object.Equals(object)

object.Equals(object, object)

object.GetHashCode()

object.GetType()

object.MemberwiseClone()

object.ReferenceEquals(object, object)

object.ToString()

Remarks

For Beginners: This base class provides common functionality for all adaptive strategies, including performance tracking with exponential moving average and temperature range management.

For Implementers: Derive from this class and implement ComputeAdaptiveTemperature(Vector<T>, Vector<T>) to define your specific adaptation logic.

Shared Features: - Exponential moving average (EMA) for performance tracking - Temperature range validation and enforcement - Performance history management - Helper methods for confidence, entropy, and accuracy calculations

Constructors

AdaptiveDistillationStrategyBase(double, double, double, double, double)

Initializes a new instance of the AdaptiveDistillationStrategyBase class.

protected AdaptiveDistillationStrategyBase(double baseTemperature = 3, double alpha = 0.3, double minTemperature = 1, double maxTemperature = 5, double adaptationRate = 0.1)

Parameters

baseTemperature double: Base temperature for distillation (default: 3.0).
alpha double: Balance between hard and soft loss (default: 0.3).
minTemperature double: Minimum temperature for adaptation (default: 1.0).
maxTemperature double: Maximum temperature for adaptation (default: 5.0).
adaptationRate double: Rate for EMA performance tracking (default: 0.1).

Properties

AdaptationRate

Gets the adaptation rate for exponential moving average.

public double AdaptationRate { get; }

Property Value

double

MaxTemperature

Gets the maximum temperature for adaptation.

public double MaxTemperature { get; }

Property Value

double

MinTemperature

Gets the minimum temperature for adaptation.

public double MinTemperature { get; }

Property Value

double

Methods

ArgMax(Vector<T>)

Finds the index of the maximum value in a vector.

protected int ArgMax(Vector<T> vector)

Parameters

vector Vector<T>

Returns

int

ClampTemperature(double)

Clamps a value to the temperature range [MinTemperature, MaxTemperature].

protected double ClampTemperature(double temperature)

Parameters

temperature double

Returns

double

ComputeAdaptiveTemperature(Vector<T>, Vector<T>)

Computes the adaptive temperature for a specific sample.

public abstract double ComputeAdaptiveTemperature(Vector<T> studentOutput, Vector<T> teacherOutput)

Parameters

studentOutput Vector<T>
teacherOutput Vector<T>

Returns

double

Remarks

For Implementers: Override this to define strategy-specific temperature adaptation.

ComputeEntropy(Vector<T>)

Computes the entropy of a probability distribution.

protected double ComputeEntropy(Vector<T> probabilities)

Parameters

probabilities Vector<T>

Returns

double

Remarks

Entropy measures uncertainty. Higher entropy = more uncertain = harder sample.

ComputeGradient(Matrix<T>, Matrix<T>, Matrix<T>?)

Computes gradient with adaptive temperature.

public override Matrix<T> ComputeGradient(Matrix<T> studentBatchOutput, Matrix<T> teacherBatchOutput, Matrix<T>? trueLabelsBatch = null)

Parameters

studentBatchOutput Matrix<T>
teacherBatchOutput Matrix<T>
trueLabelsBatch Matrix<T>

Returns

Matrix<T>

ComputeLoss(Matrix<T>, Matrix<T>, Matrix<T>?)

Computes distillation loss with adaptive temperature.

public override T ComputeLoss(Matrix<T> studentBatchOutput, Matrix<T> teacherBatchOutput, Matrix<T>? trueLabelsBatch = null)

Parameters

studentBatchOutput Matrix<T>
teacherBatchOutput Matrix<T>
trueLabelsBatch Matrix<T>

Returns

T

ComputePerformance(Vector<T>, Vector<T>?)

Computes a performance metric for the student output.

protected virtual double ComputePerformance(Vector<T> studentOutput, Vector<T>? trueLabel)

Parameters

studentOutput Vector<T>
trueLabel Vector<T>

Returns

double

Remarks

For Implementers: Override to define strategy-specific performance metrics.

Default: Returns max confidence (highest probability).

GetMaxConfidence(Vector<T>)

Gets the maximum confidence (highest probability) from a probability distribution.

protected double GetMaxConfidence(Vector<T> probabilities)

Parameters

probabilities Vector<T>

Returns

double

GetPerformance(int)

Gets the current performance metric for a sample.

public virtual double GetPerformance(int sampleIndex)

Parameters

sampleIndex int

Returns

double

IsCorrect(Vector<T>, Vector<T>)

Checks if the student prediction is correct.

protected bool IsCorrect(Vector<T> studentOutput, Vector<T> trueLabel)

Parameters

studentOutput Vector<T>
trueLabel Vector<T>

Returns

bool

UpdatePerformance(int, Vector<T>, Vector<T>?)

Updates the performance metric for a specific sample using exponential moving average.

public virtual void UpdatePerformance(int sampleIndex, Vector<T> studentOutput, Vector<T>? trueLabel = null)

Parameters

sampleIndex int
studentOutput Vector<T>
trueLabel Vector<T>

Table of Contents

Class AdaptiveDistillationStrategyBase<T>

Type Parameters

Remarks

Constructors

AdaptiveDistillationStrategyBase(double, double, double, double, double)

Parameters

Properties

AdaptationRate

Property Value

MaxTemperature

Property Value

MinTemperature

Property Value

Methods

ArgMax(Vector<T>)

Parameters

Returns

ClampTemperature(double)

Parameters

Returns

ComputeAdaptiveTemperature(Vector<T>, Vector<T>)

Parameters

Returns

Remarks

ComputeEntropy(Vector<T>)

Parameters

Returns

Remarks

ComputeGradient(Matrix<T>, Matrix<T>, Matrix<T>?)

Parameters

Returns

ComputeLoss(Matrix<T>, Matrix<T>, Matrix<T>?)

Parameters

Returns

ComputePerformance(Vector<T>, Vector<T>?)

Parameters

Returns

Remarks

GetMaxConfidence(Vector<T>)

Parameters

Returns

GetPerformance(int)

Parameters

Returns

IsCorrect(Vector<T>, Vector<T>)

Parameters

Returns

UpdatePerformance(int, Vector<T>, Vector<T>?)

Parameters