Class AttentiveNAS<T>

Namespace: AiDotNet.AutoML.NAS

Assembly: AiDotNet.dll

Implements AttentiveNAS, a neural architecture search (NAS) algorithm that improves the search through attentive sampling. An attention-based meta-network guides the sampling of sub-networks, focusing the search on promising regions of the architecture space.

Reference: "AttentiveNAS: Improving Neural Architecture Search via Attentive Sampling" (CVPR 2021)

public class AttentiveNAS<T> : NasAutoMLModelBase<T>, IAutoMLModel<T, Tensor<T>, Tensor<T>>, IFullModel<T, Tensor<T>, Tensor<T>>, IModel<Tensor<T>, Tensor<T>, ModelMetadata<T>>, IModelSerializer, ICheckpointableModel, IParameterizable<T, Tensor<T>, Tensor<T>>, IFeatureAware, IFeatureImportance<T>, ICloneable<IFullModel<T, Tensor<T>, Tensor<T>>>, IGradientComputable<T, Tensor<T>, Tensor<T>>, IJitCompilable<T>

Type Parameters

T: The numeric type used for calculations.

Inheritance: object

AutoMLModelBase<T, Tensor<T>, Tensor<T>>

NasAutoMLModelBase<T>

AttentiveNAS<T>

Implements: IAutoMLModel<T, Tensor<T>, Tensor<T>>

IFullModel<T, Tensor<T>, Tensor<T>>

IModel<Tensor<T>, Tensor<T>, ModelMetadata<T>>

IModelSerializer

ICheckpointableModel

IParameterizable<T, Tensor<T>, Tensor<T>>

IFeatureAware

IFeatureImportance<T>

ICloneable<IFullModel<T, Tensor<T>, Tensor<T>>>

IGradientComputable<T, Tensor<T>, Tensor<T>>

IJitCompilable<T>

Inherited Members: NasAutoMLModelBase<T>.BestArchitecture

NasAutoMLModelBase<T>.SearchAsync(Tensor<T>, Tensor<T>, Tensor<T>, Tensor<T>, TimeSpan, CancellationToken)

NasAutoMLModelBase<T>.SuggestNextTrialAsync()

NasAutoMLModelBase<T>.CreateModelAsync(ModelType, Dictionary<string, object>)

NasAutoMLModelBase<T>.GetDefaultSearchSpace(ModelType)

NasAutoMLModelBase<T>.ApplyArchitectureToModel(SuperNet<T>, Architecture<T>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>._trialHistory

AutoMLModelBase<T, Tensor<T>, Tensor<T>>._searchSpace

AutoMLModelBase<T, Tensor<T>, Tensor<T>>._candidateModels

AutoMLModelBase<T, Tensor<T>, Tensor<T>>._constraints

AutoMLModelBase<T, Tensor<T>, Tensor<T>>._lock

AutoMLModelBase<T, Tensor<T>, Tensor<T>>._optimizationMetric

AutoMLModelBase<T, Tensor<T>, Tensor<T>>._maximize

AutoMLModelBase<T, Tensor<T>, Tensor<T>>._optimizationMetricExplicitlySet

AutoMLModelBase<T, Tensor<T>, Tensor<T>>._earlyStoppingPatience

AutoMLModelBase<T, Tensor<T>, Tensor<T>>._earlyStoppingMinDelta

AutoMLModelBase<T, Tensor<T>, Tensor<T>>._trialsSinceImprovement

AutoMLModelBase<T, Tensor<T>, Tensor<T>>._modelEvaluator

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.OptimizationMetric

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.MaximizeOptimizationMetric

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.Type

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.Status

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.BestModel

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.BestScore

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.TimeLimit

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.TrialLimit

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.SearchAsync(Tensor<T>, Tensor<T>, Tensor<T>, Tensor<T>, TimeSpan, CancellationToken)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.SetSearchSpace(Dictionary<string, ParameterRange>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.SetCandidateModels(List<ModelType>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.SetOptimizationMetric(MetricType, bool)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.GetTrialHistory()

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.GetFeatureImportanceAsync()

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.SuggestNextTrialAsync()

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.ReportTrialResultAsync(Dictionary<string, object>, double, TimeSpan)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.EnableEarlyStopping(int, double)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.SetConstraints(List<SearchConstraint>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.Train(double[][], double[])

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.Predict(double[][])

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.GetModelMetadata()

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.ShouldStop()

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.ValidateConstraints(Dictionary<string, object>, IFullModel<T, Tensor<T>, Tensor<T>>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.CreateModelAsync(ModelType, Dictionary<string, object>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.EvaluateModelAsync(IFullModel<T, Tensor<T>, Tensor<T>>, Tensor<T>, Tensor<T>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.GetDefaultSearchSpace(ModelType)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.Train(Tensor<T>, Tensor<T>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.Predict(Tensor<T>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.SaveModel(string)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.LoadModel(string)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.Serialize()

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.Deserialize(byte[])

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.GetParameters()

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.SetParameters(Vector<T>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.ParameterCount

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.WithParameters(Vector<T>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.FeatureNames

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.GetFeatureImportance()

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.GetActiveFeatureIndices()

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.IsFeatureUsed(int)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.SetActiveFeatureIndices(IEnumerable<int>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.Clone()

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.DeepCopy()

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.CreateInstanceForCopy()

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.SetModelEvaluator(IModelEvaluator<T, Tensor<T>, Tensor<T>>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.ExtractMetricFromEvaluation(ModelEvaluationData<T, Tensor<T>, Tensor<T>>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.ReportTrialFailureAsync(Dictionary<string, object>, Exception, TimeSpan)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.ConfigureSearchSpace(Dictionary<string, ParameterRange>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.SetTimeLimit(TimeSpan)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.SetTrialLimit(int)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.EnableNAS(bool)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.SearchBestModel(Tensor<T>, Tensor<T>, Tensor<T>, Tensor<T>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.Search(Tensor<T>, Tensor<T>, Tensor<T>, Tensor<T>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.GetResults()

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.Run(Tensor<T>, Tensor<T>, Tensor<T>, Tensor<T>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.SetModelsToTry(List<ModelType>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.DefaultLossFunction

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.ComputeGradients(Tensor<T>, Tensor<T>, ILossFunction<T>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.ApplyGradients(Vector<T>, T)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.SupportsJitCompilation

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.ExportComputationGraph(List<ComputationNode<T>>)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.SaveState(Stream)

AutoMLModelBase<T, Tensor<T>, Tensor<T>>.LoadState(Stream)

object.Equals(object)

object.Equals(object, object)

object.GetHashCode()

object.GetType()

object.MemberwiseClone()

object.ReferenceEquals(object, object)

object.ToString()

Extension Methods: DistributedExtensions.AsDistributedForHighBandwidth<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributedForLowBandwidth<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributed<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributed<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, IShardingConfiguration<T>)

Constructors

AttentiveNAS(SearchSpaceBase<T>, List<int>?, List<double>?, List<int>?, int)

public AttentiveNAS(SearchSpaceBase<T> searchSpace, List<int>? elasticDepths = null, List<double>? elasticWidthMultipliers = null, List<int>? elasticKernelSizes = null, int attentionHiddenSize = 128)

Parameters

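searchSpace SearchSpaceBase<T>
elasticDepths List<int>?: Optional; defaults to null.
elasticWidthMultipliers List<double>?: Optional; defaults to null.
elasticKernelSizes List<int>?: Optional; defaults to null.
attentionHiddenSize int: Optional; defaults to 128.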

Properties

NasNumNodes

Gets the number of nodes to search over.

protected override int NasNumNodes { get; }

Property Value

int

NasSearchSpace

Gets the NAS search space.

protected override SearchSpaceBase<T> NasSearchSpace { get; }

Property Value

SearchSpaceBase<T>

NumOps

Gets the numeric operations provider for T.

protected override INumericOperations<T> NumOps { get; }

Property Value

INumericOperations<T>

Methods

AttentiveSample(Vector<T>)

Samples an architecture using an attention-based sampling strategy. The attention module learns to focus on high-performing regions of the architecture space.

public AttentiveNASConfig<T> AttentiveSample(Vector<T> contextVector)

Parameters

contextVector Vector<T>

Returns

AttentiveNASConfig<T>
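As a rough illustration of attention-weighted sampling in general, the sketch below turns one score per candidate option into a softmax distribution and draws an index from it. It is not AiDotNet's implementation: `AttentiveSamplingSketch`, `Softmax`, and `SampleIndex` are hypothetical names, and plain `double[]` arrays stand in for `Vector<T>`.

```csharp
using System;
using System.Linq;

// Hypothetical helper, not part of AiDotNet: illustrates score-weighted sampling.
public static class AttentiveSamplingSketch
{
    // Converts raw attention scores into probabilities that sum to 1.
    public static double[] Softmax(double[] scores)
    {
        double max = scores.Max();                        // subtract the max for numerical stability
        double[] exps = scores.Select(s => Math.Exp(s - max)).ToArray();
        double sum = exps.Sum();
        return exps.Select(e => e / sum).ToArray();
    }

    // Draws an index with probability equal to its softmax weight.
    public static int SampleIndex(double[] scores, Random rng)
    {
        double[] probs = Softmax(scores);
        double u = rng.NextDouble();                      // uniform in [0, 1)
        double cumulative = 0.0;
        for (int i = 0; i < probs.Length; i++)
        {
            cumulative += probs[i];
            if (u < cumulative) return i;
        }
        return probs.Length - 1;                          // guard against rounding error
    }
}
```

Options with higher scores are drawn more often, while low-scoring options keep a non-zero chance of being explored.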

CreateContextVector()

Creates a context vector from recent architecture performance history.

public Vector<T> CreateContextVector()

Returns

Vector<T>
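The page does not say how the performance history is encoded. One plausible encoding, shown purely as an assumption, summarises the most recent scores as a small fixed-length vector. `ContextVectorSketch`, `FromHistory`, and the `window` parameter are hypothetical and not part of AiDotNet.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical helper, not part of AiDotNet.
public static class ContextVectorSketch
{
    // Summarises the last `window` scores as [mean, best, most recent].
    public static double[] FromHistory(IReadOnlyList<double> scores, int window)
    {
        if (scores.Count == 0) return new double[3];      // no history yet: all zeros
        double[] recent = scores.Skip(Math.Max(0, scores.Count - window)).ToArray();
        return new[] { recent.Average(), recent.Max(), recent[recent.Length - 1] };
    }
}
```

With scores 0.5, 0.7, 0.6 and a window of 2, only 0.7 and 0.6 contribute to the summary.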

CreateInstanceForCopy()

Factory method for creating a new instance for deep copy. Derived classes must implement this to return a new instance of themselves. This ensures each copy has its own collections and lock object.

protected override AutoMLModelBase<T, Tensor<T>, Tensor<T>> CreateInstanceForCopy()

Returns

AutoMLModelBase<T, Tensor<T>, Tensor<T>>: A fresh instance of the derived class with default parameters.

Remarks

When implementing this method, derived classes should create a fresh instance with default parameters, and should not attempt to preserve runtime or initialization state from the original instance. The deep copy logic will transfer relevant state (trial history, search space, etc.) after construction.
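The contract described in these remarks is a factory-method pattern: the derived class only constructs a fresh object, and the base class transfers state afterwards. A minimal sketch follows, using hypothetical stand-in classes (`SearchBaseSketch`, `AttentiveSearchSketch`) rather than the real AiDotNet types, and a single list in place of the trial history, search space, and other state.

```csharp
using System.Collections.Generic;

// Hypothetical stand-in for the base class; not an AiDotNet type.
public abstract class SearchBaseSketch
{
    protected readonly List<double> History = new List<double>();

    public void Record(double score) => History.Add(score);
    public int HistoryCount => History.Count;

    // Derived classes return a fresh instance built with default parameters.
    protected abstract SearchBaseSketch CreateInstanceForCopy();

    // The base class owns the copy logic: construct first, then transfer state.
    public SearchBaseSketch DeepCopy()
    {
        SearchBaseSketch copy = CreateInstanceForCopy();
        copy.History.AddRange(History);   // the copy fills its own list, not a shared reference
        return copy;
    }
}

// Hypothetical derived class: creates the instance and copies no state itself.
public sealed class AttentiveSearchSketch : SearchBaseSketch
{
    protected override SearchBaseSketch CreateInstanceForCopy() => new AttentiveSearchSketch();
}
```

Because each instance allocates its own collection in its field initializer, recording a score on the copy leaves the original untouched.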

GetAttentionWeights()

Gets the attention weights.

public Matrix<T> GetAttentionWeights()

Returns

Matrix<T>

GetPerformanceMemory()

Gets the performance memory.

public Dictionary<string, T> GetPerformanceMemory()

Returns

Dictionary<string, T>

Search(HardwareConstraints<T>, int, int, int)

Searches for an optimal architecture using attentive sampling.

public AttentiveNASConfig<T> Search(HardwareConstraints<T> constraints, int inputChannels, int spatialSize, int numIterations = 100)

Parameters

constraints HardwareConstraints<T>
inputChannels int
spatialSize int
numIterations int: Optional; defaults to 100.

Returns

AttentiveNASConfig<T>
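The page does not describe the search loop itself. As a hedged sketch of what a constrained, attention-guided search over a fixed iteration budget could look like, the example below samples a candidate index, skips candidates that fail a feasibility check, and reinforces the score of candidates that perform well. Everything here is an assumption for illustration: `AttentiveSearchLoopSketch`, `Run`, the `isFeasible` and `evaluate` delegates, and integer candidate indices in place of `AttentiveNASConfig<T>` and `HardwareConstraints<T>`.

```csharp
using System;
using System.Linq;

// Hypothetical helper, not part of AiDotNet.
public static class AttentiveSearchLoopSketch
{
    // Returns the index of the best feasible candidate found within the budget, or -1 if none.
    public static int Run(int numCandidates, Func<int, bool> isFeasible,
                          Func<int, double> evaluate, int numIterations, Random rng,
                          double learningRate = 0.5)
    {
        var scores = new double[numCandidates];           // attention scores, all start equal
        int best = -1;
        double bestPerformance = double.NegativeInfinity;

        for (int iter = 0; iter < numIterations; iter++)
        {
            // Sample a candidate with probability proportional to exp(score).
            double max = scores.Max();
            double[] weights = scores.Select(s => Math.Exp(s - max)).ToArray();
            double u = rng.NextDouble() * weights.Sum();
            int pick = 0;
            while (pick < numCandidates - 1 && u >= weights[pick]) { u -= weights[pick]; pick++; }

            // Steer away from candidates that violate the constraints.
            if (!isFeasible(pick)) { scores[pick] -= learningRate; continue; }

            double performance = evaluate(pick);
            scores[pick] += learningRate * performance;   // reinforce well-performing candidates
            if (performance > bestPerformance) { bestPerformance = performance; best = pick; }
        }
        return best;
    }
}
```

A call such as `Run(3, i => i != 2, i => i == 1 ? 0.9 : 0.1, 100, new Random(42))` never returns the infeasible index 2, because only feasible candidates are evaluated and recorded as best.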

SearchArchitecture(Tensor<T>, Tensor<T>, Tensor<T>, Tensor<T>, TimeSpan, CancellationToken)

Performs algorithm-specific architecture search.

protected override Architecture<T> SearchArchitecture(Tensor<T> inputs, Tensor<T> targets, Tensor<T> validationInputs, Tensor<T> validationTargets, TimeSpan timeLimit, CancellationToken cancellationToken)

Parameters

inputs Tensor<T>
targets Tensor<T>
validationInputs Tensor<T>
validationTargets Tensor<T>
timeLimit TimeSpan
cancellationToken CancellationToken

Returns

Architecture<T>

UpdateAttention(AttentiveNASConfig<T>, T, T)

Updates the attention module based on architecture performance. High-performing architectures increase attention to similar regions.

public void UpdateAttention(AttentiveNASConfig<T> config, T performance, T learningRate)

Parameters

config AttentiveNASConfig<T>
performance T
learningRate T
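The update rule is not specified on this page. One simple rule consistent with the description, shown only as an assumption, raises the score of the sampled option when its performance beats a baseline and lowers it otherwise. `AttentionUpdateSketch`, `Update`, and the `baseline` parameter are hypothetical; the real method takes only `config`, `performance`, and `learningRate`.

```csharp
using System;

// Hypothetical helper, not part of AiDotNet.
public static class AttentionUpdateSketch
{
    // Nudges the score of the chosen option up when performance beats the baseline
    // and down when it falls short. Mutates `scores` in place.
    public static void Update(double[] scores, int chosen, double performance,
                              double baseline, double learningRate)
    {
        if (chosen < 0 || chosen >= scores.Length)
            throw new ArgumentOutOfRangeException(nameof(chosen));
        scores[chosen] += learningRate * (performance - baseline);
    }
}
```

The learning rate scales the step size, so smaller values make the attention scores change more gradually between iterations.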
