Class SpiralNet<T>

Namespace: AiDotNet.NeuralNetworks

Assembly: AiDotNet.dll

Implements the SpiralNet++ architecture for mesh-based deep learning.

public class SpiralNet<T> : NeuralNetworkBase<T>, INeuralNetworkModel<T>, INeuralNetwork<T>, IFullModel<T, Tensor<T>, Tensor<T>>, IModel<Tensor<T>, Tensor<T>, ModelMetadata<T>>, IModelSerializer, ICheckpointableModel, IParameterizable<T, Tensor<T>, Tensor<T>>, IFeatureAware, IFeatureImportance<T>, ICloneable<IFullModel<T, Tensor<T>, Tensor<T>>>, IGradientComputable<T, Tensor<T>, Tensor<T>>, IJitCompilable<T>, IInterpretableModel<T>, IInputGradientComputable<T>, IDisposable

Type Parameters

T: The numeric type used for calculations (typically float or double).

Inheritance: object

NeuralNetworkBase<T>

SpiralNet<T>

Implements: INeuralNetworkModel<T>

INeuralNetwork<T>

IFullModel<T, Tensor<T>, Tensor<T>>

IModel<Tensor<T>, Tensor<T>, ModelMetadata<T>>

IModelSerializer

ICheckpointableModel

IParameterizable<T, Tensor<T>, Tensor<T>>

IFeatureAware

IFeatureImportance<T>

ICloneable<IFullModel<T, Tensor<T>, Tensor<T>>>

IGradientComputable<T, Tensor<T>, Tensor<T>>

IJitCompilable<T>

IInterpretableModel<T>

IInputGradientComputable<T>

IDisposable

Inherited Members: NeuralNetworkBase<T>.Layers

NeuralNetworkBase<T>.LayerCount

NeuralNetworkBase<T>.Architecture

NeuralNetworkBase<T>.NumOps

NeuralNetworkBase<T>.Engine

NeuralNetworkBase<T>._layerInputs

NeuralNetworkBase<T>._layerOutputs

NeuralNetworkBase<T>.Random

NeuralNetworkBase<T>.LossFunction

NeuralNetworkBase<T>.LastLoss

NeuralNetworkBase<T>.IsTrainingMode

NeuralNetworkBase<T>.SupportsTraining

NeuralNetworkBase<T>.SupportsGpuTraining

NeuralNetworkBase<T>.CanTrainOnGpu

NeuralNetworkBase<T>.GpuEngine

NeuralNetworkBase<T>.MaxGradNorm

NeuralNetworkBase<T>._mixedPrecisionContext

NeuralNetworkBase<T>._memoryManager

NeuralNetworkBase<T>.IsMemoryManagementEnabled

NeuralNetworkBase<T>.IsGradientCheckpointingEnabled

NeuralNetworkBase<T>.IsMixedPrecisionEnabled

NeuralNetworkBase<T>.ClipGradients(List<Tensor<T>>)

NeuralNetworkBase<T>.ClipGradient(Tensor<T>)

NeuralNetworkBase<T>.ClipGradient(Vector<T>)

NeuralNetworkBase<T>.GetParameters()

NeuralNetworkBase<T>.Backpropagate(Tensor<T>)

NeuralNetworkBase<T>.BackpropagateWithRecompute(Tensor<T>)

NeuralNetworkBase<T>.ForwardGpu(IGpuTensor<T>)

NeuralNetworkBase<T>.BackpropagateGpu(IGpuTensor<T>)

NeuralNetworkBase<T>.BackpropagateGpuDeferred(IGpuTensor<T>, GpuExecutionOptions)

NeuralNetworkBase<T>.UpdateParametersGpu(T, T, T)

NeuralNetworkBase<T>.UpdateParametersGpu(IGpuOptimizerConfig)

NeuralNetworkBase<T>.UpdateParametersGpuDeferred(IGpuOptimizerConfig, GpuExecutionOptions)

NeuralNetworkBase<T>.TrainBatchGpuDeferred(IGpuTensor<T>, IGpuTensor<T>, IGpuOptimizerConfig, GpuExecutionOptions)

NeuralNetworkBase<T>.TrainBatchGpuDeferredAsync(IGpuTensor<T>, IGpuTensor<T>, IGpuOptimizerConfig, GpuExecutionOptions, CancellationToken)

NeuralNetworkBase<T>.UploadWeightsToGpu()

NeuralNetworkBase<T>.DownloadWeightsFromGpu()

NeuralNetworkBase<T>.ZeroGradientsGpu()

NeuralNetworkBase<T>.ExtractSingleExample(Tensor<T>, int)

NeuralNetworkBase<T>.ForwardWithMemory(Tensor<T>)

NeuralNetworkBase<T>.ForwardWithCheckpointing(Tensor<T>)

NeuralNetworkBase<T>.CanUseGpuResidentPath()

NeuralNetworkBase<T>.TryForwardGpuOptimized(Tensor<T>, out Tensor<T>)

NeuralNetworkBase<T>.ForwardGpu(Tensor<T>)

NeuralNetworkBase<T>.ForwardDeferred(Tensor<T>)

NeuralNetworkBase<T>.ForwardDeferredAsync(Tensor<T>, CancellationToken)

NeuralNetworkBase<T>.BeginGpuExecution(GpuExecutionOptions)

NeuralNetworkBase<T>.ForwardWithGpuContext(Tensor<T>)

NeuralNetworkBase<T>.ForwardWithGpuContext(IGpuTensor<T>)

NeuralNetworkBase<T>.GetGpuMemoryStats()

NeuralNetworkBase<T>.ForwardWithFeatures(Tensor<T>, int[])

NeuralNetworkBase<T>.ParameterCount

NeuralNetworkBase<T>.GetParameterCount()

NeuralNetworkBase<T>.InvalidateParameterCountCache()

NeuralNetworkBase<T>.AddLayerToCollection(ILayer<T>)

NeuralNetworkBase<T>.RemoveLayerFromCollection(ILayer<T>)

NeuralNetworkBase<T>.ClearLayers()

NeuralNetworkBase<T>.ValidateCustomLayers(List<ILayer<T>>)

NeuralNetworkBase<T>.ValidateCustomLayersInternal(List<ILayer<T>>)

NeuralNetworkBase<T>.IsValidInputLayer(ILayer<T>)

NeuralNetworkBase<T>.IsValidOutputLayer(ILayer<T>)

NeuralNetworkBase<T>.AreLayersCompatible(ILayer<T>, ILayer<T>)

NeuralNetworkBase<T>.GetParameterGradients()

NeuralNetworkBase<T>.EnsureArchitectureInitialized()

NeuralNetworkBase<T>.SetTrainingMode(bool)

NeuralNetworkBase<T>.EnableMemoryManagement(TrainingMemoryConfig)

NeuralNetworkBase<T>.DisableMemoryManagement()

NeuralNetworkBase<T>.GetMemoryEstimate(int, int)

NeuralNetworkBase<T>.GetLastLoss()

NeuralNetworkBase<T>.ResetState()

NeuralNetworkBase<T>.BackwardWithInputGradient(Tensor<T>)

NeuralNetworkBase<T>.ComputeInputGradient(Vector<T>, Vector<T>)

NeuralNetworkBase<T>.ComputeInputGradient(Tensor<T>, Tensor<T>)

NeuralNetworkBase<T>.SaveModel(string)

NeuralNetworkBase<T>.LoadModel(string)

NeuralNetworkBase<T>.Serialize()

NeuralNetworkBase<T>.Deserialize(byte[])

NeuralNetworkBase<T>.WithParameters(Vector<T>)

NeuralNetworkBase<T>.GetActiveFeatureIndices()

NeuralNetworkBase<T>.IsFeatureUsed(int)

NeuralNetworkBase<T>.DeepCopy()

NeuralNetworkBase<T>.Clone()

NeuralNetworkBase<T>.SetActiveFeatureIndices(IEnumerable<int>)

NeuralNetworkBase<T>._enabledMethods

NeuralNetworkBase<T>._sensitiveFeatures

NeuralNetworkBase<T>._fairnessMetrics

NeuralNetworkBase<T>._baseModel

NeuralNetworkBase<T>.GetGlobalFeatureImportanceAsync()

NeuralNetworkBase<T>.GetLocalFeatureImportanceAsync(Tensor<T>)

NeuralNetworkBase<T>.GetShapValuesAsync(Tensor<T>)

NeuralNetworkBase<T>.GetLimeExplanationAsync(Tensor<T>, int)

NeuralNetworkBase<T>.GetPartialDependenceAsync(Vector<int>, int)

NeuralNetworkBase<T>.GetCounterfactualAsync(Tensor<T>, Tensor<T>, int)

NeuralNetworkBase<T>.GetModelSpecificInterpretabilityAsync()

NeuralNetworkBase<T>.GenerateTextExplanationAsync(Tensor<T>, Tensor<T>)

NeuralNetworkBase<T>.GetFeatureInteractionAsync(int, int)

NeuralNetworkBase<T>.ValidateFairnessAsync(Tensor<T>, int)

NeuralNetworkBase<T>.GetAnchorExplanationAsync(Tensor<T>, T)

NeuralNetworkBase<T>.SetBaseModel<TInput, TOutput>(IFullModel<T, TInput, TOutput>)

NeuralNetworkBase<T>.EnableMethod(params InterpretationMethod[])

NeuralNetworkBase<T>.ConfigureFairness(Vector<int>, params FairnessMetric[])

NeuralNetworkBase<T>.GetNamedLayerActivations(Tensor<T>)

NeuralNetworkBase<T>.GetArchitecture()

NeuralNetworkBase<T>.GetFeatureImportance()

NeuralNetworkBase<T>.SetParameters(Vector<T>)

NeuralNetworkBase<T>.AddLayer(LayerType, int, ActivationFunction)

NeuralNetworkBase<T>.AddConvolutionalLayer(int, int, int, ActivationFunction)

NeuralNetworkBase<T>.AddLSTMLayer(int, bool)

NeuralNetworkBase<T>.AddDropoutLayer(double)

NeuralNetworkBase<T>.AddBatchNormalizationLayer(int, double, double)

NeuralNetworkBase<T>.AddPoolingLayer(int[], PoolingType, int, int?)

NeuralNetworkBase<T>.GetGradients()

NeuralNetworkBase<T>.GetInputShape()

NeuralNetworkBase<T>.GetLayerActivations(Tensor<T>)

NeuralNetworkBase<T>.DefaultLossFunction

NeuralNetworkBase<T>.ComputeGradients(Tensor<T>, Tensor<T>, ILossFunction<T>)

NeuralNetworkBase<T>.ApplyGradients(Vector<T>, T)

NeuralNetworkBase<T>.SaveState(Stream)

NeuralNetworkBase<T>.LoadState(Stream)

NeuralNetworkBase<T>.Dispose()

NeuralNetworkBase<T>.Dispose(bool)

NeuralNetworkBase<T>.SupportsJitCompilation

NeuralNetworkBase<T>.ExportComputationGraph(List<ComputationNode<T>>)

NeuralNetworkBase<T>.ConvertLayerToGraph(ILayer<T>, ComputationNode<T>)

object.Equals(object)

object.Equals(object, object)

object.GetHashCode()

object.GetType()

object.MemberwiseClone()

object.ReferenceEquals(object, object)

object.ToString()

Extension Methods: DistributedExtensions.AsDistributedForHighBandwidth<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributedForLowBandwidth<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributed<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributed<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, IShardingConfiguration<T>)

Remarks

SpiralNet++ processes 3D meshes by applying convolutions along spiral sequences of vertex neighbors. This creates translation-equivariant operations on irregular mesh structures without requiring mesh registration.

For Beginners: SpiralNet++ is designed for learning from 3D mesh data.

Key concepts:

Mesh: A 3D surface made of vertices connected by edges/triangles
Spiral ordering: A consistent way to visit vertex neighbors (like a clock hand)
Spiral convolution: Apply weights to neighbors in spiral order

How it works:

For each vertex, define a spiral ordering of its neighbors
Gather neighbor features in spiral order
Apply learned weights to the gathered features
Pool vertices to create hierarchical representations
Classify or segment the mesh

Applications:

3D face reconstruction and expression recognition
Human body shape analysis
Medical surface analysis (organs, bones)
CAD model classification

Reference: "SpiralNet++: A Fast and Highly Efficient Mesh Convolution Operator" by Gong et al.

Constructors

SpiralNet()

Initializes a new instance of the SpiralNet<T> class with default options.

public SpiralNet()

Remarks

Creates a SpiralNet with default configuration suitable for common mesh tasks.

SpiralNet(SpiralNetOptions, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>?, ILossFunction<T>?)

Initializes a new instance of the SpiralNet<T> class with specified options.

public SpiralNet(SpiralNetOptions options, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>? optimizer = null, ILossFunction<T>? lossFunction = null)

Parameters

options SpiralNetOptions: Configuration options for the SpiralNet.
optimizer IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>: The optimizer for training. Defaults to Adam if null.
lossFunction ILossFunction<T>: The loss function. Defaults based on task type if null.

Exceptions

ArgumentNullException: Thrown when options is null.

SpiralNet(int, int, int, ILossFunction<T>?)

Initializes a new instance of the SpiralNet<T> class with simple parameters.

public SpiralNet(int numClasses, int inputFeatures = 3, int spiralLength = 9, ILossFunction<T>? lossFunction = null)

Parameters

numClasses int: Number of output classes for classification.
inputFeatures int: Number of input features per vertex. Default is 3.
spiralLength int: Length of spiral sequences. Default is 9.
lossFunction ILossFunction<T>: The loss function. Defaults based on task type if null.

Properties

ConvChannels

Gets the channel configuration for spiral convolution layers.

public int[] ConvChannels { get; }

Property Value

int[]

InputFeatures

Gets the number of input features per vertex.

public int InputFeatures { get; }

Property Value

int

NumClasses

Gets the number of output classes for classification.

public int NumClasses { get; }

Property Value

int

SpiralLength

Gets the spiral sequence length.

public int SpiralLength { get; }

Property Value

int

Methods

Backward(Tensor<T>)

Performs a backward pass to compute gradients.

public Tensor<T> Backward(Tensor<T> lossGradient)

Parameters

lossGradient Tensor<T>: Gradient of the loss with respect to network output.

Returns

Tensor<T>: Gradient with respect to input.

CreateNewInstance()

Creates a new instance for cloning.

protected override IFullModel<T, Tensor<T>, Tensor<T>> CreateNewInstance()

Returns

IFullModel<T, Tensor<T>, Tensor<T>>: New SpiralNet instance.

Remarks

For Beginners: This creates a blank version of the same type of neural network.

It's used internally by methods like DeepCopy and Clone to create the right type of network before copying the data into it.

DeserializeNetworkSpecificData(BinaryReader)

Deserializes network-specific data.

protected override void DeserializeNetworkSpecificData(BinaryReader reader)

Parameters

reader BinaryReader: Binary reader.

Remarks

This method is called at the end of the general deserialization process to allow derived classes to read any additional data specific to their implementation.

For Beginners: Continuing the suitcase analogy, this is like unpacking that special compartment. After the main deserialization method has unpacked the common items (layers, parameters), this method allows each specific type of neural network to unpack its own unique items that were stored during serialization.

Forward(Tensor<T>)

Performs a forward pass through the network.

public Tensor<T> Forward(Tensor<T> input)

Parameters

input Tensor<T>: Vertex features tensor with shape [numVertices, InputFeatures].

Returns

Tensor<T>: Classification logits with shape [NumClasses].

Exceptions

InvalidOperationException: Thrown when spiral indices are not set.

GetModelMetadata()

Gets metadata about this model.

public override ModelMetadata<T> GetModelMetadata()

Returns

ModelMetadata<T>: Model metadata.

InitializeLayers()

Initializes the layers of the SpiralNet network.

protected override void InitializeLayers()

Remarks

If the architecture provides custom layers, those are used. Otherwise, default layers are created using CreateDefaultSpiralNetLayers(NeuralNetworkArchitecture<T>, int, int, int[]?, double[]?, int[]?, bool, double, bool).

Predict(Tensor<T>)

Generates predictions for the given input.

public override Tensor<T> Predict(Tensor<T> input)

Parameters

input Tensor<T>: Vertex features tensor.

Returns

Tensor<T>: Classification logits.

Remarks

For Beginners: This is the main method you'll use to get results from your trained neural network. You provide some input data (like an image or text), and the network processes it through all its layers to produce an output (like a classification or prediction).

PredictClass(Tensor<T>, int[,])

Predicts the class for a single mesh.

public int PredictClass(Tensor<T> meshFeatures, int[,] meshSpiralIndices)

Parameters

meshFeatures Tensor<T>: Vertex features tensor.
meshSpiralIndices int[,]: Spiral indices for the mesh.

Returns

int: Predicted class index.

PredictProbabilities(Tensor<T>, int[,])

Computes class probabilities for a single mesh using softmax.

public Vector<T> PredictProbabilities(Tensor<T> meshFeatures, int[,] meshSpiralIndices)

Parameters

meshFeatures Tensor<T>: Vertex features tensor.
meshSpiralIndices int[,]: Spiral indices for the mesh.

Returns

Vector<T>: Probability distribution over classes.

SerializeNetworkSpecificData(BinaryWriter)

Serializes network-specific data.

protected override void SerializeNetworkSpecificData(BinaryWriter writer)

Parameters

writer BinaryWriter: Binary writer.

Remarks

This method is called at the end of the general serialization process to allow derived classes to write any additional data specific to their implementation.

For Beginners: Think of this as packing a special compartment in your suitcase. While the main serialization method packs the common items (layers, parameters), this method allows each specific type of neural network to pack its own unique items that other networks might not have.

SetMultiResolutionSpiralIndices(List<int[,]>)

Sets spiral indices for multiple resolution levels (for hierarchical processing).

public void SetMultiResolutionSpiralIndices(List<int[,]> spiralIndicesPerLevel)

Parameters

spiralIndicesPerLevel List<int[,]>: List of spiral indices for each resolution level.

Exceptions

ArgumentNullException: Thrown when list is null.
ArgumentException: Thrown when list is empty.

SetSpiralIndices(int[,])

Sets the spiral indices for the current mesh being processed.

public void SetSpiralIndices(int[,] spiralIndices)

Parameters

spiralIndices int[,]: A 2D array of shape [numVertices, SpiralLength] containing neighbor vertex indices in spiral order for each vertex.

Remarks

For Beginners: Before processing a mesh, you must define how vertices are connected in spiral order. This method sets that connectivity.

Exceptions

ArgumentNullException: Thrown when spiralIndices is null.

Train(Tensor<T>, Tensor<T>)

Trains the network on a single batch.

public override void Train(Tensor<T> input, Tensor<T> expectedOutput)

Parameters

input Tensor<T>: Vertex features tensor.
expectedOutput Tensor<T>: Ground truth labels.

Remarks

This method performs one training step on the neural network using the provided input and expected output. It updates the network's parameters to reduce the error between the network's prediction and the expected output.

For Beginners: This is how your neural network learns. You provide: - An input (what the network should process) - The expected output (what the correct answer should be)

The network then:

Makes a prediction based on the input
Compares its prediction to the expected output
Calculates how wrong it was (the loss)
Adjusts its internal values to do better next time

After training, you can get the loss value using the GetLastLoss() method to see how well the network is learning.

Train(List<Tensor<T>>, List<int[,]>, List<int>, int, T)

Trains the network on mesh data.

public List<double> Train(List<Tensor<T>> meshFeatures, List<int[,]> spiralIndices, List<int> labels, int epochs, T learningRate)

Parameters

meshFeatures List<Tensor<T>>: List of vertex feature tensors for training meshes.
spiralIndices List<int[,]>: List of spiral indices for each training mesh.
labels List<int>: List of class labels for each mesh.
epochs int: Number of training epochs.
learningRate T: Learning rate for optimization.

Returns

List<double>: Training loss history.

Exceptions

ArgumentException: Thrown when input lists have mismatched lengths.

UpdateParameters(Vector<T>)

Updates network parameters using a flat parameter vector.

public override void UpdateParameters(Vector<T> parameters)

Parameters

parameters Vector<T>: Vector containing all parameters.

Remarks

For Beginners: During training, a neural network's internal values (parameters) get adjusted to improve its performance. This method allows you to update all those values at once by providing a complete set of new parameters.

This is typically used by optimization algorithms that calculate better parameter values based on training data.

UpdateParameters(T)

Updates network parameters using the optimizer.

public void UpdateParameters(T learningRate)

Parameters

learningRate T: Learning rate for parameter updates.

Table of Contents

Class SpiralNet<T>

Type Parameters

Remarks

Constructors

SpiralNet()

Remarks

SpiralNet(SpiralNetOptions, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>?, ILossFunction<T>?)

Parameters

Exceptions

SpiralNet(int, int, int, ILossFunction<T>?)

Parameters

Properties

ConvChannels

Property Value

InputFeatures

Property Value

NumClasses

Property Value

SpiralLength

Property Value

Methods

Backward(Tensor<T>)

Parameters

Returns

CreateNewInstance()

Returns

Remarks

DeserializeNetworkSpecificData(BinaryReader)

Parameters

Remarks

Forward(Tensor<T>)

Parameters

Returns

Exceptions

GetModelMetadata()

Returns

InitializeLayers()

Remarks

Predict(Tensor<T>)

Parameters

Returns

Remarks

PredictClass(Tensor<T>, int[,])

Parameters

Returns

PredictProbabilities(Tensor<T>, int[,])

Parameters

Returns

SerializeNetworkSpecificData(BinaryWriter)

Parameters

Remarks

SetMultiResolutionSpiralIndices(List<int[,]>)

Parameters

Exceptions

SetSpiralIndices(int[,])

Parameters

Remarks

Exceptions

Train(Tensor<T>, Tensor<T>)

Parameters

Remarks

Train(List<Tensor<T>>, List<int[,]>, List<int>, int, T)

Parameters

Returns

Exceptions

UpdateParameters(Vector<T>)

Parameters

Remarks

UpdateParameters(T)

Parameters