Class SpectralNormalizationLayer<T>
- Namespace
- AiDotNet.NeuralNetworks.Layers
- Assembly
- AiDotNet.dll
Represents a spectral normalization layer that normalizes the weights of a layer by their spectral norm.
public class SpectralNormalizationLayer<T> : LayerBase<T>, ILayer<T>, IJitCompilable<T>, IDiagnosticsProvider, IWeightLoadable<T>, IDisposable
Type Parameters
T: The numeric type used for calculations, typically float or double.
- Inheritance
- LayerBase<T> → SpectralNormalizationLayer<T>
- Implements
- ILayer<T>, IJitCompilable<T>, IDiagnosticsProvider, IWeightLoadable<T>, IDisposable
- Inherited Members
Remarks
Spectral normalization is a weight normalization technique that constrains the Lipschitz constant of a neural network layer. It does this by dividing the weight matrix by its largest singular value (spectral norm). This technique is particularly effective for stabilizing GAN training.
For Beginners: Spectral normalization keeps layer weights from getting too large.
Key benefits:
- Stabilizes GAN training by preventing extreme weight values
- Ensures the discriminator doesn't become too powerful too quickly
- Helps prevent mode collapse in GANs
- Computationally efficient compared to other normalization methods
How it works:
- Computes the largest singular value of the weight matrix
- Divides all weights by this value
- Keeps weights normalized throughout training
Reference: Miyato et al., "Spectral Normalization for Generative Adversarial Networks" (2018)
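The normalization step can be pictured with a short power-iteration sketch. The standalone C# snippet below is illustrative only: it uses plain arrays and hypothetical names rather than this layer's internal API, but it mirrors how the largest singular value is estimated before the weights are divided by it.
using System;
static class SpectralNormSketch
{
    // Estimate the largest singular value (spectral norm) of W via power iteration.
    public static double EstimateSpectralNorm(double[,] w, int powerIterations = 1)
    {
        int rows = w.GetLength(0), cols = w.GetLength(1);
        var u = new double[rows];
        var v = new double[cols];
        var rng = new Random(0);
        for (int j = 0; j < cols; j++) v[j] = rng.NextDouble();
        Normalize(v);
        for (int it = 0; it < powerIterations; it++)
        {
            // u <- normalize(W v)
            for (int i = 0; i < rows; i++)
            {
                u[i] = 0.0;
                for (int j = 0; j < cols; j++) u[i] += w[i, j] * v[j];
            }
            Normalize(u);
            // v <- normalize(W^T u)
            for (int j = 0; j < cols; j++)
            {
                v[j] = 0.0;
                for (int i = 0; i < rows; i++) v[j] += w[i, j] * u[i];
            }
            Normalize(v);
        }
        // sigma ~= u^T W v; the layer divides every weight by this value.
        double sigma = 0.0;
        for (int i = 0; i < rows; i++)
            for (int j = 0; j < cols; j++)
                sigma += u[i] * w[i, j] * v[j];
        return sigma;
    }
    private static void Normalize(double[] x)
    {
        double norm = 0.0;
        for (int i = 0; i < x.Length; i++) norm += x[i] * x[i];
        norm = Math.Sqrt(norm) + 1e-12;
        for (int i = 0; i < x.Length; i++) x[i] /= norm;
    }
}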
Constructors
SpectralNormalizationLayer(ILayer<T>, int)
Initializes a new instance of the SpectralNormalizationLayer<T> class.
public SpectralNormalizationLayer(ILayer<T> innerLayer, int powerIterations = 1)
Parameters
innerLayer (ILayer<T>): The layer whose weights will be spectrally normalized.
powerIterations (int): The number of power iterations to perform. Default is 1.
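A minimal construction sketch follows. DenseLayer<float> and its constructor arguments are assumptions about the surrounding library used only for illustration; the SpectralNormalizationLayer<T> constructor itself matches the signature above.
// Wrap an existing layer so its weights are divided by their spectral norm.
// DenseLayer<float> and its parameters are hypothetical placeholders.
var dense = new DenseLayer<float>(inputSize: 128, outputSize: 64);
var specNorm = new SpectralNormalizationLayer<float>(dense, powerIterations: 1);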
Properties
SupportsGpuExecution
Gets a value indicating whether this layer supports GPU execution.
protected override bool SupportsGpuExecution { get; }
Property Value
- bool
SupportsGpuTraining
Gets a value indicating whether this layer supports GPU-resident training. Delegates to the inner layer's capability.
public override bool SupportsGpuTraining { get; }
Property Value
- bool
SupportsJitCompilation
Gets a value indicating whether this layer supports JIT compilation.
public override bool SupportsJitCompilation { get; }
Property Value
- bool
Remarks
JIT compilation is supported if the inner layer supports it. At JIT export time, the spectral normalization is applied to create normalized weights, which are then used in the exported computation graph for inference.
SupportsTraining
Gets a value indicating whether this layer supports training.
public override bool SupportsTraining { get; }
Property Value
- bool
Methods
Backward(Tensor<T>)
Performs backpropagation through the layer.
public override Tensor<T> Backward(Tensor<T> outputGradient)
Parameters
outputGradient (Tensor<T>)
Returns
- Tensor<T>
Remarks
Backpropagation uses the normalized weights (applied during Forward) to ensure gradients correspond to the actual weights used in the forward pass. After computing gradients, the original weights are restored.
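A hedged sketch of one training step follows, assuming the snippet lives in a method where a SpectralNormalizationLayer<float> instance named specNorm, an input tensor, and a learning rate are already in scope; ComputeLossGradient is a placeholder for your own loss code. Forward, Backward, and UpdateParameters match the members documented on this page.
// One training step (illustrative).
Tensor<float> output = specNorm.Forward(input);                // forward pass with normalized weights
Tensor<float> lossGradient = ComputeLossGradient(output);      // dLoss/dOutput from your loss function (placeholder)
Tensor<float> inputGradient = specNorm.Backward(lossGradient); // gradients w.r.t. the normalized weights; originals restored
specNorm.UpdateParameters(learningRate);                       // apply accumulated gradients to the inner layer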
BackwardGpu(IGpuTensor<T>)
GPU-resident backward pass for spectral normalization layer. Delegates to the inner layer's BackwardGpu method.
public override IGpuTensor<T> BackwardGpu(IGpuTensor<T> outputGradient)
Parameters
outputGradient (IGpuTensor<T>): The gradient of the loss with respect to the layer's output (GPU tensor).
Returns
- IGpuTensor<T>
The gradient of the loss with respect to the layer's input (GPU tensor).
ExportComputationGraph(List<ComputationNode<T>>)
Exports the computation graph for JIT compilation.
public override ComputationNode<T> ExportComputationGraph(List<ComputationNode<T>> inputNodes)
Parameters
inputNodes (List<ComputationNode<T>>): List to populate with input computation nodes.
Returns
- ComputationNode<T>
The output computation node representing the spectrally normalized layer.
Remarks
For JIT compilation, spectral normalization is applied at export time to produce normalized weights. These normalized weights are then used in the inner layer's computation graph. This approach is suitable for inference, where the weights are fixed after training.
Note: The exported computation graph uses a snapshot of the normalized weights at the time of export. If the underlying weights change, the graph must be re-exported to reflect those changes.
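A usage sketch under those assumptions might look like this (the float instantiation and the specNorm variable are illustrative, and the snippet assumes the usual System.Collections.Generic import):
if (specNorm.SupportsJitCompilation)
{
    var inputNodes = new List<ComputationNode<float>>();
    ComputationNode<float> outputNode = specNorm.ExportComputationGraph(inputNodes);
    // inputNodes has been populated with the graph's inputs; outputNode represents
    // the spectrally normalized layer's output. Re-export after any weight change,
    // because the graph holds a snapshot of the normalized weights.
}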
Forward(Tensor<T>)
Performs the forward pass through the layer with spectrally normalized weights.
public override Tensor<T> Forward(Tensor<T> input)
Parameters
input (Tensor<T>)
Returns
- Tensor<T>
ForwardGpu(params IGpuTensor<T>[])
Performs the forward pass using GPU-resident tensors with GPU-accelerated spectral normalization.
public override IGpuTensor<T> ForwardGpu(params IGpuTensor<T>[] inputs)
Parameters
inputs (IGpuTensor<T>[])
Returns
- IGpuTensor<T>
A GPU-resident output tensor.
Remarks
This method performs spectral normalization using GPU-accelerated power iteration, keeping all computations on GPU for maximum performance.
GetParameterGradients()
Gets the parameter gradients from the inner layer.
public override Vector<T> GetParameterGradients()
Returns
- Vector<T>
GetParameters()
Gets the parameters of the inner layer.
public override Vector<T> GetParameters()
Returns
- Vector<T>
ResetState()
Resets the internal state of the layer.
public override void ResetState()
SetParameters(Vector<T>)
Sets the parameters of the inner layer.
public override void SetParameters(Vector<T> parameters)
Parameters
parameters (Vector<T>)
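For example, GetParameters and SetParameters can be paired to snapshot and restore the inner layer's weights. This is a sketch only, assuming a float instantiation named specNorm is in scope.
// Snapshot the inner layer's parameters, then restore them later (e.g., for checkpointing).
Vector<float> snapshot = specNorm.GetParameters();
// ... continue training or experimenting ...
specNorm.SetParameters(snapshot);  // restore the saved weights on the inner layer
specNorm.ResetState();             // reset the layer's internal state before reuse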
UpdateParameters(T)
Updates the parameters of the inner layer.
public override void UpdateParameters(T learningRate)
Parameters
learningRate (T)
UpdateParametersGpu(IGpuOptimizerConfig)
GPU-resident parameter update using the provided optimizer configuration. Delegates to the inner layer's UpdateParametersGpu method.
public override void UpdateParametersGpu(IGpuOptimizerConfig config)
Parameters
config (IGpuOptimizerConfig): GPU optimizer configuration specifying the optimizer type and hyperparameters.