Class GraphAttentionNetwork<T>

Namespace
AiDotNet.NeuralNetworks
Assembly
AiDotNet.dll

Represents a Graph Attention Network (GAT) that uses attention mechanisms to process graph-structured data.

public class GraphAttentionNetwork<T> : NeuralNetworkBase<T>, INeuralNetworkModel<T>, INeuralNetwork<T>, IFullModel<T, Tensor<T>, Tensor<T>>, IModel<Tensor<T>, Tensor<T>, ModelMetadata<T>>, IModelSerializer, ICheckpointableModel, IParameterizable<T, Tensor<T>, Tensor<T>>, IFeatureAware, IFeatureImportance<T>, ICloneable<IFullModel<T, Tensor<T>, Tensor<T>>>, IGradientComputable<T, Tensor<T>, Tensor<T>>, IJitCompilable<T>, IInterpretableModel<T>, IInputGradientComputable<T>, IDisposable

Type Parameters

T

The numeric type used for calculations, typically float or double.

Inheritance
object
NeuralNetworkBase<T>
GraphAttentionNetwork<T>

Implements
INeuralNetworkModel<T>
INeuralNetwork<T>
IFullModel<T, Tensor<T>, Tensor<T>>
IModel<Tensor<T>, Tensor<T>, ModelMetadata<T>>
IModelSerializer
ICheckpointableModel
IParameterizable<T, Tensor<T>, Tensor<T>>
IFeatureAware
IFeatureImportance<T>
ICloneable<IFullModel<T, Tensor<T>, Tensor<T>>>
IGradientComputable<T, Tensor<T>, Tensor<T>>
IJitCompilable<T>
IInterpretableModel<T>
IInputGradientComputable<T>
IDisposable

Remarks

Graph Attention Networks introduce attention mechanisms to graph neural networks, allowing the model to learn which neighbors are most important for each node. Unlike GCN, which combines neighbors with fixed, degree-based weights, GAT learns attention weights that determine how much each neighbor contributes to a node's representation.

For Beginners: GAT is like having a smart filter for your social network.

How it works:

  • Each node looks at its neighbors and decides which ones are most important
  • Important neighbors get more "attention" (higher weights)
  • Less relevant neighbors get less attention
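
How attention is computed, concretely: in the original GAT paper (Veličković et al., 2018), the weight that node i assigns to neighbor j is

  α_ij = softmax_j( LeakyReLU( aᵀ [W·h_i ‖ W·h_j] ) )

where h_i and h_j are the feature vectors of the two nodes, W is a learned projection matrix, a is a learned attention vector, and ‖ denotes concatenation. The softmax normalizes over all neighbors of i, so each node's attention weights sum to 1.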

Example - Movie Recommendations:

  • You're a node connected to movies you've watched
  • Some movies better represent your taste than others
  • GAT learns to pay more attention to movies that define your preferences
  • Result: Better recommendations by focusing on what matters most

Key Features:

  • Multi-head attention: Multiple attention "perspectives" for robustness
  • Dynamic weights: Attention weights are learned, not fixed
  • Dropout support: Prevents overfitting during training
  • Configurable heads: Adjust number of attention heads for your task
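
All four knobs are exposed through the constructor documented below. A minimal sketch (the architecture value is assumed to be built as in the constructor example):

// 3 GAT layers with 4 attention heads each and lighter dropout
var gat = new GraphAttentionNetwork<double>(
    architecture, numHeads: 4, numLayers: 3, dropoutRate: 0.5);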

Architecture: The standard GAT architecture consists of:

  1. Multiple GAT layers with attention mechanisms
  2. Optional dropout between layers
  3. Final classification or regression head

When to use GAT:

  • When some neighbors are more informative than others
  • When you need interpretable importance scores
  • For heterogeneous graphs where relationships vary in importance
  • Citation networks, social networks, knowledge graphs

Constructors

GraphAttentionNetwork(NeuralNetworkArchitecture<T>, int, int, double, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>?, ILossFunction<T>?, double)

Initializes a new instance of the GraphAttentionNetwork<T> class with specified architecture.

public GraphAttentionNetwork(NeuralNetworkArchitecture<T> architecture, int numHeads = 8, int numLayers = 2, double dropoutRate = 0.6, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>? optimizer = null, ILossFunction<T>? lossFunction = null, double maxGradNorm = 1)

Parameters

architecture NeuralNetworkArchitecture<T>

The neural network architecture defining the structure of the network.

numHeads int

Number of attention heads per layer (default: 8). Used only when creating default layers.

numLayers int

Number of GAT layers (default: 2). Used only when creating default layers.

dropoutRate double

Dropout rate for attention coefficients (default: 0.6).

optimizer IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>

Optional optimizer for training.

lossFunction ILossFunction<T>

Optional loss function for training.

maxGradNorm double

Maximum gradient norm for clipping (default: 1.0).

Remarks

For Beginners: Creating a GAT network:

// Create architecture for node classification
var architecture = new NeuralNetworkArchitecture<double>(
    InputType.OneDimensional,
    NeuralNetworkTaskType.MultiClassClassification,
    NetworkComplexity.Simple,
    inputSize: 1433,    // Cora has 1433 word features
    outputSize: 7);     // 7 paper categories

// Create GAT with default layers
var gat = new GraphAttentionNetwork<double>(architecture);

// Or create with custom layers by adding them to architecture
var gatCustom = new GraphAttentionNetwork<double>(architectureWithCustomLayers);

// Train on graph data
gat.TrainOnGraph(nodeFeatures, adjacencyMatrix, labels, epochs: 200);

Properties

DropoutRate

Gets the dropout rate applied to attention coefficients during training.

public double DropoutRate { get; }

Property Value

double

HiddenDim

Gets the hidden dimension size for each layer.

public int HiddenDim { get; }

Property Value

int

IsLoRAEnabled

Gets whether LoRA fine-tuning is currently enabled.

public bool IsLoRAEnabled { get; }

Property Value

bool

LoRARank

Gets the LoRA rank when LoRA is enabled.

public int LoRARank { get; }

Property Value

int

NumHeads

Gets the number of attention heads used in each GAT layer.

public int NumHeads { get; }

Property Value

int

NumLayers

Gets the number of GAT layers in the network.

public int NumLayers { get; }

Property Value

int
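
These properties are read-only snapshots of the configuration chosen at construction time. A quick way to inspect a network (assuming the gat instance from the constructor example):

Console.WriteLine($"{gat.NumLayers} layers, {gat.NumHeads} heads, hidden dim {gat.HiddenDim}, dropout {gat.DropoutRate}");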

Methods

Backward(Tensor<T>)

Performs a backward pass through the network to calculate gradients.

public Tensor<T> Backward(Tensor<T> outputGradient)

Parameters

outputGradient Tensor<T>

The gradient of the loss with respect to the network's output.

Returns

Tensor<T>

The gradient of the loss with respect to the network's input.

CreateNewInstance()

Creates a new instance of this network type for cloning or deserialization.

protected override IFullModel<T, Tensor<T>, Tensor<T>> CreateNewInstance()

Returns

IFullModel<T, Tensor<T>, Tensor<T>>

A new GraphAttentionNetwork instance.

DeserializeNetworkSpecificData(BinaryReader)

Deserializes network-specific data from a binary reader.

protected override void DeserializeNetworkSpecificData(BinaryReader reader)

Parameters

reader BinaryReader

The binary reader to deserialize from.

DisableLoRA()

Disables LoRA fine-tuning and restores original layers.

public void DisableLoRA()

Remarks

This removes the LoRA adapters and restores the original base layers. Any LoRA adaptations that were not merged will be lost.

EnableLoRAFineTuning(int, double, bool)

Enables LoRA (Low-Rank Adaptation) fine-tuning for parameter-efficient training.

public void EnableLoRAFineTuning(int rank = 8, double alpha = -1, bool freezeBaseLayers = true)

Parameters

rank int

The rank of the LoRA decomposition (default: 8).

alpha double

The LoRA scaling factor. Pass -1 (the default) to use the same value as rank.

freezeBaseLayers bool

Whether to freeze base layer parameters (default: true).

Remarks

For Beginners: LoRA allows you to fine-tune the GAT network with far fewer trainable parameters:

// Create and pre-train a GAT network
// (architecture is built as in the constructor example above)
var gat = new GraphAttentionNetwork<double>(architecture, numHeads: 8);
gat.TrainOnGraph(features, adjacency, labels, epochs: 200);

// Enable LoRA for efficient fine-tuning on new task
gat.EnableLoRAFineTuning(rank: 8, alpha: 16);

// Now only ~4% of parameters are trainable!
Console.WriteLine($"LoRA parameters: {gat.GetLoRAParameterCount()}");
Console.WriteLine($"Total parameters: {gat.GetParameterCount()}");

// Fine-tune on new data
gat.TrainOnGraph(newFeatures, newAdjacency, newLabels, epochs: 50);

// Optionally merge LoRA weights for deployment
gat.MergeLoRAWeights();
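
GetLoRATrainablePercentage() reports the trainable fraction directly, which is convenient for logging:

Console.WriteLine($"Trainable: {gat.GetLoRATrainablePercentage():F1}%");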

Evaluate(Tensor<T>, Tensor<T>, Tensor<T>, bool[])

Evaluates the model on test data and returns accuracy.

public double Evaluate(Tensor<T> nodeFeatures, Tensor<T> adjacencyMatrix, Tensor<T> labels, bool[] testMask)

Parameters

nodeFeatures Tensor<T>

Node feature tensor.

adjacencyMatrix Tensor<T>

Adjacency matrix.

labels Tensor<T>

Ground truth labels.

testMask bool[]

Boolean mask for test nodes.

Returns

double

Classification accuracy on test nodes.
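
Remarks

For Beginners: A minimal evaluation sketch on a held-out node split (the mask construction is illustrative; numNodes and the tensors are assumed from earlier examples):

// Hold out the last 200 nodes as the test set
var testMask = new bool[numNodes];
for (int i = numNodes - 200; i < numNodes; i++) testMask[i] = true;

double accuracy = gat.Evaluate(nodeFeatures, adjacencyMatrix, labels, testMask);
Console.WriteLine($"Test accuracy: {accuracy}");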

Forward(Tensor<T>, Tensor<T>)

Performs a forward pass through the network with node features and adjacency matrix.

public Tensor<T> Forward(Tensor<T> nodeFeatures, Tensor<T> adjacencyMatrix)

Parameters

nodeFeatures Tensor<T>

Node feature tensor of shape [batchSize, numNodes, inputFeatures] or [numNodes, inputFeatures].

adjacencyMatrix Tensor<T>

Adjacency matrix of shape [batchSize, numNodes, numNodes] or [numNodes, numNodes].

Returns

Tensor<T>

The output tensor after processing through all layers.
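
Remarks

A minimal sketch of a single forward pass (tensor construction omitted; shapes as documented above):

// nodeFeatures: [numNodes, inputFeatures], adjacencyMatrix: [numNodes, numNodes]
var output = gat.Forward(nodeFeatures, adjacencyMatrix);
// output has one row of scores per node, e.g. [numNodes, outputSize] for node classification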

GetAttentionWeights()

Gets attention weights from all GAT layers for interpretability.

public List<Tensor<T>?> GetAttentionWeights()

Returns

List<Tensor<T>?>

List of attention weight tensors (currently returns nulls as implementation is pending).

Remarks

Note: This method is a placeholder. Full attention coefficient retrieval requires exposing internal state from GraphAttentionLayer, which will be added in a future update.

GetLoRAParameterCount()

Gets the number of trainable LoRA parameters when LoRA is enabled.

public int GetLoRAParameterCount()

Returns

int

The count of LoRA parameters, or 0 if LoRA is not enabled.

GetLoRATrainablePercentage()

Gets the percentage of parameters that are trainable when using LoRA.

public double GetLoRATrainablePercentage()

Returns

double

The percentage of trainable parameters (0-100).

GetModelMetadata()

Gets metadata about this model for serialization and identification.

public override ModelMetadata<T> GetModelMetadata()

Returns

ModelMetadata<T>

Model metadata including type and configuration.

GetParameterCount()

Gets the total number of trainable parameters in the network.

public int GetParameterCount()

Returns

int

The total number of trainable parameters in the network.

GetParameters()

Gets all parameters as a vector.

public override Vector<T> GetParameters()

Returns

Vector<T>

A vector containing all trainable parameters of the network.

InitializeLayers()

Initializes the layers of the neural network based on the provided architecture.

protected override void InitializeLayers()

MergeLoRAWeights()

Merges LoRA weights into the base layers and disables LoRA mode.

public void MergeLoRAWeights()

Remarks

For Beginners: After fine-tuning with LoRA, you can "bake in" the learned adaptations to create a standard network for deployment:

  • Before merge: Forward pass requires computing both base and LoRA outputs
  • After merge: Single forward pass through merged layers (faster)

This is useful when deploying the fine-tuned model to production where you want maximum inference speed and don't need to track LoRA parameters separately.
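
A typical deployment sequence (a sketch using only methods documented on this page):

gat.MergeLoRAWeights();                       // bake the LoRA deltas into the base weights
var predictions = gat.Predict(nodeFeatures);  // now a single standard forward pass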

Predict(Tensor<T>)

Makes a prediction using the trained network.

public override Tensor<T> Predict(Tensor<T> input)

Parameters

input Tensor<T>

The input tensor containing node features.

Returns

Tensor<T>

The prediction tensor.

Remarks

For Beginners: This is the main method for using a trained GAT network. Pass in node features and get predictions back. For classification, the output will be class probabilities for each node. If no adjacency matrix has been set, a fully-connected adjacency matrix is generated for convenience. Note that this treats every node as connected to every other node, which can mask real graph structure; call SetAdjacencyMatrix(Tensor<T>) to supply the true graph.
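
A minimal sketch (tensors assumed from earlier examples):

gat.SetAdjacencyMatrix(adjacencyMatrix);      // supply the real graph structure first
var predictions = gat.Predict(nodeFeatures);  // per-node predictions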

SerializeNetworkSpecificData(BinaryWriter)

Serializes network-specific data to a binary writer.

protected override void SerializeNetworkSpecificData(BinaryWriter writer)

Parameters

writer BinaryWriter

The binary writer to serialize to.

SetAdjacencyMatrix(Tensor<T>)

Sets the adjacency matrix for graph operations.

public void SetAdjacencyMatrix(Tensor<T> adjacencyMatrix)

Parameters

adjacencyMatrix Tensor<T>

The adjacency matrix defining graph structure (shape [numNodes, numNodes]).

Train(Tensor<T>, Tensor<T>)

Trains the network on a single batch of data.

public override void Train(Tensor<T> input, Tensor<T> expectedOutput)

Parameters

input Tensor<T>

The input node features.

expectedOutput Tensor<T>

The expected output (labels).

Remarks

For Beginners: This method performs one training step. For full training, call TrainOnGraph which handles multiple epochs and adjacency matrix setup. If no adjacency matrix has been set, a fully-connected adjacency matrix is generated for convenience. This means every node is treated as connected to every other node, which can hide the true graph structure unless you provide an explicit adjacency matrix.
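
For example, a manual loop equivalent to what TrainOnGraph automates (a sketch; tensors assumed from earlier examples):

gat.SetAdjacencyMatrix(adjacencyMatrix);  // avoid the fully-connected fallback
for (int epoch = 0; epoch < 200; epoch++)
    gat.Train(nodeFeatures, labels);      // one gradient step per call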

TrainOnGraph(Tensor<T>, Tensor<T>, Tensor<T>, bool[]?, int, double)

Trains the GAT network on graph-structured data.

public void TrainOnGraph(Tensor<T> nodeFeatures, Tensor<T> adjacencyMatrix, Tensor<T> labels, bool[]? trainMask = null, int epochs = 200, double learningRate = 0.005)

Parameters

nodeFeatures Tensor<T>

Node feature tensor of shape [numNodes, inputFeatures].

adjacencyMatrix Tensor<T>

Adjacency matrix of shape [numNodes, numNodes].

labels Tensor<T>

Label tensor for supervised learning.

trainMask bool[]

Optional boolean mask indicating which nodes to train on.

epochs int

Number of training epochs (default: 200).

learningRate double

Learning rate for optimization (default: 0.005).
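
Remarks

A typical semi-supervised call that trains only on labeled nodes (the trainMask value is illustrative):

// trainMask[i] == true for nodes whose labels may be used in the loss
gat.TrainOnGraph(nodeFeatures, adjacencyMatrix, labels,
    trainMask: trainMask, epochs: 300, learningRate: 0.01);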

UpdateParameters(Vector<T>)

Updates the parameters of all layers in the network.

public override void UpdateParameters(Vector<T> parameters)

Parameters

parameters Vector<T>

A vector containing all parameters for the network.