Class NeuralNetwork<T>

Namespace: AiDotNet.NeuralNetworks

Assembly: AiDotNet.dll

A neural network implementation that processes data through multiple layers to make predictions.

public class NeuralNetwork<T> : NeuralNetworkBase<T>, INeuralNetworkModel<T>, INeuralNetwork<T>, IFullModel<T, Tensor<T>, Tensor<T>>, IModel<Tensor<T>, Tensor<T>, ModelMetadata<T>>, IModelSerializer, ICheckpointableModel, IParameterizable<T, Tensor<T>, Tensor<T>>, IFeatureAware, IFeatureImportance<T>, ICloneable<IFullModel<T, Tensor<T>, Tensor<T>>>, IGradientComputable<T, Tensor<T>, Tensor<T>>, IJitCompilable<T>, IInterpretableModel<T>, IInputGradientComputable<T>, IDisposable

Type Parameters

T: The numeric type used for calculations (e.g., float, double)

Inheritance: object

NeuralNetworkBase<T>

NeuralNetwork<T>

Implements: INeuralNetworkModel<T>

INeuralNetwork<T>

IFullModel<T, Tensor<T>, Tensor<T>>

IModel<Tensor<T>, Tensor<T>, ModelMetadata<T>>

IModelSerializer

ICheckpointableModel

IParameterizable<T, Tensor<T>, Tensor<T>>

IFeatureAware

IFeatureImportance<T>

ICloneable<IFullModel<T, Tensor<T>, Tensor<T>>>

IGradientComputable<T, Tensor<T>, Tensor<T>>

IJitCompilable<T>

IInterpretableModel<T>

IInputGradientComputable<T>

IDisposable

Derived: BayesianNeuralNetwork<T>

MCDropoutNeuralNetwork<T>

Inherited Members: NeuralNetworkBase<T>.Layers

NeuralNetworkBase<T>.LayerCount

NeuralNetworkBase<T>.Architecture

NeuralNetworkBase<T>.NumOps

NeuralNetworkBase<T>.Engine

NeuralNetworkBase<T>._layerInputs

NeuralNetworkBase<T>._layerOutputs

NeuralNetworkBase<T>.Random

NeuralNetworkBase<T>.LossFunction

NeuralNetworkBase<T>.LastLoss

NeuralNetworkBase<T>.IsTrainingMode

NeuralNetworkBase<T>.SupportsGpuTraining

NeuralNetworkBase<T>.CanTrainOnGpu

NeuralNetworkBase<T>.GpuEngine

NeuralNetworkBase<T>.MaxGradNorm

NeuralNetworkBase<T>._mixedPrecisionContext

NeuralNetworkBase<T>._memoryManager

NeuralNetworkBase<T>.IsMemoryManagementEnabled

NeuralNetworkBase<T>.IsGradientCheckpointingEnabled

NeuralNetworkBase<T>.IsMixedPrecisionEnabled

NeuralNetworkBase<T>.ClipGradients(List<Tensor<T>>)

NeuralNetworkBase<T>.ClipGradient(Tensor<T>)

NeuralNetworkBase<T>.ClipGradient(Vector<T>)

NeuralNetworkBase<T>.GetParameters()

NeuralNetworkBase<T>.Backpropagate(Tensor<T>)

NeuralNetworkBase<T>.BackpropagateWithRecompute(Tensor<T>)

NeuralNetworkBase<T>.ForwardGpu(IGpuTensor<T>)

NeuralNetworkBase<T>.BackpropagateGpu(IGpuTensor<T>)

NeuralNetworkBase<T>.BackpropagateGpuDeferred(IGpuTensor<T>, GpuExecutionOptions)

NeuralNetworkBase<T>.UpdateParametersGpu(T, T, T)

NeuralNetworkBase<T>.UpdateParametersGpu(IGpuOptimizerConfig)

NeuralNetworkBase<T>.UpdateParametersGpuDeferred(IGpuOptimizerConfig, GpuExecutionOptions)

NeuralNetworkBase<T>.TrainBatchGpuDeferred(IGpuTensor<T>, IGpuTensor<T>, IGpuOptimizerConfig, GpuExecutionOptions)

NeuralNetworkBase<T>.TrainBatchGpuDeferredAsync(IGpuTensor<T>, IGpuTensor<T>, IGpuOptimizerConfig, GpuExecutionOptions, CancellationToken)

NeuralNetworkBase<T>.UploadWeightsToGpu()

NeuralNetworkBase<T>.DownloadWeightsFromGpu()

NeuralNetworkBase<T>.ZeroGradientsGpu()

NeuralNetworkBase<T>.ExtractSingleExample(Tensor<T>, int)

NeuralNetworkBase<T>.ForwardWithMemory(Tensor<T>)

NeuralNetworkBase<T>.ForwardWithCheckpointing(Tensor<T>)

NeuralNetworkBase<T>.CanUseGpuResidentPath()

NeuralNetworkBase<T>.TryForwardGpuOptimized(Tensor<T>, out Tensor<T>)

NeuralNetworkBase<T>.ForwardGpu(Tensor<T>)

NeuralNetworkBase<T>.ForwardDeferred(Tensor<T>)

NeuralNetworkBase<T>.ForwardDeferredAsync(Tensor<T>, CancellationToken)

NeuralNetworkBase<T>.BeginGpuExecution(GpuExecutionOptions)

NeuralNetworkBase<T>.ForwardWithGpuContext(Tensor<T>)

NeuralNetworkBase<T>.ForwardWithGpuContext(IGpuTensor<T>)

NeuralNetworkBase<T>.GetGpuMemoryStats()

NeuralNetworkBase<T>.ForwardWithFeatures(Tensor<T>, int[])

NeuralNetworkBase<T>.ParameterCount

NeuralNetworkBase<T>.GetParameterCount()

NeuralNetworkBase<T>.InvalidateParameterCountCache()

NeuralNetworkBase<T>.AddLayerToCollection(ILayer<T>)

NeuralNetworkBase<T>.RemoveLayerFromCollection(ILayer<T>)

NeuralNetworkBase<T>.ClearLayers()

NeuralNetworkBase<T>.ValidateCustomLayers(List<ILayer<T>>)

NeuralNetworkBase<T>.ValidateCustomLayersInternal(List<ILayer<T>>)

NeuralNetworkBase<T>.IsValidInputLayer(ILayer<T>)

NeuralNetworkBase<T>.IsValidOutputLayer(ILayer<T>)

NeuralNetworkBase<T>.AreLayersCompatible(ILayer<T>, ILayer<T>)

NeuralNetworkBase<T>.GetParameterGradients()

NeuralNetworkBase<T>.EnsureArchitectureInitialized()

NeuralNetworkBase<T>.SetTrainingMode(bool)

NeuralNetworkBase<T>.EnableMemoryManagement(TrainingMemoryConfig)

NeuralNetworkBase<T>.DisableMemoryManagement()

NeuralNetworkBase<T>.GetMemoryEstimate(int, int)

NeuralNetworkBase<T>.GetLastLoss()

NeuralNetworkBase<T>.ResetState()

NeuralNetworkBase<T>.BackwardWithInputGradient(Tensor<T>)

NeuralNetworkBase<T>.ComputeInputGradient(Vector<T>, Vector<T>)

NeuralNetworkBase<T>.ComputeInputGradient(Tensor<T>, Tensor<T>)

NeuralNetworkBase<T>.SaveModel(string)

NeuralNetworkBase<T>.LoadModel(string)

NeuralNetworkBase<T>.Serialize()

NeuralNetworkBase<T>.Deserialize(byte[])

NeuralNetworkBase<T>.WithParameters(Vector<T>)

NeuralNetworkBase<T>.GetActiveFeatureIndices()

NeuralNetworkBase<T>.IsFeatureUsed(int)

NeuralNetworkBase<T>.DeepCopy()

NeuralNetworkBase<T>.Clone()

NeuralNetworkBase<T>.SetActiveFeatureIndices(IEnumerable<int>)

NeuralNetworkBase<T>._enabledMethods

NeuralNetworkBase<T>._sensitiveFeatures

NeuralNetworkBase<T>._fairnessMetrics

NeuralNetworkBase<T>._baseModel

NeuralNetworkBase<T>.GetGlobalFeatureImportanceAsync()

NeuralNetworkBase<T>.GetLocalFeatureImportanceAsync(Tensor<T>)

NeuralNetworkBase<T>.GetShapValuesAsync(Tensor<T>)

NeuralNetworkBase<T>.GetLimeExplanationAsync(Tensor<T>, int)

NeuralNetworkBase<T>.GetPartialDependenceAsync(Vector<int>, int)

NeuralNetworkBase<T>.GetCounterfactualAsync(Tensor<T>, Tensor<T>, int)

NeuralNetworkBase<T>.GetModelSpecificInterpretabilityAsync()

NeuralNetworkBase<T>.GenerateTextExplanationAsync(Tensor<T>, Tensor<T>)

NeuralNetworkBase<T>.GetFeatureInteractionAsync(int, int)

NeuralNetworkBase<T>.ValidateFairnessAsync(Tensor<T>, int)

NeuralNetworkBase<T>.GetAnchorExplanationAsync(Tensor<T>, T)

NeuralNetworkBase<T>.SetBaseModel<TInput, TOutput>(IFullModel<T, TInput, TOutput>)

NeuralNetworkBase<T>.EnableMethod(params InterpretationMethod[])

NeuralNetworkBase<T>.ConfigureFairness(Vector<int>, params FairnessMetric[])

NeuralNetworkBase<T>.GetNamedLayerActivations(Tensor<T>)

NeuralNetworkBase<T>.GetArchitecture()

NeuralNetworkBase<T>.GetFeatureImportance()

NeuralNetworkBase<T>.SetParameters(Vector<T>)

NeuralNetworkBase<T>.AddLayer(LayerType, int, ActivationFunction)

NeuralNetworkBase<T>.AddConvolutionalLayer(int, int, int, ActivationFunction)

NeuralNetworkBase<T>.AddLSTMLayer(int, bool)

NeuralNetworkBase<T>.AddDropoutLayer(double)

NeuralNetworkBase<T>.AddBatchNormalizationLayer(int, double, double)

NeuralNetworkBase<T>.AddPoolingLayer(int[], PoolingType, int, int?)

NeuralNetworkBase<T>.GetGradients()

NeuralNetworkBase<T>.GetInputShape()

NeuralNetworkBase<T>.GetLayerActivations(Tensor<T>)

NeuralNetworkBase<T>.DefaultLossFunction

NeuralNetworkBase<T>.ComputeGradients(Tensor<T>, Tensor<T>, ILossFunction<T>)

NeuralNetworkBase<T>.ApplyGradients(Vector<T>, T)

NeuralNetworkBase<T>.SaveState(Stream)

NeuralNetworkBase<T>.LoadState(Stream)

NeuralNetworkBase<T>.Dispose()

NeuralNetworkBase<T>.Dispose(bool)

NeuralNetworkBase<T>.SupportsJitCompilation

NeuralNetworkBase<T>.ExportComputationGraph(List<ComputationNode<T>>)

NeuralNetworkBase<T>.ConvertLayerToGraph(ILayer<T>, ComputationNode<T>)

object.Equals(object)

object.Equals(object, object)

object.GetHashCode()

object.GetType()

object.MemberwiseClone()

object.ReferenceEquals(object, object)

object.ToString()

Extension Methods: DistributedExtensions.AsDistributedForHighBandwidth<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributedForLowBandwidth<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributed<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributed<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, IShardingConfiguration<T>)

Remarks

Neural networks are computing systems inspired by the human brain. They consist of multiple layers of interconnected nodes (neurons) that process input data to produce predictions. This class provides a straightforward implementation that can be used for various machine learning tasks.

For Beginners: A neural network is like a brain-inspired system that learns from examples.

Think of a neural network as an assembly line for information:

Input data enters the "factory" (like features of an image or text)
It passes through several processing stations (layers of neurons)
Each station transforms the information in specific ways
Finally, it produces an output (like a prediction or classification)

For example, if you want to classify images of cats and dogs:

The input would be the pixel values of the image
The neural network processes these values through its layers
Each layer learns to recognize different patterns (edges, shapes, textures, etc.)
The output tells you the probability of the image containing a cat or dog
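
One common way to turn a network's raw output scores into probabilities is the softmax function. The sketch below is illustrative only: the scores are made up, plain arrays stand in for tensors, and nothing here is taken from AiDotNet's implementation.

```csharp
using System;
using System.Linq;

// Softmax: exponentiate each score and normalize so the results sum to 1.
static double[] Softmax(double[] scores)
{
    double max = scores.Max(); // subtracting the max avoids overflow in Math.Exp
    double[] exps = scores.Select(s => Math.Exp(s - max)).ToArray();
    double sum = exps.Sum();
    return exps.Select(e => e / sum).ToArray();
}

// Hypothetical raw scores for the classes [cat, dog].
double[] rawOutputs = { 2.0, 0.0 };
double[] probabilities = Softmax(rawOutputs);

// Roughly 0.881 for cat and 0.119 for dog with these scores.
Console.WriteLine($"cat: {probabilities[0]:F3}, dog: {probabilities[1]:F3}");
```

The larger score always receives the larger probability, and the two values sum to 1.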

The network "learns" by adjusting its internal parameters based on examples, gradually improving its predictions through a process called training.

Constructors

NeuralNetwork(NeuralNetworkArchitecture<T>, ILossFunction<T>?)

Creates a new neural network with the specified architecture.

public NeuralNetwork(NeuralNetworkArchitecture<T> architecture, ILossFunction<T>? lossFunction = null)

Parameters

architecture NeuralNetworkArchitecture<T>: The architecture defining the structure and configuration of the neural network
lossFunction ILossFunction<T>?: The loss function used to measure prediction error during training. Optional; defaults to null.

Remarks

The architecture determines important aspects of the neural network such as:

The number and types of layers
The number of neurons in each layer
The activation functions used
Other configuration parameters

The constructor automatically initializes the layers based on the provided architecture.

For Beginners: This creates a new neural network with your desired structure.

When creating a neural network, you need to define its "architecture" - the blueprint that specifies:

How many inputs it will accept (like the number of features in your data)
How many layers it has (more layers can learn more complex patterns)
How many neurons are in each layer (more neurons can capture more information)
What activation functions to use (these add non-linearity, allowing the network to learn complex patterns)

Think of it like designing a building - you're establishing the foundation and framework before you start "training" it (like furnishing the rooms).

For example, a simple network for classifying handwritten digits might have:

784 inputs (for a 28x28 pixel image)
2 hidden layers with 128 neurons each
10 outputs (one for each digit 0-9)
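
To get a feel for the size of such a network, the following sketch counts its trainable parameters. It assumes every layer is fully connected and has one bias per neuron, which the example above does not state; the array name `layerSizes` is hypothetical.

```csharp
using System;

// Layer sizes from the digit example: 784 inputs, two hidden layers of 128, 10 outputs.
// Assumption: fully connected layers with one bias per neuron.
int[] layerSizes = { 784, 128, 128, 10 };

long total = 0;
for (int i = 1; i < layerSizes.Length; i++)
{
    long weights = (long)layerSizes[i - 1] * layerSizes[i]; // one weight per connection
    long biases = layerSizes[i];                            // one bias per neuron
    total += weights + biases;
    Console.WriteLine($"Layer {i}: {weights} weights + {biases} biases");
}

// 100480 + 16512 + 1290 = 118282
Console.WriteLine($"Total trainable parameters: {total}");
```

Under those assumptions the first hidden layer alone accounts for most of the parameters, because it connects to all 784 inputs.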

Properties

SupportsTraining

Indicates whether this network supports training (learning from data).

public override bool SupportsTraining { get; }

Property Value

bool

Remarks

A neural network is considered trainable when at least one layer supports training.

Methods

CreateNewInstance()

Creates a new instance of the neural network with the same architecture.

protected override IFullModel<T, Tensor<T>, Tensor<T>> CreateNewInstance()

Returns

IFullModel<T, Tensor<T>, Tensor<T>>: A new instance of the neural network.

Remarks

This method creates a new neural network with the same architecture as the current instance. The new instance is initialized with fresh layers and parameters, making it useful for creating multiple networks with the same structure or for resetting a network while preserving its architecture.

For Beginners: This creates a brand new neural network with the same structure.

This is useful when you want to:

Start over with a fresh network but keep the same structure
Create multiple networks with identical layouts
Reset a network to its initial state

The new network will have:

The same number of layers and neurons
The same activation functions
Newly initialized weights and biases

Think of it like creating a twin of your neural network, but with a "blank slate" - it has the same structure but hasn't learned anything yet.

DeserializeNetworkSpecificData(BinaryReader)

Deserializes neural network-specific data from a binary reader.

protected override void DeserializeNetworkSpecificData(BinaryReader reader)

Parameters

reader BinaryReader: The binary reader to read from.

Remarks

This method loads any neural network-specific data from the binary stream. In this implementation, there is no additional data beyond what the base class deserializes, but this method could be extended for specialized neural network types.

For Beginners: This method loads neural network-specific information.

When loading a neural network from a file:

The base class already loads the basic structure and weights
This method loads any additional information specific to this type of network

For a standard neural network, there's typically no additional information needed beyond what the base class already loads.

GetModelMetadata()

Gets metadata about the neural network.

public override ModelMetadata<T> GetModelMetadata()

Returns

ModelMetadata<T>: A ModelMetadata<T> object containing information about the neural network.

Remarks

This method returns comprehensive metadata about the neural network, including its architecture, layer configuration, and other relevant parameters. This information is useful for model management, tracking experiments, and reporting.

For Beginners: This provides detailed information about your neural network.

The metadata includes:

The type of neural network
Details about its structure (layers, neurons, etc.)
The total number of parameters (weights and biases)
Additional configuration information

This information is useful for:

Documentation
Comparing different network architectures
Debugging and analyzing network behavior
Creating reports or visualizations

InitializeLayers()

Initializes the layers of the neural network based on the architecture.

protected override void InitializeLayers()

Remarks

This method sets up the neural network's structure by either:

1. Using custom layers provided in the architecture, or
2. Creating default layers if none were specified

The layers determine how data flows through the network and how computations are performed.

For Beginners: This method sets up the building blocks of your neural network.

Think of this as assembling the components of your network:

If you've specified exactly what layers you want, those are used
If not, standard layers are created based on your architecture settings

Layers are the key processing units in a neural network. Common types include:

Input Layer: Receives your data
Hidden Layers: Process the information, extracting patterns
Output Layer: Produces the final prediction

Each layer contains neurons that apply mathematical operations and activation functions to transform the data as it flows through the network.
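
The "custom layers if provided, otherwise defaults" decision can be sketched as follows. This is a hypothetical stand-in, not the library's code: layers are represented as descriptive strings, and `ResolveLayers`, its parameters, and the choice of ReLU for the default hidden layers are all assumptions made for illustration.

```csharp
using System;
using System.Collections.Generic;

// Returns the custom layers when any were supplied; otherwise builds a default stack.
static List<string> ResolveLayers(List<string>? customLayers, int inputSize, int[] hiddenSizes, int outputSize)
{
    if (customLayers != null && customLayers.Count > 0)
        return new List<string>(customLayers); // use exactly what the caller specified

    var layers = new List<string>();
    int previous = inputSize;
    foreach (int size in hiddenSizes)
    {
        layers.Add($"Dense({previous} -> {size}, ReLU)"); // assumed default hidden layer
        previous = size;
    }
    layers.Add($"Dense({previous} -> {outputSize})"); // output layer
    return layers;
}

List<string> defaults = ResolveLayers(null, 784, new[] { 128, 128 }, 10);
List<string> custom = ResolveLayers(new List<string> { "Conv", "Dense" }, 784, new[] { 128 }, 10);

Console.WriteLine(string.Join(" | ", defaults));
Console.WriteLine(string.Join(" | ", custom));
```

Each default layer's input size is taken from the previous layer's output size, which is what keeps adjacent layers compatible.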

Predict(Tensor<T>)

Makes a prediction using the neural network.

public override Tensor<T> Predict(Tensor<T> input)

Parameters

input Tensor<T>: The input tensor to process.

Returns

Tensor<T>: The output tensor after processing through all layers.

Remarks

This method performs a forward pass through all layers of the neural network without updating any internal states. It's used for making predictions on new data after the network has been trained.

For Beginners: This method takes your input data and gives you the network's prediction.

When making a prediction:

Your input data (like an image or set of features) enters the network
It passes through each layer, being transformed along the way
Each neuron applies its weights, bias, and activation function
The final layer produces the output (like a classification or regression value)

This is the main method you'll use when applying your trained network to new data. For example, if you've trained a network to recognize handwritten digits, you would use this method to classify new digit images.
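
The layer-by-layer flow described above can be sketched with plain arrays. This is a minimal illustration under stated assumptions, not AiDotNet's implementation: `DenseForward` is a hypothetical helper, `double[]` stands in for `Tensor<T>`, and the weights are made-up values.

```csharp
using System;

// One fully connected layer: output[j] = activation(bias[j] + sum_i weights[j, i] * input[i]).
static double[] DenseForward(double[,] weights, double[] biases, double[] input, Func<double, double> activation)
{
    var result = new double[biases.Length];
    for (int j = 0; j < biases.Length; j++)
    {
        double sum = biases[j];
        for (int i = 0; i < input.Length; i++)
            sum += weights[j, i] * input[i];
        result[j] = activation(sum);
    }
    return result;
}

Func<double, double> relu = x => Math.Max(0.0, x);
Func<double, double> identity = x => x;

// Hypothetical network: 2 inputs -> 2 hidden neurons (ReLU) -> 1 output (identity).
double[,] w1 = { { 1.0, -1.0 }, { 0.5, 0.5 } };
double[] b1 = { 0.0, -1.0 };
double[,] w2 = { { 2.0, 1.0 } };
double[] b2 = { 0.5 };

double[] input = { 3.0, 1.0 };
double[] hidden = DenseForward(w1, b1, input, relu);      // hidden is [2, 1]
double[] output = DenseForward(w2, b2, hidden, identity); // output[0] is 5.5

Console.WriteLine(output[0]);
```

Nothing is modified during this pass: the same input always produces the same output until the weights change.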

SerializeNetworkSpecificData(BinaryWriter)

Serializes neural network-specific data to a binary writer.

protected override void SerializeNetworkSpecificData(BinaryWriter writer)

Parameters

writer BinaryWriter: The binary writer to write to.

Remarks

This method saves any neural network-specific data to the binary stream. In this implementation, there is no additional data beyond what the base class serializes, but this method could be extended for specialized neural network types.

For Beginners: This method saves neural network-specific information.

When saving a neural network to a file:

The base class already saves the basic structure and weights
This method saves any additional information specific to this type of network

For a standard neural network, there's typically no additional information needed beyond what the base class already saves.
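
The relationship between the base class and this hook, together with its DeserializeNetworkSpecificData counterpart, can be sketched with the standard .NET BinaryWriter and BinaryReader types. The classes `NetworkSketchBase` and `DropoutNetworkSketch` and the `DropoutRate` field are hypothetical and exist only for this illustration; the actual base class stores far more than a weight array.

```csharp
using System;
using System.IO;

var original = new DropoutNetworkSketch { Weights = new[] { 0.1, 0.2 }, DropoutRate = 0.25 };
byte[] bytes = original.Serialize();

var restored = new DropoutNetworkSketch();
restored.Deserialize(bytes);
Console.WriteLine($"{restored.Weights.Length} weights, dropout rate {restored.DropoutRate}");

// Hypothetical base class: writes the shared data, then calls the hook.
class NetworkSketchBase
{
    public double[] Weights { get; set; } = Array.Empty<double>();

    public byte[] Serialize()
    {
        using var stream = new MemoryStream();
        using (var writer = new BinaryWriter(stream))
        {
            writer.Write(Weights.Length);
            foreach (double w in Weights) writer.Write(w);
            SerializeNetworkSpecificData(writer); // derived types append their own data
        }
        return stream.ToArray();
    }

    public void Deserialize(byte[] data)
    {
        using var reader = new BinaryReader(new MemoryStream(data));
        int count = reader.ReadInt32();
        Weights = new double[count];
        for (int i = 0; i < count; i++) Weights[i] = reader.ReadDouble();
        DeserializeNetworkSpecificData(reader); // must read in the order the data was written
    }

    // A standard network has nothing extra to save, so the default hooks do nothing.
    protected virtual void SerializeNetworkSpecificData(BinaryWriter writer) { }
    protected virtual void DeserializeNetworkSpecificData(BinaryReader reader) { }
}

// Hypothetical specialized network with one extra setting to persist.
class DropoutNetworkSketch : NetworkSketchBase
{
    public double DropoutRate { get; set; }

    protected override void SerializeNetworkSpecificData(BinaryWriter writer) => writer.Write(DropoutRate);
    protected override void DeserializeNetworkSpecificData(BinaryReader reader) => DropoutRate = reader.ReadDouble();
}
```

The important design point is symmetry: whatever the serialize hook writes, the deserialize hook must read back in the same order and with the same types.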

Train(Tensor<T>, Tensor<T>)

Trains the neural network on input-output pairs.

public override void Train(Tensor<T> input, Tensor<T> expectedOutput)

Parameters

input Tensor<T>: The input tensor for training.
expectedOutput Tensor<T>: The expected output tensor.

Remarks

This method performs one step of training on a single input-output pair or batch. It computes the forward pass, calculates the error, and backpropagates to update the network's parameters. For full training, this method should be called repeatedly with different inputs from the training dataset.

For Beginners: This method teaches the network to make better predictions.

The training process works like this:

Input data is fed into the network
The network makes a prediction (forward pass)
The prediction is compared to the expected output to calculate error
The error is propagated backward through the network (backpropagation)
The network's parameters are adjusted to reduce the error

Think of it like learning from mistakes:

The network makes a guess
It sees how far off it was
It adjusts its approach to do better next time

This method performs one iteration of this process. To fully train a network, you'd typically call this method many times with different examples from your training data.
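
The forward pass, error calculation, and parameter update can be sketched for the smallest possible case: a single neuron trained with squared error and plain gradient descent. This is an assumption-laden illustration, not what Train does internally; `TrainStep`, the learning rate, and the data point are all hypothetical.

```csharp
using System;

// Illustrative only: one neuron computing prediction = w * x + b.
double w = 0.0, b = 0.0;
const double learningRate = 0.1; // hypothetical value

// One training step: forward pass, error, gradients, parameter update.
double TrainStep(double x, double target)
{
    double prediction = w * x + b;       // forward pass
    double error = prediction - target;  // how far off the guess was
    double loss = error * error;         // squared error
    w -= learningRate * 2.0 * error * x; // d(loss)/dw = 2 * error * x
    b -= learningRate * 2.0 * error;     // d(loss)/db = 2 * error
    return loss;
}

// Repeating the step on the same example shrinks the loss each time.
double firstLoss = TrainStep(1.0, 2.0);
double lastLoss = firstLoss;
for (int step = 1; step < 50; step++)
    lastLoss = TrainStep(1.0, 2.0);

Console.WriteLine($"first loss: {firstLoss}, last loss: {lastLoss}");
```

Here the first loss is 4 and the error shrinks by a constant factor on every step. Real training cycles through many different examples rather than repeating one.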

UpdateParameters(Vector<T>)

Updates the parameters (weights and biases) of the neural network.

public override void UpdateParameters(Vector<T> parameters)

Parameters

parameters Vector<T>: A vector containing all parameters for the entire network

Remarks

This method distributes the provided parameters to each layer of the neural network. It's typically used during training when an optimization algorithm has calculated new parameter values to improve the network's performance.

The parameters vector must contain values for all trainable parameters in the network, arranged in the same order as the layers.

For Beginners: This method updates the internal values that the network has learned.

Neural networks learn by adjusting their "parameters" (weights and biases):

Weights determine how strongly neurons are connected to each other
Biases allow neurons to activate more or less easily

During training, the network figures out what parameters work best by:

Making predictions on training examples
Comparing predictions to correct answers
Calculating how to change parameters to improve accuracy
Using this method to update those parameters

Think of it like adjusting the settings on a complex machine to improve its performance. This method takes a long list of new parameter values and distributes them to the right places throughout the network.
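
Distributing one flat parameter vector across layers amounts to slicing it at running offsets. The sketch below assumes each layer knows how many parameters it owns; the names `layerParameterCounts` and `perLayer` are hypothetical, and plain arrays stand in for `Vector<T>`.

```csharp
using System;

// Hypothetical per-layer parameter counts, in layer order.
int[] layerParameterCounts = { 6, 3 };
double[] flat = { 1, 2, 3, 4, 5, 6, 7, 8, 9 };

// The flat vector must cover every trainable parameter exactly once.
int expected = 0;
foreach (int count in layerParameterCounts) expected += count;
if (flat.Length != expected)
    throw new ArgumentException($"Expected {expected} parameters but received {flat.Length}.");

// Hand each layer its own contiguous slice, advancing the offset as we go.
var perLayer = new double[layerParameterCounts.Length][];
int offset = 0;
for (int layer = 0; layer < layerParameterCounts.Length; layer++)
{
    int count = layerParameterCounts[layer];
    perLayer[layer] = new double[count];
    Array.Copy(flat, offset, perLayer[layer], 0, count);
    offset += count;
}

Console.WriteLine(string.Join(", ", perLayer[1])); // 7, 8, 9
```

Because slices are assigned purely by position, the vector has to follow the same layer order that was used when the parameters were collected.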
