Class PhysicsInformedNeuralNetwork<T>

Namespace: AiDotNet.PhysicsInformed.PINNs

Assembly: AiDotNet.dll

Represents a Physics-Informed Neural Network (PINN) for solving PDEs.

public class PhysicsInformedNeuralNetwork<T> : NeuralNetworkBase<T>, INeuralNetworkModel<T>, INeuralNetwork<T>, IFullModel<T, Tensor<T>, Tensor<T>>, IModel<Tensor<T>, Tensor<T>, ModelMetadata<T>>, IModelSerializer, ICheckpointableModel, IParameterizable<T, Tensor<T>, Tensor<T>>, IFeatureAware, IFeatureImportance<T>, ICloneable<IFullModel<T, Tensor<T>, Tensor<T>>>, IGradientComputable<T, Tensor<T>, Tensor<T>>, IJitCompilable<T>, IInterpretableModel<T>, IInputGradientComputable<T>, IDisposable

Type Parameters

T: The numeric type used for calculations.

Inheritance: object

NeuralNetworkBase<T>

PhysicsInformedNeuralNetwork<T>

Implements: INeuralNetworkModel<T>

INeuralNetwork<T>

IFullModel<T, Tensor<T>, Tensor<T>>

IModel<Tensor<T>, Tensor<T>, ModelMetadata<T>>

IModelSerializer

ICheckpointableModel

IParameterizable<T, Tensor<T>, Tensor<T>>

IFeatureAware

IFeatureImportance<T>

ICloneable<IFullModel<T, Tensor<T>, Tensor<T>>>

IGradientComputable<T, Tensor<T>, Tensor<T>>

IJitCompilable<T>

IInterpretableModel<T>

IInputGradientComputable<T>

IDisposable

Derived: DomainDecompositionPINN<T>

MultiFidelityPINN<T>

Inherited Members: NeuralNetworkBase<T>.Layers

NeuralNetworkBase<T>.LayerCount

NeuralNetworkBase<T>.Architecture

NeuralNetworkBase<T>.NumOps

NeuralNetworkBase<T>.Engine

NeuralNetworkBase<T>._layerInputs

NeuralNetworkBase<T>._layerOutputs

NeuralNetworkBase<T>.Random

NeuralNetworkBase<T>.LossFunction

NeuralNetworkBase<T>.LastLoss

NeuralNetworkBase<T>.IsTrainingMode

NeuralNetworkBase<T>.SupportsGpuTraining

NeuralNetworkBase<T>.CanTrainOnGpu

NeuralNetworkBase<T>.GpuEngine

NeuralNetworkBase<T>.MaxGradNorm

NeuralNetworkBase<T>._mixedPrecisionContext

NeuralNetworkBase<T>._memoryManager

NeuralNetworkBase<T>.IsMemoryManagementEnabled

NeuralNetworkBase<T>.IsGradientCheckpointingEnabled

NeuralNetworkBase<T>.IsMixedPrecisionEnabled

NeuralNetworkBase<T>.ClipGradients(List<Tensor<T>>)

NeuralNetworkBase<T>.ClipGradient(Tensor<T>)

NeuralNetworkBase<T>.ClipGradient(Vector<T>)

NeuralNetworkBase<T>.GetParameters()

NeuralNetworkBase<T>.Backpropagate(Tensor<T>)

NeuralNetworkBase<T>.BackpropagateWithRecompute(Tensor<T>)

NeuralNetworkBase<T>.ForwardGpu(IGpuTensor<T>)

NeuralNetworkBase<T>.BackpropagateGpu(IGpuTensor<T>)

NeuralNetworkBase<T>.BackpropagateGpuDeferred(IGpuTensor<T>, GpuExecutionOptions)

NeuralNetworkBase<T>.UpdateParametersGpu(T, T, T)

NeuralNetworkBase<T>.UpdateParametersGpu(IGpuOptimizerConfig)

NeuralNetworkBase<T>.UpdateParametersGpuDeferred(IGpuOptimizerConfig, GpuExecutionOptions)

NeuralNetworkBase<T>.TrainBatchGpuDeferred(IGpuTensor<T>, IGpuTensor<T>, IGpuOptimizerConfig, GpuExecutionOptions)

NeuralNetworkBase<T>.TrainBatchGpuDeferredAsync(IGpuTensor<T>, IGpuTensor<T>, IGpuOptimizerConfig, GpuExecutionOptions, CancellationToken)

NeuralNetworkBase<T>.UploadWeightsToGpu()

NeuralNetworkBase<T>.DownloadWeightsFromGpu()

NeuralNetworkBase<T>.ZeroGradientsGpu()

NeuralNetworkBase<T>.ExtractSingleExample(Tensor<T>, int)

NeuralNetworkBase<T>.ForwardWithMemory(Tensor<T>)

NeuralNetworkBase<T>.ForwardWithCheckpointing(Tensor<T>)

NeuralNetworkBase<T>.CanUseGpuResidentPath()

NeuralNetworkBase<T>.TryForwardGpuOptimized(Tensor<T>, out Tensor<T>)

NeuralNetworkBase<T>.ForwardGpu(Tensor<T>)

NeuralNetworkBase<T>.ForwardDeferred(Tensor<T>)

NeuralNetworkBase<T>.ForwardDeferredAsync(Tensor<T>, CancellationToken)

NeuralNetworkBase<T>.BeginGpuExecution(GpuExecutionOptions)

NeuralNetworkBase<T>.ForwardWithGpuContext(Tensor<T>)

NeuralNetworkBase<T>.ForwardWithGpuContext(IGpuTensor<T>)

NeuralNetworkBase<T>.GetGpuMemoryStats()

NeuralNetworkBase<T>.ForwardWithFeatures(Tensor<T>, int[])

NeuralNetworkBase<T>.ParameterCount

NeuralNetworkBase<T>.GetParameterCount()

NeuralNetworkBase<T>.InvalidateParameterCountCache()

NeuralNetworkBase<T>.AddLayerToCollection(ILayer<T>)

NeuralNetworkBase<T>.RemoveLayerFromCollection(ILayer<T>)

NeuralNetworkBase<T>.ClearLayers()

NeuralNetworkBase<T>.ValidateCustomLayers(List<ILayer<T>>)

NeuralNetworkBase<T>.ValidateCustomLayersInternal(List<ILayer<T>>)

NeuralNetworkBase<T>.IsValidInputLayer(ILayer<T>)

NeuralNetworkBase<T>.IsValidOutputLayer(ILayer<T>)

NeuralNetworkBase<T>.AreLayersCompatible(ILayer<T>, ILayer<T>)

NeuralNetworkBase<T>.GetParameterGradients()

NeuralNetworkBase<T>.EnsureArchitectureInitialized()

NeuralNetworkBase<T>.SetTrainingMode(bool)

NeuralNetworkBase<T>.EnableMemoryManagement(TrainingMemoryConfig)

NeuralNetworkBase<T>.DisableMemoryManagement()

NeuralNetworkBase<T>.GetMemoryEstimate(int, int)

NeuralNetworkBase<T>.GetLastLoss()

NeuralNetworkBase<T>.ResetState()

NeuralNetworkBase<T>.BackwardWithInputGradient(Tensor<T>)

NeuralNetworkBase<T>.ComputeInputGradient(Vector<T>, Vector<T>)

NeuralNetworkBase<T>.ComputeInputGradient(Tensor<T>, Tensor<T>)

NeuralNetworkBase<T>.SaveModel(string)

NeuralNetworkBase<T>.LoadModel(string)

NeuralNetworkBase<T>.Serialize()

NeuralNetworkBase<T>.Deserialize(byte[])

NeuralNetworkBase<T>.WithParameters(Vector<T>)

NeuralNetworkBase<T>.GetActiveFeatureIndices()

NeuralNetworkBase<T>.IsFeatureUsed(int)

NeuralNetworkBase<T>.DeepCopy()

NeuralNetworkBase<T>.Clone()

NeuralNetworkBase<T>.SetActiveFeatureIndices(IEnumerable<int>)

NeuralNetworkBase<T>._enabledMethods

NeuralNetworkBase<T>._sensitiveFeatures

NeuralNetworkBase<T>._fairnessMetrics

NeuralNetworkBase<T>._baseModel

NeuralNetworkBase<T>.GetGlobalFeatureImportanceAsync()

NeuralNetworkBase<T>.GetLocalFeatureImportanceAsync(Tensor<T>)

NeuralNetworkBase<T>.GetShapValuesAsync(Tensor<T>)

NeuralNetworkBase<T>.GetLimeExplanationAsync(Tensor<T>, int)

NeuralNetworkBase<T>.GetPartialDependenceAsync(Vector<int>, int)

NeuralNetworkBase<T>.GetCounterfactualAsync(Tensor<T>, Tensor<T>, int)

NeuralNetworkBase<T>.GetModelSpecificInterpretabilityAsync()

NeuralNetworkBase<T>.GenerateTextExplanationAsync(Tensor<T>, Tensor<T>)

NeuralNetworkBase<T>.GetFeatureInteractionAsync(int, int)

NeuralNetworkBase<T>.ValidateFairnessAsync(Tensor<T>, int)

NeuralNetworkBase<T>.GetAnchorExplanationAsync(Tensor<T>, T)

NeuralNetworkBase<T>.SetBaseModel<TInput, TOutput>(IFullModel<T, TInput, TOutput>)

NeuralNetworkBase<T>.EnableMethod(params InterpretationMethod[])

NeuralNetworkBase<T>.ConfigureFairness(Vector<int>, params FairnessMetric[])

NeuralNetworkBase<T>.GetNamedLayerActivations(Tensor<T>)

NeuralNetworkBase<T>.GetArchitecture()

NeuralNetworkBase<T>.GetFeatureImportance()

NeuralNetworkBase<T>.SetParameters(Vector<T>)

NeuralNetworkBase<T>.AddLayer(LayerType, int, ActivationFunction)

NeuralNetworkBase<T>.AddConvolutionalLayer(int, int, int, ActivationFunction)

NeuralNetworkBase<T>.AddLSTMLayer(int, bool)

NeuralNetworkBase<T>.AddDropoutLayer(double)

NeuralNetworkBase<T>.AddBatchNormalizationLayer(int, double, double)

NeuralNetworkBase<T>.AddPoolingLayer(int[], PoolingType, int, int?)

NeuralNetworkBase<T>.GetGradients()

NeuralNetworkBase<T>.GetInputShape()

NeuralNetworkBase<T>.GetLayerActivations(Tensor<T>)

NeuralNetworkBase<T>.DefaultLossFunction

NeuralNetworkBase<T>.ComputeGradients(Tensor<T>, Tensor<T>, ILossFunction<T>)

NeuralNetworkBase<T>.ApplyGradients(Vector<T>, T)

NeuralNetworkBase<T>.SaveState(Stream)

NeuralNetworkBase<T>.LoadState(Stream)

NeuralNetworkBase<T>.Dispose()

NeuralNetworkBase<T>.Dispose(bool)

NeuralNetworkBase<T>.SupportsJitCompilation

NeuralNetworkBase<T>.ExportComputationGraph(List<ComputationNode<T>>)

NeuralNetworkBase<T>.ConvertLayerToGraph(ILayer<T>, ComputationNode<T>)

object.Equals(object)

object.Equals(object, object)

object.GetHashCode()

object.GetType()

object.MemberwiseClone()

object.ReferenceEquals(object, object)

object.ToString()

Extension Methods: DistributedExtensions.AsDistributedForHighBandwidth<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributedForLowBandwidth<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributed<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributed<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, IShardingConfiguration<T>)

Remarks

For Beginners: A Physics-Informed Neural Network (PINN) is a neural network that learns to solve Partial Differential Equations (PDEs) by incorporating physical laws directly into the training process.

Traditional Approach (Finite Elements/Differences):

Discretize the domain into a grid
Approximate derivatives using neighboring points
Solve a large system of equations
Works well but can be slow for complex geometries

PINN Approach:

Neural network represents the solution u(x,t)
Use automatic differentiation to compute ∂u/∂x, ∂²u/∂x², etc.
Train the network to minimize:
- PDE residual (how much the PDE is violated)
- Boundary condition errors
- Initial condition errors
- Data fitting errors (if measurements are available)

Key Advantages:

Meshless: No need to discretize the domain
Data-efficient: Can work with sparse or noisy data
Flexible: Easy to handle complex geometries and boundary conditions
Interpolation: Get solution at any point by evaluating the network
Inverse problems: Can discover unknown parameters in the PDE

Key Challenges:

Training can be difficult (multiple objectives to balance)
May require careful tuning of loss weights
Network architecture affects accuracy
Computational cost during training (many derivative evaluations)

Applications:

Fluid dynamics (Navier-Stokes equations)
Heat transfer
Structural mechanics
Quantum mechanics
Financial modeling (Black-Scholes PDE)
Climate and weather modeling

Historical Context: PINNs were introduced by Raissi, Perdikaris, and Karniadakis in 2019. They've revolutionized scientific machine learning by showing that deep learning can be guided by physics rather than just data.

Constructors

PhysicsInformedNeuralNetwork(NeuralNetworkArchitecture<T>, IPDESpecification<T>, IBoundaryCondition<T>[], IInitialCondition<T>?, int, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>?, double?, double?, double?, double?)

Initializes a new instance of the PINN class.

public PhysicsInformedNeuralNetwork(NeuralNetworkArchitecture<T> architecture, IPDESpecification<T> pdeSpecification, IBoundaryCondition<T>[] boundaryConditions, IInitialCondition<T>? initialCondition = null, int numCollocationPoints = 10000, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>? optimizer = null, double? dataWeight = null, double? pdeWeight = null, double? boundaryWeight = null, double? initialWeight = null)

Parameters

architecture NeuralNetworkArchitecture<T>: The neural network architecture (typically a deep feedforward network).
pdeSpecification IPDESpecification<T>: The PDE that the solution must satisfy.
boundaryConditions IBoundaryCondition<T>[]: Boundary conditions for the problem.
initialCondition IInitialCondition<T>: Initial condition for time-dependent problems (optional).
numCollocationPoints int: Number of points in the domain where to enforce the PDE.
optimizer IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>: Optimization algorithm (Adam is recommended for PINNs).
dataWeight double?: Weight for data loss component.
pdeWeight double?: Weight for PDE residual loss (often needs tuning).
boundaryWeight double?: Weight for boundary condition loss.
initialWeight double?: Weight for initial condition loss.

Remarks

For Beginners: When creating a PINN, you need to specify:

Network architecture: Usually a deep network (5-10 hidden layers, 20-50 neurons each)
- Activation: tanh or sin often work well for smooth solutions
- Input: spatial coordinates (x, y, z) and possibly time (t)
- Output: solution values u(x,t)
PDE specification: Defines the physics (e.g., Heat Equation, Navier-Stokes)
Boundary conditions: What happens at the edges of your domain
Collocation points: Where to enforce the PDE
- More points = better accuracy but slower training
- Typically 10,000-100,000 points
- Can use random sampling or quasi-random (Sobol, Latin hypercube)
Loss weights: Balance between different objectives
- Start with all weights = 1.0
- If PDE residual is large, increase pdeWeight
- If boundary conditions are violated, increase boundaryWeight
- This is often the trickiest part of PINN training!

Fields

_pdeSpecification

The PDE specification that defines the physics constraints. Protected to allow derived classes (e.g., MultiFidelityPINN) to evaluate residuals on custom solutions.

protected readonly IPDESpecification<T> _pdeSpecification

Field Value

IPDESpecification<T>

Properties

SupportsTraining

Indicates whether this PINN supports training.

public override bool SupportsTraining { get; }

Property Value

bool

Methods

CreateNewInstance()

Creates a new instance with the same configuration.

protected override IFullModel<T, Tensor<T>, Tensor<T>> CreateNewInstance()

Returns

IFullModel<T, Tensor<T>, Tensor<T>>: New PINN instance.

DeserializeNetworkSpecificData(BinaryReader)

Deserializes PINN-specific data.

protected override void DeserializeNetworkSpecificData(BinaryReader reader)

Parameters

reader BinaryReader: Binary reader.

EvaluatePDEResidual(T[])

Evaluates the PDE residual at a point (for validation).

public T EvaluatePDEResidual(T[] point)

Parameters

point T[]: The point coordinates.

Returns

T: The PDE residual (should be close to zero for a good solution).

Forward(Tensor<T>)

Performs a forward pass through the network.

public Tensor<T> Forward(Tensor<T> input)

Parameters

input Tensor<T>: Input tensor for evaluation.

Returns

Tensor<T>: Network output tensor.

GetModelMetadata()

Gets metadata about the PINN model.

public override ModelMetadata<T> GetModelMetadata()

Returns

ModelMetadata<T>: Model metadata.

GetSolution(T[])

Gets the solution at a specific point in the domain.

public T[] GetSolution(T[] point)

Parameters

point T[]: The point coordinates (x, y, t, etc.).

Returns

T[]: The solution value(s) at that point.

InitializeLayers()

Initializes the neural network layers.

protected override void InitializeLayers()

Predict(Tensor<T>)

Makes a prediction using the PINN.

public override Tensor<T> Predict(Tensor<T> input)

Parameters

input Tensor<T>: Input tensor.

Returns

Tensor<T>: Predicted output tensor.

SerializeNetworkSpecificData(BinaryWriter)

Serializes PINN-specific data.

protected override void SerializeNetworkSpecificData(BinaryWriter writer)

Parameters

writer BinaryWriter: Binary writer.

SetCollocationPoints(T[,])

Sets custom collocation points (for advanced users who want specific sampling).

public void SetCollocationPoints(T[,] points)

Parameters

points T[,]: Collocation points [numPoints, inputDim].

Solve(T[,]?, T[,]?, int, double, bool, int)

Solves the PDE by training the PINN using automatic differentiation.

public TrainingHistory<T> Solve(T[,]? dataInputs = null, T[,]? dataOutputs = null, int epochs = 10000, double learningRate = 0.001, bool verbose = true, int batchSize = 256)

Parameters

dataInputs T[,]: Optional measured input data.
dataOutputs T[,]: Optional measured output data.
epochs int: Number of training epochs.
learningRate double: Learning rate for optimization.
verbose bool: Whether to print progress.
batchSize int: Number of points per batch.

Returns

TrainingHistory<T>: Training history (losses over epochs).

Remarks

For Beginners: Training a PINN is like training a regular neural network, but with a special loss function.

Training Process:

Sample collocation points
For each point: a) Evaluate network: u = NN(x) b) Compute derivatives using automatic differentiation: ∂u/∂x, ∂²u/∂x², etc. c) Evaluate PDE residual: PDE(u, ∂u/∂x, ...)
Evaluate boundary and initial conditions
Compute total loss
Backpropagate and update network weights
Repeat

This implementation uses GradientTape-based automatic differentiation for computing spatial derivatives (∂u/∂x), which is more accurate than finite differences.

Tips for Success:

Start with simpler PDEs (heat, Poisson) before trying complex ones
Monitor individual loss components (data, PDE, BC, IC)
If one component dominates, adjust the weights
Learning rate scheduling can help
Sometimes training is unstable - try different architectures or optimizers

SumDerivatives(PDEDerivatives<T>, PDEDerivatives<T>)

Sums two sets of PDE derivatives element-wise. Used by derived classes (e.g., MultiFidelityPINN) to compute derivatives of combined solutions.

protected PDEDerivatives<T> SumDerivatives(PDEDerivatives<T> a, PDEDerivatives<T> b)

Parameters

a PDEDerivatives<T>: First set of derivatives.
b PDEDerivatives<T>: Second set of derivatives.

Returns

PDEDerivatives<T>: Combined derivatives where each element is the sum of corresponding elements.

Train(Tensor<T>, Tensor<T>)

Performs a basic supervised training step using MSE loss.

public override void Train(Tensor<T> input, Tensor<T> expectedOutput)

Parameters

input Tensor<T>: Training input tensor.
expectedOutput Tensor<T>: Expected output tensor.

UpdateParameters(Vector<T>)

Updates the network parameters from a flattened vector.

public override void UpdateParameters(Vector<T> parameters)

Parameters

parameters Vector<T>: Parameter vector.

Table of Contents

Class PhysicsInformedNeuralNetwork<T>

Type Parameters

Remarks

Constructors

PhysicsInformedNeuralNetwork(NeuralNetworkArchitecture<T>, IPDESpecification<T>, IBoundaryCondition<T>[], IInitialCondition<T>?, int, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>?, double?, double?, double?, double?)

Parameters

Remarks

Fields

_pdeSpecification

Field Value

Properties

SupportsTraining

Property Value

Methods

CreateNewInstance()

Returns

DeserializeNetworkSpecificData(BinaryReader)

Parameters

EvaluatePDEResidual(T[])

Parameters

Returns

Forward(Tensor<T>)

Parameters

Returns

GetModelMetadata()

Returns

GetSolution(T[])

Parameters

Returns

InitializeLayers()

Predict(Tensor<T>)

Parameters

Returns

SerializeNetworkSpecificData(BinaryWriter)

Parameters

SetCollocationPoints(T[,])

Parameters

Solve(T[,]?, T[,]?, int, double, bool, int)

Parameters

Returns

Remarks

SumDerivatives(PDEDerivatives<T>, PDEDerivatives<T>)

Parameters

Returns

Train(Tensor<T>, Tensor<T>)

Parameters

UpdateParameters(Vector<T>)

Parameters