Class CycleGAN<T>

Namespace: AiDotNet.NeuralNetworks

Assembly: AiDotNet.dll

Represents a CycleGAN for unpaired image-to-image translation.

public class CycleGAN<T> : NeuralNetworkBase<T>, INeuralNetworkModel<T>, INeuralNetwork<T>, IFullModel<T, Tensor<T>, Tensor<T>>, IModel<Tensor<T>, Tensor<T>, ModelMetadata<T>>, IModelSerializer, ICheckpointableModel, IParameterizable<T, Tensor<T>, Tensor<T>>, IFeatureAware, IFeatureImportance<T>, ICloneable<IFullModel<T, Tensor<T>, Tensor<T>>>, IGradientComputable<T, Tensor<T>, Tensor<T>>, IJitCompilable<T>, IInterpretableModel<T>, IInputGradientComputable<T>, IDisposable

Type Parameters

T: The numeric type.

Inheritance: object

NeuralNetworkBase<T>

CycleGAN<T>

Implements: INeuralNetworkModel<T>

INeuralNetwork<T>

IFullModel<T, Tensor<T>, Tensor<T>>

IModel<Tensor<T>, Tensor<T>, ModelMetadata<T>>

IModelSerializer

ICheckpointableModel

IParameterizable<T, Tensor<T>, Tensor<T>>

IFeatureAware

IFeatureImportance<T>

ICloneable<IFullModel<T, Tensor<T>, Tensor<T>>>

IGradientComputable<T, Tensor<T>, Tensor<T>>

IJitCompilable<T>

IInterpretableModel<T>

IInputGradientComputable<T>

IDisposable

Inherited Members: NeuralNetworkBase<T>.Layers

NeuralNetworkBase<T>.LayerCount

NeuralNetworkBase<T>.Architecture

NeuralNetworkBase<T>.NumOps

NeuralNetworkBase<T>.Engine

NeuralNetworkBase<T>._layerInputs

NeuralNetworkBase<T>._layerOutputs

NeuralNetworkBase<T>.Random

NeuralNetworkBase<T>.LossFunction

NeuralNetworkBase<T>.LastLoss

NeuralNetworkBase<T>.IsTrainingMode

NeuralNetworkBase<T>.SupportsTraining

NeuralNetworkBase<T>.SupportsGpuTraining

NeuralNetworkBase<T>.CanTrainOnGpu

NeuralNetworkBase<T>.GpuEngine

NeuralNetworkBase<T>.MaxGradNorm

NeuralNetworkBase<T>._mixedPrecisionContext

NeuralNetworkBase<T>._memoryManager

NeuralNetworkBase<T>.IsMemoryManagementEnabled

NeuralNetworkBase<T>.IsGradientCheckpointingEnabled

NeuralNetworkBase<T>.IsMixedPrecisionEnabled

NeuralNetworkBase<T>.ClipGradients(List<Tensor<T>>)

NeuralNetworkBase<T>.ClipGradient(Tensor<T>)

NeuralNetworkBase<T>.ClipGradient(Vector<T>)

NeuralNetworkBase<T>.GetParameters()

NeuralNetworkBase<T>.Backpropagate(Tensor<T>)

NeuralNetworkBase<T>.BackpropagateWithRecompute(Tensor<T>)

NeuralNetworkBase<T>.ForwardGpu(IGpuTensor<T>)

NeuralNetworkBase<T>.BackpropagateGpu(IGpuTensor<T>)

NeuralNetworkBase<T>.BackpropagateGpuDeferred(IGpuTensor<T>, GpuExecutionOptions)

NeuralNetworkBase<T>.UpdateParametersGpu(T, T, T)

NeuralNetworkBase<T>.UpdateParametersGpu(IGpuOptimizerConfig)

NeuralNetworkBase<T>.UpdateParametersGpuDeferred(IGpuOptimizerConfig, GpuExecutionOptions)

NeuralNetworkBase<T>.TrainBatchGpuDeferred(IGpuTensor<T>, IGpuTensor<T>, IGpuOptimizerConfig, GpuExecutionOptions)

NeuralNetworkBase<T>.TrainBatchGpuDeferredAsync(IGpuTensor<T>, IGpuTensor<T>, IGpuOptimizerConfig, GpuExecutionOptions, CancellationToken)

NeuralNetworkBase<T>.UploadWeightsToGpu()

NeuralNetworkBase<T>.DownloadWeightsFromGpu()

NeuralNetworkBase<T>.ZeroGradientsGpu()

NeuralNetworkBase<T>.ExtractSingleExample(Tensor<T>, int)

NeuralNetworkBase<T>.ForwardWithMemory(Tensor<T>)

NeuralNetworkBase<T>.ForwardWithCheckpointing(Tensor<T>)

NeuralNetworkBase<T>.CanUseGpuResidentPath()

NeuralNetworkBase<T>.TryForwardGpuOptimized(Tensor<T>, out Tensor<T>)

NeuralNetworkBase<T>.ForwardGpu(Tensor<T>)

NeuralNetworkBase<T>.ForwardDeferred(Tensor<T>)

NeuralNetworkBase<T>.ForwardDeferredAsync(Tensor<T>, CancellationToken)

NeuralNetworkBase<T>.BeginGpuExecution(GpuExecutionOptions)

NeuralNetworkBase<T>.ForwardWithGpuContext(Tensor<T>)

NeuralNetworkBase<T>.ForwardWithGpuContext(IGpuTensor<T>)

NeuralNetworkBase<T>.GetGpuMemoryStats()

NeuralNetworkBase<T>.ForwardWithFeatures(Tensor<T>, int[])

NeuralNetworkBase<T>.ParameterCount

NeuralNetworkBase<T>.GetParameterCount()

NeuralNetworkBase<T>.InvalidateParameterCountCache()

NeuralNetworkBase<T>.AddLayerToCollection(ILayer<T>)

NeuralNetworkBase<T>.RemoveLayerFromCollection(ILayer<T>)

NeuralNetworkBase<T>.ClearLayers()

NeuralNetworkBase<T>.ValidateCustomLayers(List<ILayer<T>>)

NeuralNetworkBase<T>.ValidateCustomLayersInternal(List<ILayer<T>>)

NeuralNetworkBase<T>.IsValidInputLayer(ILayer<T>)

NeuralNetworkBase<T>.IsValidOutputLayer(ILayer<T>)

NeuralNetworkBase<T>.AreLayersCompatible(ILayer<T>, ILayer<T>)

NeuralNetworkBase<T>.GetParameterGradients()

NeuralNetworkBase<T>.EnsureArchitectureInitialized()

NeuralNetworkBase<T>.SetTrainingMode(bool)

NeuralNetworkBase<T>.EnableMemoryManagement(TrainingMemoryConfig)

NeuralNetworkBase<T>.DisableMemoryManagement()

NeuralNetworkBase<T>.GetMemoryEstimate(int, int)

NeuralNetworkBase<T>.GetLastLoss()

NeuralNetworkBase<T>.ResetState()

NeuralNetworkBase<T>.BackwardWithInputGradient(Tensor<T>)

NeuralNetworkBase<T>.ComputeInputGradient(Vector<T>, Vector<T>)

NeuralNetworkBase<T>.ComputeInputGradient(Tensor<T>, Tensor<T>)

NeuralNetworkBase<T>.SaveModel(string)

NeuralNetworkBase<T>.LoadModel(string)

NeuralNetworkBase<T>.Serialize()

NeuralNetworkBase<T>.Deserialize(byte[])

NeuralNetworkBase<T>.WithParameters(Vector<T>)

NeuralNetworkBase<T>.GetActiveFeatureIndices()

NeuralNetworkBase<T>.IsFeatureUsed(int)

NeuralNetworkBase<T>.DeepCopy()

NeuralNetworkBase<T>.Clone()

NeuralNetworkBase<T>.SetActiveFeatureIndices(IEnumerable<int>)

NeuralNetworkBase<T>._enabledMethods

NeuralNetworkBase<T>._sensitiveFeatures

NeuralNetworkBase<T>._fairnessMetrics

NeuralNetworkBase<T>._baseModel

NeuralNetworkBase<T>.GetGlobalFeatureImportanceAsync()

NeuralNetworkBase<T>.GetLocalFeatureImportanceAsync(Tensor<T>)

NeuralNetworkBase<T>.GetShapValuesAsync(Tensor<T>)

NeuralNetworkBase<T>.GetLimeExplanationAsync(Tensor<T>, int)

NeuralNetworkBase<T>.GetPartialDependenceAsync(Vector<int>, int)

NeuralNetworkBase<T>.GetCounterfactualAsync(Tensor<T>, Tensor<T>, int)

NeuralNetworkBase<T>.GetModelSpecificInterpretabilityAsync()

NeuralNetworkBase<T>.GenerateTextExplanationAsync(Tensor<T>, Tensor<T>)

NeuralNetworkBase<T>.GetFeatureInteractionAsync(int, int)

NeuralNetworkBase<T>.ValidateFairnessAsync(Tensor<T>, int)

NeuralNetworkBase<T>.GetAnchorExplanationAsync(Tensor<T>, T)

NeuralNetworkBase<T>.SetBaseModel<TInput, TOutput>(IFullModel<T, TInput, TOutput>)

NeuralNetworkBase<T>.EnableMethod(params InterpretationMethod[])

NeuralNetworkBase<T>.ConfigureFairness(Vector<int>, params FairnessMetric[])

NeuralNetworkBase<T>.GetNamedLayerActivations(Tensor<T>)

NeuralNetworkBase<T>.GetArchitecture()

NeuralNetworkBase<T>.GetFeatureImportance()

NeuralNetworkBase<T>.SetParameters(Vector<T>)

NeuralNetworkBase<T>.AddLayer(LayerType, int, ActivationFunction)

NeuralNetworkBase<T>.AddConvolutionalLayer(int, int, int, ActivationFunction)

NeuralNetworkBase<T>.AddLSTMLayer(int, bool)

NeuralNetworkBase<T>.AddDropoutLayer(double)

NeuralNetworkBase<T>.AddBatchNormalizationLayer(int, double, double)

NeuralNetworkBase<T>.AddPoolingLayer(int[], PoolingType, int, int?)

NeuralNetworkBase<T>.GetGradients()

NeuralNetworkBase<T>.GetInputShape()

NeuralNetworkBase<T>.GetLayerActivations(Tensor<T>)

NeuralNetworkBase<T>.DefaultLossFunction

NeuralNetworkBase<T>.ComputeGradients(Tensor<T>, Tensor<T>, ILossFunction<T>)

NeuralNetworkBase<T>.ApplyGradients(Vector<T>, T)

NeuralNetworkBase<T>.SaveState(Stream)

NeuralNetworkBase<T>.LoadState(Stream)

NeuralNetworkBase<T>.Dispose()

NeuralNetworkBase<T>.Dispose(bool)

NeuralNetworkBase<T>.SupportsJitCompilation

NeuralNetworkBase<T>.ExportComputationGraph(List<ComputationNode<T>>)

NeuralNetworkBase<T>.ConvertLayerToGraph(ILayer<T>, ComputationNode<T>)

object.Equals(object)

object.Equals(object, object)

object.GetHashCode()

object.GetType()

object.MemberwiseClone()

object.ReferenceEquals(object, object)

object.ToString()

Extension Methods: DistributedExtensions.AsDistributedForHighBandwidth<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributedForLowBandwidth<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributed<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributed<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, IShardingConfiguration<T>)

Remarks

CycleGAN enables image-to-image translation without paired training data: - Uses two generators (A→B and B→A) and two discriminators - Enforces cycle consistency: A→B→A should equal A - Works without paired examples (e.g., can learn horses→zebras from separate collections) - Uses adversarial loss + cycle consistency loss + identity loss

For Beginners: CycleGAN translates images without matched pairs.

Key innovation:

Doesn't need paired training data
Learns from two separate collections of images
Example: Photos of horses + Photos of zebras → can convert horses to zebras

How it works:

Two generators: G (A→B) and F (B→A)
Two discriminators: D_A and D_B
Cycle consistency: G(F(B)) ≈ B and F(G(A)) ≈ A
This prevents mode collapse and maintains content

Applications:

Style transfer (Monet → Photo, Photo → Monet)
Season transfer (Summer → Winter)
Object transfiguration (Horse → Zebra)
Domain adaptation

Reference: Zhu et al., "Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks" (2017)

Constructors

CycleGAN(NeuralNetworkArchitecture<T>, NeuralNetworkArchitecture<T>, NeuralNetworkArchitecture<T>, NeuralNetworkArchitecture<T>, InputType, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>?, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>?, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>?, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>?, ILossFunction<T>?, double, double)

Initializes a new instance of the CycleGAN<T> class with the specified architecture and training parameters.

public CycleGAN(NeuralNetworkArchitecture<T> generatorAtoB, NeuralNetworkArchitecture<T> generatorBtoA, NeuralNetworkArchitecture<T> discriminatorA, NeuralNetworkArchitecture<T> discriminatorB, InputType inputType, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>? generatorAtoBOptimizer = null, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>? generatorBtoAOptimizer = null, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>? discriminatorAOptimizer = null, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>? discriminatorBOptimizer = null, ILossFunction<T>? lossFunction = null, double cycleConsistencyLambda = 10, double identityLambda = 5)

Parameters

generatorAtoB NeuralNetworkArchitecture<T>: The architecture for the generator that transforms images from domain A to domain B.
generatorBtoA NeuralNetworkArchitecture<T>: The architecture for the generator that transforms images from domain B to domain A.
discriminatorA NeuralNetworkArchitecture<T>: The architecture for the discriminator that evaluates images in domain A.
discriminatorB NeuralNetworkArchitecture<T>: The architecture for the discriminator that evaluates images in domain B.
inputType InputType: The type of input data (e.g., ThreeDimensional for images).
generatorAtoBOptimizer IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>: Optional optimizer for the A→B generator. If null, an Adam optimizer with default GAN settings is created.
generatorBtoAOptimizer IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>: Optional optimizer for the B→A generator. If null, an Adam optimizer with default GAN settings is created.
discriminatorAOptimizer IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>: Optional optimizer for discriminator A. If null, an Adam optimizer with default GAN settings is created.
discriminatorBOptimizer IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>: Optional optimizer for discriminator B. If null, an Adam optimizer with default GAN settings is created.
lossFunction ILossFunction<T>: Optional loss function. If null, the default loss function for generative tasks is used.
cycleConsistencyLambda double: The coefficient for cycle consistency loss. Higher values enforce stronger cycle consistency. Default is 10.0.
identityLambda double: The coefficient for identity loss. Helps preserve color composition. Default is 5.0.

Remarks

This constructor creates a CycleGAN with four separate networks and optimizers: - Generator A→B: Transforms images from domain A to domain B - Generator B→A: Transforms images from domain B to domain A - Discriminator A: Evaluates whether images in domain A are real or generated - Discriminator B: Evaluates whether images in domain B are real or generated

For Beginners: CycleGAN needs four networks to work: - Two generators to translate images in both directions - Two discriminators to judge images in each domain

The cycle consistency loss ensures that translating A→B→A gets back to the original, which helps maintain content while only changing style.

Exceptions

ArgumentNullException: Thrown when any of the architecture parameters is null.
ArgumentOutOfRangeException: Thrown when cycleConsistencyLambda or identityLambda is negative.

Properties

DiscriminatorA

Discriminator for domain A.

public NeuralNetworkBase<T> DiscriminatorA { get; }

Property Value

NeuralNetworkBase<T>

DiscriminatorB

Discriminator for domain B.

public NeuralNetworkBase<T> DiscriminatorB { get; }

Property Value

NeuralNetworkBase<T>

GeneratorAtoB

Generator A→B.

public NeuralNetworkBase<T> GeneratorAtoB { get; }

Property Value

NeuralNetworkBase<T>

GeneratorBtoA

Generator B→A.

public NeuralNetworkBase<T> GeneratorBtoA { get; }

Property Value

NeuralNetworkBase<T>

Methods

CreateNewInstance()

Creates a new instance of the CycleGAN with the same configuration.

protected override IFullModel<T, Tensor<T>, Tensor<T>> CreateNewInstance()

Returns

IFullModel<T, Tensor<T>, Tensor<T>>: A new CycleGAN instance with the same architecture and hyperparameters.

Remarks

This method creates a fresh CycleGAN instance with the same network architectures and hyperparameters. The new instance has freshly initialized optimizers.

For Beginners: This method creates a copy of the CycleGAN structure but with new, untrained networks and fresh optimizers.

DeserializeNetworkSpecificData(BinaryReader)

Deserializes CycleGAN-specific data from a binary reader.

protected override void DeserializeNetworkSpecificData(BinaryReader reader)

Parameters

reader BinaryReader: The binary reader to read from.

Remarks

This method deserializes the CycleGAN-specific configuration and all four networks. After deserialization, the optimizers are reset to their initial state.

For Beginners: This method loads the CycleGAN's settings and all four networks (two generators and two discriminators) from a file.

GetModelMetadata()

Gets the metadata for this neural network model.

public override ModelMetadata<T> GetModelMetadata()

Returns

ModelMetadata<T>: A ModelMetaData object containing information about the model.

InitializeLayers()

Initializes the layers of the neural network based on the architecture.

protected override void InitializeLayers()

Remarks

For Beginners: This method sets up all the layers in your neural network according to the architecture you've defined. It's like assembling the parts of your network before you can use it.

Predict(Tensor<T>)

Makes a prediction using the neural network.

public override Tensor<T> Predict(Tensor<T> input)

Parameters

input Tensor<T>: The input data to process.

Returns

Tensor<T>: The network's prediction.

Remarks

For Beginners: This is the main method you'll use to get results from your trained neural network. You provide some input data (like an image or text), and the network processes it through all its layers to produce an output (like a classification or prediction).

ResetOptimizerState()

Resets the state of all optimizers to their initial values.

public void ResetOptimizerState()

Remarks

This method resets all four optimizers (both generators and both discriminators) to their initial state. This is useful when restarting training or when you want to clear accumulated momentum and adaptive learning rate information.

For Beginners: Call this method when you want to start fresh with training, as if the model had never been trained before. The network weights remain unchanged, but the optimizer's memory of past gradients is cleared.

SerializeNetworkSpecificData(BinaryWriter)

Serializes CycleGAN-specific data to a binary writer.

protected override void SerializeNetworkSpecificData(BinaryWriter writer)

Parameters

writer BinaryWriter: The binary writer to write to.

Remarks

This method serializes the CycleGAN-specific configuration and all four networks. Optimizer state is managed by the optimizer implementations themselves.

For Beginners: This method saves the CycleGAN's settings and all four networks (two generators and two discriminators) to a file.

Train(Tensor<T>, Tensor<T>)

Trains the neural network on a single input-output pair.

public override void Train(Tensor<T> input, Tensor<T> expectedOutput)

Parameters

input Tensor<T>: The input data.
expectedOutput Tensor<T>: The expected output for the given input.

Remarks

This method performs one training step on the neural network using the provided input and expected output. It updates the network's parameters to reduce the error between the network's prediction and the expected output.

For Beginners: This is how your neural network learns. You provide: - An input (what the network should process) - The expected output (what the correct answer should be)

The network then:

Makes a prediction based on the input
Compares its prediction to the expected output
Calculates how wrong it was (the loss)
Adjusts its internal values to do better next time

After training, you can get the loss value using the GetLastLoss() method to see how well the network is learning.

TrainStep(Tensor<T>, Tensor<T>)

Performs one training step for CycleGAN.

public (T discLoss, T genLoss, T cycleLoss) TrainStep(Tensor<T> realA, Tensor<T> realB)

Parameters

realA Tensor<T>: Real images from domain A.
realB Tensor<T>: Real images from domain B.

Returns

(T Precision, T Recall, T F1Score): A tuple containing discriminator loss, generator loss, and cycle consistency loss.

Exceptions

ArgumentNullException: Thrown when realA or realB is null.
ArgumentException: Thrown when batch dimensions don't match or batch size is zero.

TranslateAtoB(Tensor<T>)

Translates image from domain A to domain B.

public Tensor<T> TranslateAtoB(Tensor<T> imageA)

Parameters

imageA Tensor<T>

Returns

Tensor<T>

Remarks

This method temporarily sets the generator to evaluation mode for inference, then restores the original training mode after prediction. This ensures batch normalization and dropout behave correctly during both inference and subsequent training steps.

TranslateBtoA(Tensor<T>)

Translates image from domain B to domain A.

public Tensor<T> TranslateBtoA(Tensor<T> imageB)

Parameters

imageB Tensor<T>

Returns

Tensor<T>

Remarks

UpdateParameters(Vector<T>)

Updates the parameters of all networks in the CycleGAN.

public override void UpdateParameters(Vector<T> parameters)

Parameters

parameters Vector<T>: The new parameters vector containing parameters for all networks.

Table of Contents

Class CycleGAN<T>

Type Parameters

Remarks

Constructors

Parameters

Remarks

Exceptions

Properties

DiscriminatorA

Property Value

DiscriminatorB

Property Value

GeneratorAtoB

Property Value

GeneratorBtoA

Property Value

Methods

CreateNewInstance()

Returns

Remarks

DeserializeNetworkSpecificData(BinaryReader)

Parameters

Remarks

GetModelMetadata()

Returns

InitializeLayers()

Remarks

Predict(Tensor<T>)

Parameters

Returns

Remarks

ResetOptimizerState()

Remarks

SerializeNetworkSpecificData(BinaryWriter)

Parameters

Remarks

Train(Tensor<T>, Tensor<T>)

Parameters

Remarks

TrainStep(Tensor<T>, Tensor<T>)

Parameters

Returns

Exceptions

TranslateAtoB(Tensor<T>)

Parameters

Returns

Remarks

TranslateBtoA(Tensor<T>)

Parameters

Returns

Remarks

UpdateParameters(Vector<T>)

Parameters