Class ACGAN<T>

Namespace: AiDotNet.NeuralNetworks

Assembly: AiDotNet.dll

Represents an Auxiliary Classifier Generative Adversarial Network (AC-GAN), which extends conditional GANs by having the discriminator also predict the class label of the input.

public class ACGAN<T> : NeuralNetworkBase<T>, INeuralNetworkModel<T>, INeuralNetwork<T>, IFullModel<T, Tensor<T>, Tensor<T>>, IModel<Tensor<T>, Tensor<T>, ModelMetadata<T>>, IModelSerializer, ICheckpointableModel, IParameterizable<T, Tensor<T>, Tensor<T>>, IFeatureAware, IFeatureImportance<T>, ICloneable<IFullModel<T, Tensor<T>, Tensor<T>>>, IGradientComputable<T, Tensor<T>, Tensor<T>>, IJitCompilable<T>, IInterpretableModel<T>, IInputGradientComputable<T>, IDisposable

Type Parameters

T: The numeric type used for calculations, typically float or double.

Inheritance: object

NeuralNetworkBase<T>

ACGAN<T>

Implements: INeuralNetworkModel<T>

INeuralNetwork<T>

IFullModel<T, Tensor<T>, Tensor<T>>

IModel<Tensor<T>, Tensor<T>, ModelMetadata<T>>

IModelSerializer

ICheckpointableModel

IParameterizable<T, Tensor<T>, Tensor<T>>

IFeatureAware

IFeatureImportance<T>

ICloneable<IFullModel<T, Tensor<T>, Tensor<T>>>

IGradientComputable<T, Tensor<T>, Tensor<T>>

IJitCompilable<T>

IInterpretableModel<T>

IInputGradientComputable<T>

IDisposable

Inherited Members: NeuralNetworkBase<T>.Layers

NeuralNetworkBase<T>.LayerCount

NeuralNetworkBase<T>.Architecture

NeuralNetworkBase<T>.NumOps

NeuralNetworkBase<T>.Engine

NeuralNetworkBase<T>._layerInputs

NeuralNetworkBase<T>._layerOutputs

NeuralNetworkBase<T>.Random

NeuralNetworkBase<T>.LossFunction

NeuralNetworkBase<T>.LastLoss

NeuralNetworkBase<T>.IsTrainingMode

NeuralNetworkBase<T>.SupportsTraining

NeuralNetworkBase<T>.SupportsGpuTraining

NeuralNetworkBase<T>.CanTrainOnGpu

NeuralNetworkBase<T>.GpuEngine

NeuralNetworkBase<T>.MaxGradNorm

NeuralNetworkBase<T>._mixedPrecisionContext

NeuralNetworkBase<T>._memoryManager

NeuralNetworkBase<T>.IsMemoryManagementEnabled

NeuralNetworkBase<T>.IsGradientCheckpointingEnabled

NeuralNetworkBase<T>.IsMixedPrecisionEnabled

NeuralNetworkBase<T>.ClipGradients(List<Tensor<T>>)

NeuralNetworkBase<T>.ClipGradient(Tensor<T>)

NeuralNetworkBase<T>.ClipGradient(Vector<T>)

NeuralNetworkBase<T>.GetParameters()

NeuralNetworkBase<T>.Backpropagate(Tensor<T>)

NeuralNetworkBase<T>.BackpropagateWithRecompute(Tensor<T>)

NeuralNetworkBase<T>.ForwardGpu(IGpuTensor<T>)

NeuralNetworkBase<T>.BackpropagateGpu(IGpuTensor<T>)

NeuralNetworkBase<T>.BackpropagateGpuDeferred(IGpuTensor<T>, GpuExecutionOptions)

NeuralNetworkBase<T>.UpdateParametersGpu(T, T, T)

NeuralNetworkBase<T>.UpdateParametersGpu(IGpuOptimizerConfig)

NeuralNetworkBase<T>.UpdateParametersGpuDeferred(IGpuOptimizerConfig, GpuExecutionOptions)

NeuralNetworkBase<T>.TrainBatchGpuDeferred(IGpuTensor<T>, IGpuTensor<T>, IGpuOptimizerConfig, GpuExecutionOptions)

NeuralNetworkBase<T>.TrainBatchGpuDeferredAsync(IGpuTensor<T>, IGpuTensor<T>, IGpuOptimizerConfig, GpuExecutionOptions, CancellationToken)

NeuralNetworkBase<T>.UploadWeightsToGpu()

NeuralNetworkBase<T>.DownloadWeightsFromGpu()

NeuralNetworkBase<T>.ZeroGradientsGpu()

NeuralNetworkBase<T>.ExtractSingleExample(Tensor<T>, int)

NeuralNetworkBase<T>.ForwardWithMemory(Tensor<T>)

NeuralNetworkBase<T>.ForwardWithCheckpointing(Tensor<T>)

NeuralNetworkBase<T>.CanUseGpuResidentPath()

NeuralNetworkBase<T>.TryForwardGpuOptimized(Tensor<T>, out Tensor<T>)

NeuralNetworkBase<T>.ForwardGpu(Tensor<T>)

NeuralNetworkBase<T>.ForwardDeferred(Tensor<T>)

NeuralNetworkBase<T>.ForwardDeferredAsync(Tensor<T>, CancellationToken)

NeuralNetworkBase<T>.BeginGpuExecution(GpuExecutionOptions)

NeuralNetworkBase<T>.ForwardWithGpuContext(Tensor<T>)

NeuralNetworkBase<T>.ForwardWithGpuContext(IGpuTensor<T>)

NeuralNetworkBase<T>.GetGpuMemoryStats()

NeuralNetworkBase<T>.ForwardWithFeatures(Tensor<T>, int[])

NeuralNetworkBase<T>.GetParameterCount()

NeuralNetworkBase<T>.InvalidateParameterCountCache()

NeuralNetworkBase<T>.AddLayerToCollection(ILayer<T>)

NeuralNetworkBase<T>.RemoveLayerFromCollection(ILayer<T>)

NeuralNetworkBase<T>.ClearLayers()

NeuralNetworkBase<T>.ValidateCustomLayers(List<ILayer<T>>)

NeuralNetworkBase<T>.ValidateCustomLayersInternal(List<ILayer<T>>)

NeuralNetworkBase<T>.IsValidInputLayer(ILayer<T>)

NeuralNetworkBase<T>.IsValidOutputLayer(ILayer<T>)

NeuralNetworkBase<T>.AreLayersCompatible(ILayer<T>, ILayer<T>)

NeuralNetworkBase<T>.GetParameterGradients()

NeuralNetworkBase<T>.EnsureArchitectureInitialized()

NeuralNetworkBase<T>.SetTrainingMode(bool)

NeuralNetworkBase<T>.EnableMemoryManagement(TrainingMemoryConfig)

NeuralNetworkBase<T>.DisableMemoryManagement()

NeuralNetworkBase<T>.GetMemoryEstimate(int, int)

NeuralNetworkBase<T>.GetLastLoss()

NeuralNetworkBase<T>.ResetState()

NeuralNetworkBase<T>.BackwardWithInputGradient(Tensor<T>)

NeuralNetworkBase<T>.ComputeInputGradient(Vector<T>, Vector<T>)

NeuralNetworkBase<T>.ComputeInputGradient(Tensor<T>, Tensor<T>)

NeuralNetworkBase<T>.SaveModel(string)

NeuralNetworkBase<T>.LoadModel(string)

NeuralNetworkBase<T>.Serialize()

NeuralNetworkBase<T>.Deserialize(byte[])

NeuralNetworkBase<T>.WithParameters(Vector<T>)

NeuralNetworkBase<T>.GetActiveFeatureIndices()

NeuralNetworkBase<T>.IsFeatureUsed(int)

NeuralNetworkBase<T>.DeepCopy()

NeuralNetworkBase<T>.Clone()

NeuralNetworkBase<T>.SetActiveFeatureIndices(IEnumerable<int>)

NeuralNetworkBase<T>._enabledMethods

NeuralNetworkBase<T>._sensitiveFeatures

NeuralNetworkBase<T>._fairnessMetrics

NeuralNetworkBase<T>._baseModel

NeuralNetworkBase<T>.GetGlobalFeatureImportanceAsync()

NeuralNetworkBase<T>.GetLocalFeatureImportanceAsync(Tensor<T>)

NeuralNetworkBase<T>.GetShapValuesAsync(Tensor<T>)

NeuralNetworkBase<T>.GetLimeExplanationAsync(Tensor<T>, int)

NeuralNetworkBase<T>.GetPartialDependenceAsync(Vector<int>, int)

NeuralNetworkBase<T>.GetCounterfactualAsync(Tensor<T>, Tensor<T>, int)

NeuralNetworkBase<T>.GetModelSpecificInterpretabilityAsync()

NeuralNetworkBase<T>.GenerateTextExplanationAsync(Tensor<T>, Tensor<T>)

NeuralNetworkBase<T>.GetFeatureInteractionAsync(int, int)

NeuralNetworkBase<T>.ValidateFairnessAsync(Tensor<T>, int)

NeuralNetworkBase<T>.GetAnchorExplanationAsync(Tensor<T>, T)

NeuralNetworkBase<T>.SetBaseModel<TInput, TOutput>(IFullModel<T, TInput, TOutput>)

NeuralNetworkBase<T>.EnableMethod(params InterpretationMethod[])

NeuralNetworkBase<T>.ConfigureFairness(Vector<int>, params FairnessMetric[])

NeuralNetworkBase<T>.GetNamedLayerActivations(Tensor<T>)

NeuralNetworkBase<T>.GetArchitecture()

NeuralNetworkBase<T>.GetFeatureImportance()

NeuralNetworkBase<T>.SetParameters(Vector<T>)

NeuralNetworkBase<T>.AddLayer(LayerType, int, ActivationFunction)

NeuralNetworkBase<T>.AddConvolutionalLayer(int, int, int, ActivationFunction)

NeuralNetworkBase<T>.AddLSTMLayer(int, bool)

NeuralNetworkBase<T>.AddDropoutLayer(double)

NeuralNetworkBase<T>.AddBatchNormalizationLayer(int, double, double)

NeuralNetworkBase<T>.AddPoolingLayer(int[], PoolingType, int, int?)

NeuralNetworkBase<T>.GetGradients()

NeuralNetworkBase<T>.GetInputShape()

NeuralNetworkBase<T>.GetLayerActivations(Tensor<T>)

NeuralNetworkBase<T>.DefaultLossFunction

NeuralNetworkBase<T>.ComputeGradients(Tensor<T>, Tensor<T>, ILossFunction<T>)

NeuralNetworkBase<T>.ApplyGradients(Vector<T>, T)

NeuralNetworkBase<T>.SaveState(Stream)

NeuralNetworkBase<T>.LoadState(Stream)

NeuralNetworkBase<T>.Dispose()

NeuralNetworkBase<T>.Dispose(bool)

NeuralNetworkBase<T>.SupportsJitCompilation

NeuralNetworkBase<T>.ExportComputationGraph(List<ComputationNode<T>>)

NeuralNetworkBase<T>.ConvertLayerToGraph(ILayer<T>, ComputationNode<T>)

object.Equals(object)

object.Equals(object, object)

object.GetHashCode()

object.GetType()

object.MemberwiseClone()

object.ReferenceEquals(object, object)

object.ToString()

Extension Methods: DistributedExtensions.AsDistributedForHighBandwidth<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributedForLowBandwidth<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributed<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributed<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, IShardingConfiguration<T>)

Remarks

AC-GAN improves upon conditional GANs by: - Making the discriminator predict both authenticity AND class label - Providing stronger gradient signals for class-conditional generation - Improving image quality and class separability - Enabling better control over generated samples - Training more stable than basic conditional GANs

For Beginners: AC-GAN generates specific types of images with better quality.

Key improvements over cGAN:

Discriminator has two tasks: "Is it real?" AND "What class is it?"
This dual task helps the discriminator learn better features
Generator must create images that fool both checks
Results in higher quality and more class-consistent images

Example use case:

Generate digit "7" that looks very realistic
Discriminator checks: 1) Is it real? 2) Is it a "7"?
This forces the generator to make better "7"s

Reference: Odena et al., "Conditional Image Synthesis with Auxiliary Classifier GANs" (2017)

Constructors

ACGAN(NeuralNetworkArchitecture<T>, NeuralNetworkArchitecture<T>, int, InputType, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>?, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>?, ILossFunction<T>?)

Initializes a new instance of the ACGAN<T> class.

public ACGAN(NeuralNetworkArchitecture<T> generatorArchitecture, NeuralNetworkArchitecture<T> discriminatorArchitecture, int numClasses, InputType inputType, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>? generatorOptimizer = null, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>? discriminatorOptimizer = null, ILossFunction<T>? lossFunction = null)

Parameters

generatorArchitecture NeuralNetworkArchitecture<T>: The neural network architecture for the generator.
discriminatorArchitecture NeuralNetworkArchitecture<T>: The neural network architecture for the discriminator. Note: Output size should be 1 + numClasses (authenticity probability + class probabilities). All outputs must be in range (0, 1) - use sigmoid/softmax activations in the final layer.
numClasses int: The number of classes.
inputType InputType: The type of input.
generatorOptimizer IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>: Optional optimizer for the generator. If null, Adam optimizer is used.
discriminatorOptimizer IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>: Optional optimizer for the discriminator. If null, Adam optimizer is used.
lossFunction ILossFunction<T>: Optional loss function.

Properties

Discriminator

Gets the discriminator network that predicts both authenticity and class.

public ConvolutionalNeuralNetwork<T> Discriminator { get; }

Property Value

ConvolutionalNeuralNetwork<T>

Remarks

Unlike standard GANs, the AC-GAN discriminator has two outputs: 1. Authenticity score (real vs fake) - 1 output 2. Class probability distribution - numClasses outputs

For Beginners: The discriminator is a multi-task network.

Two outputs:

"Is this real or fake?" (1 number: 0-1)
"What class is this?" (probability for each class)

This dual purpose makes it a better feature learner.

Generator

Gets the generator network that creates class-conditional synthetic data.

public ConvolutionalNeuralNetwork<T> Generator { get; }

Property Value

ConvolutionalNeuralNetwork<T>

ParameterCount

Gets the total number of trainable parameters in the ACGAN.

public override int ParameterCount { get; }

Property Value

int

Methods

CreateNewInstance()

Creates a new instance of the same type as this neural network.

protected override IFullModel<T, Tensor<T>, Tensor<T>> CreateNewInstance()

Returns

IFullModel<T, Tensor<T>, Tensor<T>>: A new instance of the same neural network type.

Remarks

For Beginners: This creates a blank version of the same type of neural network.

It's used internally by methods like DeepCopy and Clone to create the right type of network before copying the data into it.

CreateOneHotLabels(int, int)

Creates one-hot encoded class labels.

public Tensor<T> CreateOneHotLabels(int batchSize, int classIndex)

Parameters

batchSize int
classIndex int

Returns

Tensor<T>

DeserializeNetworkSpecificData(BinaryReader)

Deserializes AC-GAN-specific data including networks and optimizer states.

protected override void DeserializeNetworkSpecificData(BinaryReader reader)

Parameters

reader BinaryReader: The binary reader to deserialize data from.

Remarks

This method restores all components needed to continue AC-GAN training from a saved state:

Number of classes for classification
Loss histories for training progress visualization
Generator and Discriminator networks with all learned weights
Optimizer states (momentum vectors, adaptive learning rates, timesteps)

For Beginners: When you load a saved AC-GAN, this method restores everything needed to continue training exactly where you left off:

The networks remember everything they learned
The optimizers remember their momentum and learning rate adjustments
Training can resume smoothly without any "warm-up" period

This is especially important for Adam optimizer which maintains momentum vectors (m and v) and a timestep counter - losing these would cause training instability after loading.

GenerateConditional(Tensor<T>, Tensor<T>)

Generates class-conditional images.

public Tensor<T> GenerateConditional(Tensor<T> noise, Tensor<T> classLabels)

Parameters

noise Tensor<T>
classLabels Tensor<T>

Returns

Tensor<T>

GenerateRandomNoiseTensor(int, int)

Generates random noise tensor using vectorized Gaussian noise generation.

public Tensor<T> GenerateRandomNoiseTensor(int batchSize, int noiseSize)

Parameters

batchSize int
noiseSize int

Returns

Tensor<T>

GetModelMetadata()

Gets the metadata for this neural network model.

public override ModelMetadata<T> GetModelMetadata()

Returns

ModelMetadata<T>: A ModelMetaData object containing information about the model.

InitializeLayers()

Initializes the layers of the neural network based on the architecture.

protected override void InitializeLayers()

Remarks

For Beginners: This method sets up all the layers in your neural network according to the architecture you've defined. It's like assembling the parts of your network before you can use it.

Predict(Tensor<T>)

Makes a prediction using the neural network.

public override Tensor<T> Predict(Tensor<T> input)

Parameters

input Tensor<T>: The input data to process.

Returns

Tensor<T>: The network's prediction.

Remarks

For Beginners: This is the main method you'll use to get results from your trained neural network. You provide some input data (like an image or text), and the network processes it through all its layers to produce an output (like a classification or prediction).

ResetOptimizerState()

Resets both optimizer states for a fresh training run.

public void ResetOptimizerState()

SerializeNetworkSpecificData(BinaryWriter)

Serializes AC-GAN-specific data including networks and optimizer states.

protected override void SerializeNetworkSpecificData(BinaryWriter writer)

Parameters

writer BinaryWriter: The binary writer to serialize data to.

Remarks

This method serializes all components needed to fully restore an AC-GAN's training state:

Number of classes
Loss histories for monitoring training progress
Generator and Discriminator networks with all learned weights
Optimizer states (momentum, adaptive learning rates, timesteps)

For Beginners: When you save an AC-GAN during training, this method ensures that everything needed to resume training is saved:

The networks' learned knowledge (weights and biases)
The optimizers' "memory" (like Adam's momentum vectors)
Training history (loss values for monitoring)

Without saving optimizer states, resuming training would be like starting with a new optimizer that has forgotten all the momentum and adaptive learning rates it built up, which can cause unstable training after loading.

Train(Tensor<T>, Tensor<T>)

Performs a single training iteration using the standard neural network interface.

public override void Train(Tensor<T> input, Tensor<T> expectedOutput)

Parameters

input Tensor<T>: The noise tensor used as input to the generator network. Shape should be [batchSize, noiseSize] where noiseSize matches the generator's expected input.
expectedOutput Tensor<T>: The real images tensor used for discriminator training. Shape should be [batchSize, height, width, channels] or equivalent flattened form.

Remarks

This method adapts the AC-GAN's specialized training to the standard Train(Tensor<T>, Tensor<T>) interface by automatically generating random class labels for both real and fake samples.

The AC-GAN training process differs from standard neural networks because it requires:

Real images with their class labels
Noise vectors for generating fake images
Target class labels for the generated images

When using this simplified interface, random class labels are generated using AiDotNet.Tensors.Helpers.RandomHelper.ThreadSafeRandom for thread-safe, cryptographically-seeded random number generation. For more control over class labels, use the TrainStep(Tensor<T>, Tensor<T>, Tensor<T>, Tensor<T>) method directly.

For Beginners: This method lets you train an AC-GAN using the same interface as other neural networks. Just provide:

input: Random noise vectors (like random seeds for image generation)
expectedOutput: Real images to learn from

The method automatically assigns random class labels (like "digit 3", "digit 7", etc.) to both the real images and the images to generate. While this is convenient, for best results you should use TrainStep(Tensor<T>, Tensor<T>, Tensor<T>, Tensor<T>) with actual class labels from your dataset.

Exceptions

ArgumentNullException: Thrown when input or expectedOutput is null.

See Also: TrainStep(Tensor<T>, Tensor<T>, Tensor<T>, Tensor<T>)

TrainStep(Tensor<T>, Tensor<T>, Tensor<T>, Tensor<T>)

Performs one training step for the AC-GAN.

public (T discriminatorLoss, T generatorLoss) TrainStep(Tensor<T> realImages, Tensor<T> realLabels, Tensor<T> noise, Tensor<T> fakeLabels)

Parameters

realImages Tensor<T>: Real images tensor.
realLabels Tensor<T>: Real image class labels (one-hot encoded).
noise Tensor<T>: Random noise for generator.
fakeLabels Tensor<T>: Class labels for images to generate (one-hot encoded).

Returns

(T Accuracy, T Loss): Tuple of (discriminator loss, generator loss).

UpdateParameters(Vector<T>)

Updates the network's parameters with new values.

public override void UpdateParameters(Vector<T> parameters)

Parameters

parameters Vector<T>: The new parameter values to set.

Remarks

For Beginners: During training, a neural network's internal values (parameters) get adjusted to improve its performance. This method allows you to update all those values at once by providing a complete set of new parameters.

This is typically used by optimization algorithms that calculate better parameter values based on training data.

Table of Contents

Class ACGAN<T>

Type Parameters

Remarks

Constructors

ACGAN(NeuralNetworkArchitecture<T>, NeuralNetworkArchitecture<T>, int, InputType, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>?, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>?, ILossFunction<T>?)

Parameters

Properties

Discriminator

Property Value

Remarks

Generator

Property Value

ParameterCount

Property Value

Methods

CreateNewInstance()

Returns

Remarks

CreateOneHotLabels(int, int)

Parameters

Returns

DeserializeNetworkSpecificData(BinaryReader)

Parameters

Remarks

GenerateConditional(Tensor<T>, Tensor<T>)

Parameters

Returns

GenerateRandomNoiseTensor(int, int)

Parameters

Returns

GetModelMetadata()

Returns

InitializeLayers()

Remarks

Predict(Tensor<T>)

Parameters

Returns

Remarks

ResetOptimizerState()

SerializeNetworkSpecificData(BinaryWriter)

Parameters

Remarks

Train(Tensor<T>, Tensor<T>)

Parameters

Remarks

Exceptions

TrainStep(Tensor<T>, Tensor<T>, Tensor<T>, Tensor<T>)

Parameters

Returns

UpdateParameters(Vector<T>)

Parameters

Remarks