Class MoCoV3<T>

Namespace: AiDotNet.SelfSupervisedLearning

Assembly: AiDotNet.dll

MoCo v3: An Empirical Study of Training Self-Supervised Vision Transformers.

public class MoCoV3<T> : SSLMethodBase<T>, ISSLMethod<T>

Type Parameters

T: The numeric type used for computations.

Inheritance: object

SSLMethodBase<T>

MoCoV3<T>

Implements: ISSLMethod<T>

Inherited Members: SSLMethodBase<T>.NumOps

SSLMethodBase<T>.Engine

SSLMethodBase<T>._encoder

SSLMethodBase<T>._projector

SSLMethodBase<T>._config

SSLMethodBase<T>._isTraining

SSLMethodBase<T>._currentStep

SSLMethodBase<T>._currentEpoch

SSLMethodBase<T>.ParameterCount

SSLMethodBase<T>.GetEncoder()

SSLMethodBase<T>.TrainStep(Tensor<T>, SSLAugmentationContext<T>)

SSLMethodBase<T>.Encode(Tensor<T>)

SSLMethodBase<T>.EncodeAndProject(Tensor<T>)

SSLMethodBase<T>.Reset()

SSLMethodBase<T>.GetParameters()

SSLMethodBase<T>.SetParameters(Vector<T>)

SSLMethodBase<T>.GetAdditionalParameters()

SSLMethodBase<T>.GetAdditionalParameterCount()

SSLMethodBase<T>.SetAdditionalParameters(Vector<T>, ref int)

SSLMethodBase<T>.SetTrainingMode(bool)

SSLMethodBase<T>.OnEpochEnd(int)

SSLMethodBase<T>.GetEffectiveTemperature()

SSLMethodBase<T>.GetEffectiveLearningRate()

SSLMethodBase<T>.CreateStepResult(T)

SSLMethodBase<T>.CosineSimilarity(Tensor<T>, Tensor<T>)

SSLMethodBase<T>.L2Normalize(Tensor<T>)

SSLMethodBase<T>.MatMul(Tensor<T>, Tensor<T>)

SSLMethodBase<T>.ComputeSimilarityMatrix(Tensor<T>, Tensor<T>, bool)

SSLMethodBase<T>.ComputePairwiseDistances(Tensor<T>)

object.Equals(object)

object.Equals(object, object)

object.GetHashCode()

object.GetType()

object.MemberwiseClone()

object.ReferenceEquals(object, object)

object.ToString()

Remarks

For Beginners: MoCo v3 adapts momentum contrastive learning specifically for Vision Transformers (ViT). It simplifies the framework by removing the memory queue and using in-batch negatives with a symmetric loss.

Key changes from MoCo v1/v2:

No memory queue: Uses in-batch negatives like SimCLR
Symmetric loss: Both views serve as queries and keys
Prediction head: Adds a predictor MLP on one branch
ViT optimizations: Random patch projection, no BN in MLP heads

Training stability for ViT:

Uses lower learning rates and careful initialization
Gradient clipping and careful warmup
Momentum encoder still provides stable targets

Reference: Chen et al., "An Empirical Study of Training Self-Supervised Vision Transformers" (ICCV 2021)

Constructors

MoCoV3(INeuralNetwork<T>, IMomentumEncoder<T>, IProjectorHead<T>, IProjectorHead<T>, IProjectorHead<T>?, SSLConfig?)

Initializes a new instance of the MoCoV3 class.

public MoCoV3(INeuralNetwork<T> encoder, IMomentumEncoder<T> momentumEncoder, IProjectorHead<T> projector, IProjectorHead<T> momentumProjector, IProjectorHead<T>? predictor = null, SSLConfig? config = null)

Parameters

encoder INeuralNetwork<T>: The online encoder network (ViT recommended).
momentumEncoder IMomentumEncoder<T>: The momentum encoder.
projector IProjectorHead<T>: Projection head for online encoder.
momentumProjector IProjectorHead<T>: Projection head for momentum encoder.
predictor IProjectorHead<T>: Predictor head (applied to online branch only).
config SSLConfig: Optional SSL configuration.

Properties

Name

Gets the name of this SSL method.

public override string Name { get; }

Property Value

string

Remarks

Examples: "SimCLR", "MoCo v2", "BYOL", "DINO", "MAE"

RequiresMemoryBank

Indicates whether this method requires a memory bank for negative samples.

public override bool RequiresMemoryBank { get; }

Property Value

bool

Remarks

For Beginners: Memory banks store embeddings from previous batches to use as negative samples in contrastive learning. MoCo uses this, SimCLR does not.

UsesMomentumEncoder

Indicates whether this method uses a momentum-updated encoder.

public override bool UsesMomentumEncoder { get; }

Property Value

bool

Remarks

For Beginners: A momentum encoder is a slowly-updated copy of the main encoder. Methods like MoCo, BYOL, and DINO use this to provide stable targets.

Methods

OnEpochStart(int)

Signals the start of a new epoch.

public override void OnEpochStart(int epochNumber)

Parameters

epochNumber int: The current epoch number.

TrainStepCore(Tensor<T>, SSLAugmentationContext<T>?)

Implementation-specific training step logic.

protected override SSLStepResult<T> TrainStepCore(Tensor<T> batch, SSLAugmentationContext<T>? augmentationContext)

Parameters

batch Tensor<T>: The input batch tensor.
augmentationContext SSLAugmentationContext<T>: Optional augmentation context.

Returns

SSLStepResult<T>: The result of the training step.

Table of Contents

Class MoCoV3<T>

Type Parameters

Remarks

Constructors

MoCoV3(INeuralNetwork<T>, IMomentumEncoder<T>, IProjectorHead<T>, IProjectorHead<T>, IProjectorHead<T>?, SSLConfig?)

Parameters

Properties

Category

Property Value

Remarks

Name

Property Value

Remarks

RequiresMemoryBank

Property Value

Remarks

UsesMomentumEncoder

Property Value

Remarks

Methods

OnEpochStart(int)

Parameters

TrainStepCore(Tensor<T>, SSLAugmentationContext<T>?)

Parameters

Returns