Class CodeModelBase<T>

Namespace: AiDotNet.ProgramSynthesis.Engines

Assembly: AiDotNet.dll

Base class for code models that provides shared tokenization, task dispatch, and structured outputs.

public abstract class CodeModelBase<T> : NeuralNetworkBase<T>, INeuralNetworkModel<T>, INeuralNetwork<T>, IInterpretableModel<T>, IInputGradientComputable<T>, IDisposable, ICodeModel<T>, IFullModel<T, Tensor<T>, Tensor<T>>, IModel<Tensor<T>, Tensor<T>, ModelMetadata<T>>, IModelSerializer, ICheckpointableModel, IParameterizable<T, Tensor<T>, Tensor<T>>, IFeatureAware, IFeatureImportance<T>, ICloneable<IFullModel<T, Tensor<T>, Tensor<T>>>, IGradientComputable<T, Tensor<T>, Tensor<T>>, IJitCompilable<T>

Type Parameters

T: The numeric type used for calculations (e.g., double, float).

Inheritance: object

NeuralNetworkBase<T>

CodeModelBase<T>

Implements: INeuralNetworkModel<T>

INeuralNetwork<T>

IInterpretableModel<T>

IInputGradientComputable<T>

IDisposable

ICodeModel<T>

IFullModel<T, Tensor<T>, Tensor<T>>

IModel<Tensor<T>, Tensor<T>, ModelMetadata<T>>

IModelSerializer

ICheckpointableModel

IParameterizable<T, Tensor<T>, Tensor<T>>

IFeatureAware

IFeatureImportance<T>

ICloneable<IFullModel<T, Tensor<T>, Tensor<T>>>

IGradientComputable<T, Tensor<T>, Tensor<T>>

IJitCompilable<T>

Derived: CodeBERT<T>

CodeT5<T>

GraphCodeBERT<T>

Inherited Members: NeuralNetworkBase<T>.Layers

NeuralNetworkBase<T>.LayerCount

NeuralNetworkBase<T>.Architecture

NeuralNetworkBase<T>.NumOps

NeuralNetworkBase<T>.Engine

NeuralNetworkBase<T>._layerInputs

NeuralNetworkBase<T>._layerOutputs

NeuralNetworkBase<T>.Random

NeuralNetworkBase<T>.LossFunction

NeuralNetworkBase<T>.LastLoss

NeuralNetworkBase<T>.IsTrainingMode

NeuralNetworkBase<T>.SupportsTraining

NeuralNetworkBase<T>.SupportsGpuTraining

NeuralNetworkBase<T>.CanTrainOnGpu

NeuralNetworkBase<T>.GpuEngine

NeuralNetworkBase<T>.MaxGradNorm

NeuralNetworkBase<T>._mixedPrecisionContext

NeuralNetworkBase<T>._memoryManager

NeuralNetworkBase<T>.IsMemoryManagementEnabled

NeuralNetworkBase<T>.IsGradientCheckpointingEnabled

NeuralNetworkBase<T>.IsMixedPrecisionEnabled

NeuralNetworkBase<T>.ClipGradients(List<Tensor<T>>)

NeuralNetworkBase<T>.ClipGradient(Tensor<T>)

NeuralNetworkBase<T>.ClipGradient(Vector<T>)

NeuralNetworkBase<T>.GetParameters()

NeuralNetworkBase<T>.Backpropagate(Tensor<T>)

NeuralNetworkBase<T>.BackpropagateWithRecompute(Tensor<T>)

NeuralNetworkBase<T>.ForwardGpu(IGpuTensor<T>)

NeuralNetworkBase<T>.BackpropagateGpu(IGpuTensor<T>)

NeuralNetworkBase<T>.BackpropagateGpuDeferred(IGpuTensor<T>, GpuExecutionOptions)

NeuralNetworkBase<T>.UpdateParametersGpu(T, T, T)

NeuralNetworkBase<T>.UpdateParametersGpu(IGpuOptimizerConfig)

NeuralNetworkBase<T>.UpdateParametersGpuDeferred(IGpuOptimizerConfig, GpuExecutionOptions)

NeuralNetworkBase<T>.TrainBatchGpuDeferred(IGpuTensor<T>, IGpuTensor<T>, IGpuOptimizerConfig, GpuExecutionOptions)

NeuralNetworkBase<T>.TrainBatchGpuDeferredAsync(IGpuTensor<T>, IGpuTensor<T>, IGpuOptimizerConfig, GpuExecutionOptions, CancellationToken)

NeuralNetworkBase<T>.UploadWeightsToGpu()

NeuralNetworkBase<T>.DownloadWeightsFromGpu()

NeuralNetworkBase<T>.ZeroGradientsGpu()

NeuralNetworkBase<T>.ExtractSingleExample(Tensor<T>, int)

NeuralNetworkBase<T>.ForwardWithMemory(Tensor<T>)

NeuralNetworkBase<T>.ForwardWithCheckpointing(Tensor<T>)

NeuralNetworkBase<T>.CanUseGpuResidentPath()

NeuralNetworkBase<T>.TryForwardGpuOptimized(Tensor<T>, out Tensor<T>)

NeuralNetworkBase<T>.ForwardGpu(Tensor<T>)

NeuralNetworkBase<T>.ForwardDeferred(Tensor<T>)

NeuralNetworkBase<T>.ForwardDeferredAsync(Tensor<T>, CancellationToken)

NeuralNetworkBase<T>.BeginGpuExecution(GpuExecutionOptions)

NeuralNetworkBase<T>.ForwardWithGpuContext(Tensor<T>)

NeuralNetworkBase<T>.ForwardWithGpuContext(IGpuTensor<T>)

NeuralNetworkBase<T>.GetGpuMemoryStats()

NeuralNetworkBase<T>.ForwardWithFeatures(Tensor<T>, int[])

NeuralNetworkBase<T>.ParameterCount

NeuralNetworkBase<T>.GetParameterCount()

NeuralNetworkBase<T>.InvalidateParameterCountCache()

NeuralNetworkBase<T>.AddLayerToCollection(ILayer<T>)

NeuralNetworkBase<T>.RemoveLayerFromCollection(ILayer<T>)

NeuralNetworkBase<T>.ClearLayers()

NeuralNetworkBase<T>.ValidateCustomLayers(List<ILayer<T>>)

NeuralNetworkBase<T>.ValidateCustomLayersInternal(List<ILayer<T>>)

NeuralNetworkBase<T>.IsValidInputLayer(ILayer<T>)

NeuralNetworkBase<T>.IsValidOutputLayer(ILayer<T>)

NeuralNetworkBase<T>.AreLayersCompatible(ILayer<T>, ILayer<T>)

NeuralNetworkBase<T>.GetParameterGradients()

NeuralNetworkBase<T>.EnsureArchitectureInitialized()

NeuralNetworkBase<T>.InitializeLayers()

NeuralNetworkBase<T>.SetTrainingMode(bool)

NeuralNetworkBase<T>.EnableMemoryManagement(TrainingMemoryConfig)

NeuralNetworkBase<T>.DisableMemoryManagement()

NeuralNetworkBase<T>.GetMemoryEstimate(int, int)

NeuralNetworkBase<T>.GetLastLoss()

NeuralNetworkBase<T>.Train(Tensor<T>, Tensor<T>)

NeuralNetworkBase<T>.GetModelMetadata()

NeuralNetworkBase<T>.ResetState()

NeuralNetworkBase<T>.BackwardWithInputGradient(Tensor<T>)

NeuralNetworkBase<T>.ComputeInputGradient(Vector<T>, Vector<T>)

NeuralNetworkBase<T>.ComputeInputGradient(Tensor<T>, Tensor<T>)

NeuralNetworkBase<T>.SaveModel(string)

NeuralNetworkBase<T>.LoadModel(string)

NeuralNetworkBase<T>.Serialize()

NeuralNetworkBase<T>.Deserialize(byte[])

NeuralNetworkBase<T>.SerializeNetworkSpecificData(BinaryWriter)

NeuralNetworkBase<T>.DeserializeNetworkSpecificData(BinaryReader)

NeuralNetworkBase<T>.WithParameters(Vector<T>)

NeuralNetworkBase<T>.GetActiveFeatureIndices()

NeuralNetworkBase<T>.IsFeatureUsed(int)

NeuralNetworkBase<T>.DeepCopy()

NeuralNetworkBase<T>.Clone()

NeuralNetworkBase<T>.CreateNewInstance()

NeuralNetworkBase<T>.SetActiveFeatureIndices(IEnumerable<int>)

NeuralNetworkBase<T>._enabledMethods

NeuralNetworkBase<T>._sensitiveFeatures

NeuralNetworkBase<T>._fairnessMetrics

NeuralNetworkBase<T>._baseModel

NeuralNetworkBase<T>.GetGlobalFeatureImportanceAsync()

NeuralNetworkBase<T>.GetLocalFeatureImportanceAsync(Tensor<T>)

NeuralNetworkBase<T>.GetShapValuesAsync(Tensor<T>)

NeuralNetworkBase<T>.GetLimeExplanationAsync(Tensor<T>, int)

NeuralNetworkBase<T>.GetPartialDependenceAsync(Vector<int>, int)

NeuralNetworkBase<T>.GetCounterfactualAsync(Tensor<T>, Tensor<T>, int)

NeuralNetworkBase<T>.GetModelSpecificInterpretabilityAsync()

NeuralNetworkBase<T>.GenerateTextExplanationAsync(Tensor<T>, Tensor<T>)

NeuralNetworkBase<T>.GetFeatureInteractionAsync(int, int)

NeuralNetworkBase<T>.ValidateFairnessAsync(Tensor<T>, int)

NeuralNetworkBase<T>.GetAnchorExplanationAsync(Tensor<T>, T)

NeuralNetworkBase<T>.SetBaseModel<TInput, TOutput>(IFullModel<T, TInput, TOutput>)

NeuralNetworkBase<T>.EnableMethod(params InterpretationMethod[])

NeuralNetworkBase<T>.ConfigureFairness(Vector<int>, params FairnessMetric[])

NeuralNetworkBase<T>.GetNamedLayerActivations(Tensor<T>)

NeuralNetworkBase<T>.GetArchitecture()

NeuralNetworkBase<T>.GetFeatureImportance()

NeuralNetworkBase<T>.SetParameters(Vector<T>)

NeuralNetworkBase<T>.AddLayer(LayerType, int, ActivationFunction)

NeuralNetworkBase<T>.AddConvolutionalLayer(int, int, int, ActivationFunction)

NeuralNetworkBase<T>.AddLSTMLayer(int, bool)

NeuralNetworkBase<T>.AddDropoutLayer(double)

NeuralNetworkBase<T>.AddBatchNormalizationLayer(int, double, double)

NeuralNetworkBase<T>.AddPoolingLayer(int[], PoolingType, int, int?)

NeuralNetworkBase<T>.GetGradients()

NeuralNetworkBase<T>.GetInputShape()

NeuralNetworkBase<T>.GetLayerActivations(Tensor<T>)

NeuralNetworkBase<T>.DefaultLossFunction

NeuralNetworkBase<T>.ComputeGradients(Tensor<T>, Tensor<T>, ILossFunction<T>)

NeuralNetworkBase<T>.ApplyGradients(Vector<T>, T)

NeuralNetworkBase<T>.SaveState(Stream)

NeuralNetworkBase<T>.LoadState(Stream)

NeuralNetworkBase<T>.Dispose()

NeuralNetworkBase<T>.Dispose(bool)

NeuralNetworkBase<T>.SupportsJitCompilation

NeuralNetworkBase<T>.ExportComputationGraph(List<ComputationNode<T>>)

NeuralNetworkBase<T>.ConvertLayerToGraph(ILayer<T>, ComputationNode<T>)

object.Equals(object)

object.Equals(object, object)

object.GetHashCode()

object.GetType()

object.MemberwiseClone()

object.ReferenceEquals(object, object)

object.ToString()

Extension Methods: DistributedExtensions.AsDistributedForHighBandwidth<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributedForLowBandwidth<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributed<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, ICommunicationBackend<T>)

DistributedExtensions.AsDistributed<T, TInput, TOutput>(IFullModel<T, TInput, TOutput>, IShardingConfiguration<T>)

Constructors

CodeModelBase(CodeSynthesisArchitecture<T>, ILossFunction<T>, ITokenizer?)

protected CodeModelBase(CodeSynthesisArchitecture<T> architecture, ILossFunction<T> lossFunction, ITokenizer? tokenizer = null)

Parameters

architecture CodeSynthesisArchitecture<T>
lossFunction ILossFunction<T>
tokenizer ITokenizer

Properties

CodeArchitecture

protected CodeSynthesisArchitecture<T> CodeArchitecture { get; }

Property Value

CodeSynthesisArchitecture<T>

MaxSequenceLength

Gets the maximum sequence length (in tokens) that the model can process.

public int MaxSequenceLength { get; }

Property Value

int

Remarks

Code models process code as sequences of tokens. This property specifies the maximum number of tokens the model can handle at once.

For Beginners: This is like the maximum length of code the model can read at once.

Code is broken into pieces called "tokens" (like words in a sentence). This number tells you the maximum number of tokens the model can process, which roughly corresponds to how long a code file can be.

TargetLanguage

Gets the target programming language for this model.

public ProgramLanguage TargetLanguage { get; }

Property Value

ProgramLanguage

Remarks

Specifies which programming language this model is designed to work with. Some models are language-specific, while others can work with multiple languages.

For Beginners: This tells you which programming language the model knows.

Like a translator who specializes in French or Spanish, code models often specialize in specific programming languages like Python or Java.

Tokenizer

protected ITokenizer Tokenizer { get; }

Property Value

ITokenizer

VocabularySize

Gets the vocabulary size of the model.

public int VocabularySize { get; }

Property Value

int

Remarks

The vocabulary consists of all the tokens (keywords, operators, identifiers, etc.) that the model knows and can work with.

For Beginners: This is like the model's dictionary size.

It tells you how many different code tokens the model knows. A larger vocabulary means the model can handle more diverse code patterns and identifiers.

Methods

CreateTransformerModelMetadata(string, IReadOnlyDictionary<string, object>?, string)

protected ModelMetadata<T> CreateTransformerModelMetadata(string modelName, IReadOnlyDictionary<string, object>? extraInfo, string optimizerName)

Parameters

modelName string
extraInfo IReadOnlyDictionary<string, object>
optimizerName string

Returns

ModelMetadata<T>

DecodeCode(Tensor<T>)

Decodes a vector representation back into source code.

public string DecodeCode(Tensor<T> encoding)

Parameters

encoding Tensor<T>: The encoded representation to decode.

Returns

string: The decoded source code as a string.

Remarks

Decoding transforms the model's internal numerical representation back into human-readable source code.

For Beginners: Decoding converts the AI's numerical format back to readable code.

After the AI processes code in numerical form, we need to convert it back to text that humans can read and computers can execute. This is the reverse of encoding.

EncodeCode(string)

Encodes source code into a vector representation.

public Tensor<T> EncodeCode(string code)

Parameters

code string: The source code to encode.

Returns

Tensor<T>: A tensor representing the encoded code.

Remarks

Encoding transforms source code (text) into a numerical representation that the model can process. This representation captures semantic information about the code.

For Beginners: Encoding converts code text into numbers the AI can understand.

Computers can't directly work with text, so we convert code into numerical form. This encoding captures the meaning of the code, not just the characters. Like translating emotions into emoji - different form, same meaning.

GetEmbeddings(string)

Gets embeddings for code tokens.

public virtual Tensor<T> GetEmbeddings(string code)

Parameters

code string: The source code to get embeddings for.

Returns

Tensor<T>: A tensor containing token embeddings.

Remarks

Embeddings are dense vector representations of code tokens that capture semantic similarities. Similar code constructs have similar embeddings.

For Beginners: Embeddings represent each piece of code as a point in space.

Code with similar meaning is placed close together in this space. For example, "for loop" and "while loop" would be near each other because they're both loops, but far from "function definition" because that's a different concept.

PerformBugDetection(CodeBugDetectionRequest)

protected virtual CodeBugDetectionResult PerformBugDetection(CodeBugDetectionRequest request)

Parameters

request CodeBugDetectionRequest

Returns

CodeBugDetectionResult

PerformBugFixing(CodeBugFixingRequest)

protected virtual CodeBugFixingResult PerformBugFixing(CodeBugFixingRequest request)

Parameters

request CodeBugFixingRequest

Returns

CodeBugFixingResult

PerformCloneDetection(CodeCloneDetectionRequest)

protected virtual CodeCloneDetectionResult PerformCloneDetection(CodeCloneDetectionRequest request)

Parameters

request CodeCloneDetectionRequest

Returns

CodeCloneDetectionResult

PerformCodeReview(CodeReviewRequest)

protected virtual CodeReviewResult PerformCodeReview(CodeReviewRequest request)

Parameters

request CodeReviewRequest

Returns

CodeReviewResult

PerformCompletion(CodeCompletionRequest)

protected virtual CodeCompletionResult PerformCompletion(CodeCompletionRequest request)

Parameters

request CodeCompletionRequest

Returns

CodeCompletionResult

PerformDocumentation(CodeDocumentationRequest)

protected virtual CodeDocumentationResult PerformDocumentation(CodeDocumentationRequest request)

Parameters

request CodeDocumentationRequest

Returns

CodeDocumentationResult

PerformGeneration(CodeGenerationRequest)

protected virtual CodeGenerationResult PerformGeneration(CodeGenerationRequest request)

Parameters

request CodeGenerationRequest

Returns

CodeGenerationResult

PerformRefactoring(CodeRefactoringRequest)

protected virtual CodeRefactoringResult PerformRefactoring(CodeRefactoringRequest request)

Parameters

request CodeRefactoringRequest

Returns

CodeRefactoringResult

PerformSearch(CodeSearchRequest)

protected virtual CodeSearchResult PerformSearch(CodeSearchRequest request)

Parameters

request CodeSearchRequest

Returns

CodeSearchResult

PerformSummarization(CodeSummarizationRequest)

protected virtual CodeSummarizationResult PerformSummarization(CodeSummarizationRequest request)

Parameters

request CodeSummarizationRequest

Returns

CodeSummarizationResult

PerformTask(CodeTaskRequestBase)

Performs a code-related task and returns a structured result type.

public CodeTaskResultBase PerformTask(CodeTaskRequestBase request)

Parameters

request CodeTaskRequestBase: The task request.

Returns

CodeTaskResultBase: A structured task result.

PerformTask(string, CodeTask)

Performs a code-related task on the input code.

[Obsolete("Use PerformTask(CodeTaskRequestBase) for structured outputs.")]
public string PerformTask(string code, CodeTask task)

Parameters

code string: The source code to process.
task CodeTask: The type of task to perform.

Returns

string: The result of the task as a string.

Remarks

This method allows the model to perform various code-related tasks such as completion, summarization, bug detection, etc. based on the specified task type.

For Beginners: This method lets you tell the model what to do with the code.

You provide code and specify what you want done with it:

Complete it
Summarize it
Find bugs
Generate documentation

The model then performs that specific task and returns the result.

PerformTestGeneration(CodeTestGenerationRequest)

protected virtual CodeTestGenerationResult PerformTestGeneration(CodeTestGenerationRequest request)

Parameters

request CodeTestGenerationRequest

Returns

CodeTestGenerationResult

PerformTranslation(CodeTranslationRequest)

protected virtual CodeTranslationResult PerformTranslation(CodeTranslationRequest request)

Parameters

request CodeTranslationRequest

Returns

CodeTranslationResult

PerformUnderstanding(CodeUnderstandingRequest)

protected virtual CodeUnderstandingResult PerformUnderstanding(CodeUnderstandingRequest request)

Parameters

request CodeUnderstandingRequest

Returns

CodeUnderstandingResult

Predict(Tensor<T>)

Makes a prediction using the neural network.

public override Tensor<T> Predict(Tensor<T> input)

Parameters

input Tensor<T>: The input data to process.

Returns

Tensor<T>: The network's prediction.

Remarks

For Beginners: This is the main method you'll use to get results from your trained neural network. You provide some input data (like an image or text), and the network processes it through all its layers to produce an output (like a classification or prediction).

TrainWithOptimizer(Tensor<T>, Tensor<T>, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>)

protected void TrainWithOptimizer(Tensor<T> input, Tensor<T> expectedOutput, IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>> optimizer)

Parameters

input Tensor<T>
expectedOutput Tensor<T>
optimizer IGradientBasedOptimizer<T, Tensor<T>, Tensor<T>>

UpdateParameters(Vector<T>)

Updates the network's parameters with new values.

public override void UpdateParameters(Vector<T> parameters)

Parameters

parameters Vector<T>: The new parameter values to set.

Remarks

For Beginners: During training, a neural network's internal values (parameters) get adjusted to improve its performance. This method allows you to update all those values at once by providing a complete set of new parameters.

This is typically used by optimization algorithms that calculate better parameter values based on training data.

Table of Contents

Class CodeModelBase<T>

Type Parameters

Constructors

CodeModelBase(CodeSynthesisArchitecture<T>, ILossFunction<T>, ITokenizer?)

Parameters

Properties

CodeArchitecture

Property Value

MaxSequenceLength

Property Value

Remarks

TargetLanguage

Property Value

Remarks

Tokenizer

Property Value

VocabularySize

Property Value

Remarks

Methods

CreateTransformerModelMetadata(string, IReadOnlyDictionary<string, object>?, string)

Parameters

Returns

DecodeCode(Tensor<T>)

Parameters

Returns

Remarks

EncodeCode(string)

Parameters

Returns

Remarks

GetEmbeddings(string)

Parameters

Returns

Remarks

PerformBugDetection(CodeBugDetectionRequest)

Parameters

Returns

PerformBugFixing(CodeBugFixingRequest)

Parameters

Returns

PerformCloneDetection(CodeCloneDetectionRequest)

Parameters

Returns

PerformCodeReview(CodeReviewRequest)

Parameters

Returns

PerformCompletion(CodeCompletionRequest)

Parameters

Returns

PerformDocumentation(CodeDocumentationRequest)

Parameters

Returns

PerformGeneration(CodeGenerationRequest)

Parameters

Returns

PerformRefactoring(CodeRefactoringRequest)

Parameters

Returns

PerformSearch(CodeSearchRequest)

Parameters

Returns

PerformSummarization(CodeSummarizationRequest)

Parameters

Returns

PerformTask(CodeTaskRequestBase)

Parameters

Returns

PerformTask(string, CodeTask)

Parameters

Returns

Remarks

PerformTestGeneration(CodeTestGenerationRequest)

Parameters

Returns

PerformTranslation(CodeTranslationRequest)

Parameters

Returns

PerformUnderstanding(CodeUnderstandingRequest)