Class DINO<T>
- Namespace
- AiDotNet.ComputerVision.Detection.ObjectDetection.DETR
- Assembly
- AiDotNet.dll
DINO (DETR with Improved deNoising anchOr boxes) - State-of-the-art DETR variant.
public class DINO<T> : ObjectDetectorBase<T>
Type Parameters
TThe numeric type used for calculations.
- Inheritance
-
DINO<T>
- Inherited Members
Remarks
For Beginners: DINO improves upon DETR by using contrastive denoising training and mixed query selection. It achieves better performance with faster convergence than the original DETR.
Key improvements: - Contrastive denoising training for better query learning - Mixed query selection (both content and position queries) - Look forward twice for better box predictions - Multi-scale deformable attention (optional)
Reference: Zhang et al., "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection", ICLR 2023
Constructors
DINO(ObjectDetectionOptions<T>)
Creates a new DINO detector.
public DINO(ObjectDetectionOptions<T> options)
Parameters
optionsObjectDetectionOptions<T>Detection options.
Properties
Name
Name of this detector architecture.
public override string Name { get; }
Property Value
Methods
Detect(Tensor<T>, double, double)
Detects objects in an image with custom thresholds.
public override DetectionResult<T> Detect(Tensor<T> image, double confidenceThreshold, double nmsThreshold)
Parameters
imageTensor<T>Input image tensor.
confidenceThresholddoubleMinimum confidence to keep a detection.
nmsThresholddoubleIoU threshold for NMS.
Returns
- DetectionResult<T>
Detection results.
Forward(Tensor<T>)
Performs forward pass through the network.
protected override List<Tensor<T>> Forward(Tensor<T> input)
Parameters
inputTensor<T>Input image tensor.
Returns
- List<Tensor<T>>
Raw network outputs before post-processing.
GetHeadParameterCount()
Gets the number of parameters in the detection head.
protected override long GetHeadParameterCount()
Returns
- long
Number of parameters.
LoadWeightsAsync(string, CancellationToken)
Loads pre-trained weights from a file or URL.
public override Task LoadWeightsAsync(string pathOrUrl, CancellationToken cancellationToken = default)
Parameters
pathOrUrlstringLocal file path or URL to weights.
cancellationTokenCancellationTokenCancellation token.
Returns
PostProcess(List<Tensor<T>>, int, int, double, double)
Post-processes raw network outputs into detections.
protected override List<Detection<T>> PostProcess(List<Tensor<T>> outputs, int imageWidth, int imageHeight, double confidenceThreshold, double nmsThreshold)
Parameters
outputsList<Tensor<T>>Raw network outputs.
imageWidthintOriginal image width.
imageHeightintOriginal image height.
confidenceThresholddoubleMinimum confidence threshold.
nmsThresholddoubleNMS IoU threshold.
Returns
SaveWeights(string)
Saves model weights to a file.
public override void SaveWeights(string path)
Parameters
pathstringFile path to save weights.