Class RTDETR<T>
- Namespace
- AiDotNet.ComputerVision.Detection.ObjectDetection.DETR
- Assembly
- AiDotNet.dll
RT-DETR (Real-Time DEtection TRansformer) - First real-time end-to-end object detector.
public class RTDETR<T> : ObjectDetectorBase<T>
Type Parameters
TThe numeric type used for calculations.
- Inheritance
-
RTDETR<T>
- Inherited Members
Remarks
For Beginners: RT-DETR is the first real-time end-to-end transformer-based object detector. It achieves YOLO-level speed while maintaining transformer accuracy by using a hybrid encoder and efficient decoder design.
Key features: - Hybrid encoder with intra-scale and cross-scale feature interaction - Efficient decoder with uncertainty-minimal query selection - Flexible inference speed/accuracy trade-off - No NMS post-processing required
Reference: Lv et al., "DETRs Beat YOLOs on Real-time Object Detection", CVPR 2024
Constructors
RTDETR(ObjectDetectionOptions<T>)
Creates a new RT-DETR detector.
public RTDETR(ObjectDetectionOptions<T> options)
Parameters
optionsObjectDetectionOptions<T>Detection options.
Properties
Name
Name of this detector architecture.
public override string Name { get; }
Property Value
Methods
Detect(Tensor<T>, double, double)
Detects objects in an image with custom thresholds.
public override DetectionResult<T> Detect(Tensor<T> image, double confidenceThreshold, double nmsThreshold)
Parameters
imageTensor<T>Input image tensor.
confidenceThresholddoubleMinimum confidence to keep a detection.
nmsThresholddoubleIoU threshold for NMS.
Returns
- DetectionResult<T>
Detection results.
Forward(Tensor<T>)
Performs forward pass through the network.
protected override List<Tensor<T>> Forward(Tensor<T> input)
Parameters
inputTensor<T>Input image tensor.
Returns
- List<Tensor<T>>
Raw network outputs before post-processing.
GetHeadParameterCount()
Gets the number of parameters in the detection head.
protected override long GetHeadParameterCount()
Returns
- long
Number of parameters.
LoadWeightsAsync(string, CancellationToken)
Loads pre-trained weights from a file or URL.
public override Task LoadWeightsAsync(string pathOrUrl, CancellationToken cancellationToken = default)
Parameters
pathOrUrlstringLocal file path or URL to weights.
cancellationTokenCancellationTokenCancellation token.
Returns
PostProcess(List<Tensor<T>>, int, int, double, double)
Post-processes raw network outputs into detections.
protected override List<Detection<T>> PostProcess(List<Tensor<T>> outputs, int imageWidth, int imageHeight, double confidenceThreshold, double nmsThreshold)
Parameters
outputsList<Tensor<T>>Raw network outputs.
imageWidthintOriginal image width.
imageHeightintOriginal image height.
confidenceThresholddoubleMinimum confidence threshold.
nmsThresholddoubleNMS IoU threshold.
Returns
SaveWeights(string)
Saves model weights to a file.
public override void SaveWeights(string path)
Parameters
pathstringFile path to save weights.