Class DocumentReader<T>

Namespace: AiDotNet.ComputerVision.OCR.EndToEnd

Assembly: AiDotNet.dll

Document reader for OCR with layout analysis.

public class DocumentReader<T>

Type Parameters

T: The numeric type used for calculations.

Inheritance: object

DocumentReader<T>

Inherited Members: object.Equals(object)

object.Equals(object, object)

object.GetHashCode()

object.GetType()

object.MemberwiseClone()

object.ReferenceEquals(object, object)

object.ToString()

Remarks

For Beginners: DocumentReader is optimized for reading structured documents like scanned papers, forms, and PDFs. Unlike scene text, documents have regular layouts with clear reading order. This reader analyzes the document structure and extracts text in logical reading order.

Key features: - Layout analysis for document structure understanding - Reading order detection - Paragraph and line grouping - Handles multi-column layouts - Optimized for clean text on uniform backgrounds

Constructors

DocumentReader(OCROptions<T>)

Creates a new document reader.

public DocumentReader(OCROptions<T> options)

Parameters

options OCROptions<T>

Properties

Name

Name of this document reader.

public string Name { get; }

Property Value

string

Methods

GetParameterCount()

Gets the total parameter count.

public long GetParameterCount()

Returns

long

ReadDocument(Tensor<T>)

Reads a document image and returns structured text.

public DocumentOCRResult<T> ReadDocument(Tensor<T> image)

Parameters

image Tensor<T>: Document image tensor [batch, channels, height, width].

Returns

DocumentOCRResult<T>: Document OCR result with layout information.

Table of Contents

Class DocumentReader<T>

Type Parameters

Remarks

Constructors

DocumentReader(OCROptions<T>)

Parameters

Properties

Name

Property Value

Methods

GetParameterCount()

Returns

ReadDocument(Tensor<T>)

Parameters

Returns