Class REINFORCEOptions<T>

Namespace: AiDotNet.Models.Options

Assembly: AiDotNet.dll

Configuration options for REINFORCE agents.

public class REINFORCEOptions<T>

Type Parameters

T: The numeric type used for calculations.

Inheritance: object

REINFORCEOptions<T>

Inherited Members: object.Equals(object)

object.Equals(object, object)

object.GetHashCode()

object.GetType()

object.MemberwiseClone()

object.ReferenceEquals(object, object)

object.ToString()

Remarks

REINFORCE is the simplest policy gradient algorithm. It directly optimizes the policy by following the gradient of expected returns.

For Beginners: REINFORCE is the "hello world" of policy gradient methods. It's simple but powerful:

- Play an entire episode
- See which actions led to good rewards
- Make those actions more likely in the future

Think of it like learning to play a game: you play a round, see your score, then adjust your strategy to do better next time.

REINFORCE is simple, but it can be slow to learn and its gradient estimates have high variance. Modern algorithms like PPO improve on REINFORCE's ideas.
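To make the "see which actions led to good rewards" step concrete, here is a minimal sketch of how discounted returns are commonly computed for one finished episode. It is plain C# and does not show AiDotNet's internal implementation; the helper name `DiscountedReturns` and the sample rewards are hypothetical, and `gamma` stands in for the role played by `DiscountFactor`.

```csharp
using System;

// Hypothetical helper, not part of AiDotNet: computes the discounted return
// G_t = r_t + gamma * r_{t+1} + gamma^2 * r_{t+2} + ... for every step t.
static double[] DiscountedReturns(double[] rewards, double gamma)
{
    var returns = new double[rewards.Length];
    double running = 0.0;

    // Walk the episode backwards so each return reuses the one after it.
    for (int t = rewards.Length - 1; t >= 0; t--)
    {
        running = rewards[t] + gamma * running;
        returns[t] = running;
    }

    return returns;
}

// Three steps with reward 1.0 each and gamma = 0.5 (illustrative values).
// The resulting returns are G = [1.75, 1.5, 1.0].
double[] episodeReturns = DiscountedReturns(new[] { 1.0, 1.0, 1.0 }, 0.5);
Console.WriteLine(string.Join(" ", episodeReturns));
```

In a policy gradient update, actions taken at steps with larger returns are made more likely. Because each return is a single noisy sample from one episode, the estimates vary a lot from episode to episode, which is the source of the high variance mentioned above.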

Constructors

REINFORCEOptions()

public REINFORCEOptions()
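A typical way to use the parameterless constructor is with an object initializer that sets the properties listed under Properties. The sketch below assumes `T` is `double` and uses illustrative values (a 4-dimensional state, 2 discrete actions, two hidden layers of 64 units); none of these values are documented defaults, and suitable settings depend on the environment.

```csharp
using System.Collections.Generic;
using AiDotNet.Models.Options;

// Illustrative values only: the defaults assigned by the constructor
// are not stated on this page, so every property is set explicitly.
var options = new REINFORCEOptions<double>
{
    StateSize = 4,                            // assumed observation size
    ActionSize = 2,                           // assumed number of actions
    IsContinuous = false,                     // discrete action space
    HiddenLayers = new List<int> { 64, 64 },  // assumed network layout
    LearningRate = 0.001,                     // assumed step size
    DiscountFactor = 0.99,                    // assumed discount (gamma)
    Seed = 42                                 // fixed seed; null leaves it unset
};
```

Setting `Seed` to a fixed value is presumably intended to make runs repeatable; leave it as `null` if that is not needed.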

Properties

ActionSize

public int ActionSize { get; set; }

Property Value

int

DiscountFactor

public T DiscountFactor { get; set; }

Property Value

T

HiddenLayers

public List<int> HiddenLayers { get; set; }

Property Value

List<int>

IsContinuous

public bool IsContinuous { get; set; }

Property Value

bool

LearningRate

public T LearningRate { get; set; }

Property Value

T

Seed

public int? Seed { get; set; }

Property Value

int?

StateSize

public int StateSize { get; set; }

Property Value

int
