Table of Contents

Namespace AiDotNet.ReinforcementLearning.Agents.AdvancedRL

Classes

LSPIAgent<T>

LSPI (Least-Squares Policy Iteration) agent using iterative policy improvement with LSTDQ.

LSTDAgent<T>

LSTD (Least-Squares Temporal Difference) agent using direct solution for value function weights.

LinearQLearningAgent<T>

Linear Q-Learning agent using linear function approximation.

LinearSARSAAgent<T>

Linear SARSA agent using linear function approximation with on-policy learning.

TabularActorCriticAgent<T>

Tabular Actor-Critic agent combining policy and value learning.