Namespace AiDotNet.ReinforcementLearning.Agents.AdvancedRL

Classes

LSPIAgent<T>: LSPI (Least-Squares Policy Iteration) agent using iterative policy improvement with LSTDQ.

LSTDAgent<T>: LSTD (Least-Squares Temporal Difference) agent using direct solution for value function weights.

LinearQLearningAgent<T>: Linear Q-Learning agent using linear function approximation.

LinearSARSAAgent<T>: Linear SARSA agent using linear function approximation with on-policy learning.

TabularActorCriticAgent<T>: Tabular Actor-Critic agent combining policy and value learning.