Namespace AiDotNet.ReinforcementLearning.Agents.AdvancedRL
Classes
- LSPIAgent<T>
LSPI (Least-Squares Policy Iteration) agent using iterative policy improvement with LSTDQ.
- LSTDAgent<T>
LSTD (Least-Squares Temporal Difference) agent using direct solution for value function weights.
- LinearQLearningAgent<T>
Linear Q-Learning agent using linear function approximation.
- LinearSARSAAgent<T>
Linear SARSA agent using linear function approximation with on-policy learning.
- TabularActorCriticAgent<T>
Tabular Actor-Critic agent combining policy and value learning.