Namespace AiDotNet.ReinforcementLearning.Agents.Bandits

Classes

EpsilonGreedyBanditAgent<T>: Epsilon-Greedy Multi-Armed Bandit agent.

GradientBanditAgent<T>: Gradient Bandit agent using softmax action preferences.

ThompsonSamplingAgent<T>: Thompson Sampling (Bayesian) Multi-Armed Bandit agent.

UCBBanditAgent<T>: Upper Confidence Bound (UCB) Multi-Armed Bandit agent.