Namespace AiDotNet.ReinforcementLearning.Agents.Bandits
Classes
- EpsilonGreedyBanditAgent<T>
Epsilon-Greedy Multi-Armed Bandit agent.
- GradientBanditAgent<T>
Gradient Bandit agent using softmax action preferences.
- ThompsonSamplingAgent<T>
Thompson Sampling (Bayesian) Multi-Armed Bandit agent.
- UCBBanditAgent<T>
Upper Confidence Bound (UCB) Multi-Armed Bandit agent.