Stochastic optimization
Markov models
Reinforcement learning
Variance
Algorithm
Statistics
Machine learning
Multi-armed bandit