interactive learning: MAB, linear bandits, contextual bandits, linear dynamical systems, MDP (open problem) Linear bandits: pure exploration algorithms -> LinGapE (computationally efficient), RAGE
Zhaoqi Li
Ph.D. Candidate in statistics at the University of Washington
- University of Washington
- Google Scholar