Search for dissertations about: "Odalric-ambrym Maillard"

Found 1 Swedish dissertation containing the words Odalric-ambrym Maillard.

  1. Efficient Online Learning under Bandit Feedback

    Author : Stefan Magureanu; Alexandre Proutiere; Odalric-Ambrym Maillard; KTH
    Keywords : ENGINEERING AND TECHNOLOGY; multi-armed bandits; reinforcement learning; learning to rank; Electrical Engineering

    Abstract : In this thesis we address the multi-armed bandit (MAB) problem with stochastic rewards and correlated arms. In particular, we investigate the case where the expected rewards are a Lipschitz function of the arm, and extend these results to bandits with arbitrary structure that is known to the decision maker.