Search for dissertations about: "Regret Minimization"

Found 5 swedish dissertations containing the words Regret Minimization.

  1. 1. Regret Minimization in Structured Reinforcement Learning

    Author : Damianos Tranos; Alexandre Proutiere; Yevgeny Seldin; KTH; []
    Keywords : TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; Reinforcement Learning; Electrical Engineering; Elektro- och systemteknik;

    Abstract : We consider a class of sequential decision making problems in the presence of uncertainty, which belongs to the field of Reinforcement Learning (RL). Specifically, we study discrete Markov decision Processes (MDPs) which model a decision maker or agent that interacts with a stochastic and dynamic environment and receives feedback from it in the form of a reward. READ MORE

  2. 2. Minimizing Regret in Combinatorial Bandits and Reinforcement Learning

    Author : Mohammad Sadegh Talebi Mazraeh Shahi; Alexandre Proutiere; Mikael Johansson; Ronald Ortner; KTH; []
    Keywords : Multi-armed Bandits; Reinforcement Learning; Regret Minimization; Statistics; Electrical Engineering; Elektro- och systemteknik;

    Abstract : This thesis investigates sequential decision making tasks that fall in the framework of reinforcement learning (RL). These tasks involve a decision maker repeatedly interacting with an environment modeled by an unknown finite Markov decision process (MDP), who wishes to maximize a notion of reward accumulated during her experience. READ MORE

  3. 3. Inference and Online Learning in Structured Stochastic Systems

    Author : Kaito Ariu; Alexandre Proutiere; Mikael Johansson; Wouter Koolen; KTH; []
    Keywords : TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; Electrical Engineering; Elektro- och systemteknik;

    Abstract : This thesis contributes to the field of stochastic online learning problems, with a collection of six papers each addressing unique aspects of online learning and inference problems under specific structures. The first four papers focus on exploration and inference problems, uncovering fundamental information-theoretic limits and efficient algorithms under various structures. READ MORE

  4. 4. Bandit Methods for Network Optimization : Safety, Exploration, and Coordination

    Author : Filippo Vannella; Alexandre Proutiere; Vincent Tan; KTH; []
    Keywords : TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; Electrical Engineering; Elektro- och systemteknik;

    Abstract : The increasing complexity of modern mobile networks poses unprecedented challenges to their optimization. Mobile Network Operators (MNOs) need to control a large number of network parameters to satisfy the users’ demands. READ MORE

  5. 5. Combinatorial Semi-Bandit Methods for Navigation of Electric Vehicles

    Author : Niklas Åkerblom; Chalmers tekniska högskola; []
    Keywords : NATURVETENSKAP; NATURAL SCIENCES; energy-efficient navigation; online learning; multi-armed bandit problem; Thompson sampling; combinatorial semi-bandit problem;

    Abstract : Climate change is one of the most urgent global challenges humanity is currently facing. As major contributors of greenhouse gas emissions, the transport and automotive sectors have crucial roles to play in solving the problem. READ MORE