Regret Analysis of Stochastic and Nonstochastic Multi-Armed Bandit Problems (Foundations and Trends® in Machine Learning)

Bubeck, Sébastien; Cesa-Bianchi, Nicolò

ISBN 10: 1601986262 ISBN 13: 9781601986269
Publisher: Now Publishers, 2012
New Softcover

Seller: Ria Christie Collections, Uxbridge, United Kingdom. Seller rating: 5 out of 5 stars.

AbeBooks seller since 25 March 2015

This book is no longer available.

Description

Seller inventory number: ria9781601986269_new

Synopsis:

A multi-armed bandit problem - or, simply, a bandit problem - is a sequential allocation problem defined by a set of actions. At each time step, a unit resource is allocated to an action and some observable payoff is obtained. The goal is to maximize the total payoff obtained in a sequence of allocations. The name bandit refers to the colloquial term for a slot machine (a "one-armed bandit" in American slang). In a casino, a sequential allocation problem is obtained when the player is facing many slot machines at once (a "multi-armed bandit") and must repeatedly choose where to insert the next coin. Multi-armed bandit problems are the most basic examples of sequential decision problems with an exploration-exploitation trade-off: the balance between staying with the option that gave the highest payoffs in the past and exploring new options that might give higher payoffs in the future. Although the study of bandit problems dates back to the 1930s, exploration-exploitation trade-offs arise in several modern applications, such as ad placement, website optimization, and packet routing. Mathematically, a multi-armed bandit is defined by the payoff process associated with each option. This book focuses on two extreme cases in which the analysis of regret is particularly simple and elegant: independent and identically distributed payoffs and adversarial payoffs. Besides the basic setting of finitely many actions, it also analyzes some of the most important variants and extensions, such as the contextual bandit model. This monograph is an ideal reference for students and researchers with an interest in bandit problems.
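
To make the exploration-exploitation trade-off concrete, the following is a minimal sketch of one classic strategy for the i.i.d. (stochastic) case, UCB1, played against Bernoulli-distributed arms. The arm means, the horizon, and the ucb1 helper are illustrative assumptions for this sketch, not material taken from the monograph.

import math
import random

# Minimal sketch (illustrative assumptions): UCB1 on i.i.d. Bernoulli arms.
def ucb1(arm_means, horizon, seed=0):
    rng = random.Random(seed)
    n_arms = len(arm_means)
    counts = [0] * n_arms      # how many times each arm was played
    sums = [0.0] * n_arms      # total payoff observed per arm
    total_payoff = 0.0

    for t in range(1, horizon + 1):
        if t <= n_arms:
            arm = t - 1        # play each arm once to initialize its estimate
        else:
            # pick the arm maximizing empirical mean plus an exploration bonus
            arm = max(
                range(n_arms),
                key=lambda i: sums[i] / counts[i]
                + math.sqrt(2 * math.log(t) / counts[i]),
            )
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        counts[arm] += 1
        sums[arm] += reward
        total_payoff += reward

    # regret relative to always playing the arm with the highest mean
    return horizon * max(arm_means) - total_payoff

if __name__ == "__main__":
    # illustrative arm means and horizon
    print(ucb1(arm_means=[0.2, 0.5, 0.7], horizon=10000))

The returned quantity grows only logarithmically with the horizon in the stochastic setting, which is the kind of regret bound the monograph establishes and contrasts with the adversarial case.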

"About this title" may refer to a different edition of this title.

Bibliographic Details

Title: Regret Analysis of Stochastic and Nonstochastic Multi-Armed Bandit Problems
Publisher: Now Publishers
Publication date: 2012
Binding: Softcover
Condition: New
