From Bandits to Monte-Carlo Tree Search: The Optimistic Principle Applied to Optimization and Planning (Foundations and Trends(r) in Machine Learning)

Munos, R�mi

ISBN 10: 1601987668 ISBN 13: 9781601987662

Verlag: Now Publishers, 2014

Neu Softcover

Verk�ufer Ria Christie Collections, Uxbridge, Vereinigtes K�nigreich Verk�uferbewertung 5 von 5 Sternen

AbeBooks-Verk�ufer seit 25. M�rz 2015

Dieses Buch ist nicht mehr verf�gbar. AbeBooks f�hrt Millionen von B�chern. Bitte geben Sie unten Suchbegriffe ein, um �hnliche Exemplare zu finden.

Alle Artikel dieses Verk�ufers anzeigen Ein Kaufgesuch f�r �hnliche Artikel erstellen

Alle Exemplare dieses Buches anzeigen

Beschreibung

In. Bestandsnummer des Verk�ufers ria9781601987662_new

Diesen Artikel melden

Inhaltsangabe:

From Bandits to Monte-Carlo Tree Search: The Optimistic Principle Applied to Optimization and Planning covers several aspects of the "optimism in the face of uncertainty" principle for large scale optimization problems under finite numerical budget.The monograph’s initial motivation came from the empirical success of the so-called "Monte-Carlo Tree Search" method popularized in Computer Go and further extended to many other games as well as optimization and planning problems. It lays out the theoretical foundations of the field by characterizing the complexity of the optimization problems and designing efficient algorithms with performance guarantees.The main direction followed in this monograph consists in decomposing a complex decision making problem (such as an optimization problem in a large search space) into a sequence of elementary decisions, where each decision of the sequence is solved using a stochastic "multi-armed bandit" (mathematical model for decision making in stochastic environments). This defines a hierarchical search which possesses the nice feature of starting the exploration by a quasi-uniform sampling of the space and then focusing, at different scales, on the most promising areas (using the optimistic principle) until eventually performing a local search around the global optima of the function.This monograph considers the problem of function optimization in general search spaces (such as metric spaces, structured spaces, trees, and graphs) as well as the problem of planning in Markov decision processes. Its main contribution is a class of hierarchical optimistic algorithms with different algorithmic instantiations depending on whether the evaluations are noisy or noiseless and whether some measure of the local "smoothness" of the function around the global maximum is known or unknown.

Rese�a del editor: From Bandits to Monte-Carlo Tree Search: The Optimistic Principle Applied to Optimization and Planning covers several aspects of the "optimism in the face of uncertainty" principle for large scale optimization problems under finite numerical budget. The monograph's initial motivation came from the empirical success of the so-called "Monte-Carlo Tree Search" method popularized in Computer Go and further extended to many other games as well as optimization and planning problems. It lays out the theoretical foundations of the field by characterizing the complexity of the optimization problems and designing efficient algorithms with performance guarantees. The main direction followed in this monograph consists in decomposing a complex decision making problem (such as an optimization problem in a large search space) into a sequence of elementary decisions, where each decision of the sequence is solved using a stochastic "multi-armed bandit" (mathematical model for decision making in stochastic environments). This defines a hierarchical search which possesses the nice feature of starting the exploration by a quasi-uniform sampling of the space and then focusing, at different scales, on the most promising areas (using the optimistic principle) until eventually performing a local search around the global optima of the function. This monograph considers the problem of function optimization in general search spaces (such as metric spaces, structured spaces, trees, and graphs) as well as the problem of planning in Markov decision processes. Its main contribution is a class of hierarchical optimistic algorithms with different algorithmic instantiations depending on whether the evaluations are noisy or noiseless and whether some measure of the local ''smoothness'' of the function around the global maximum is known or unknown.

��ber diesen Titel� kann sich auf eine andere Ausgabe dieses Titels beziehen.

Bibliografische Details

Titel: From Bandits to Monte-Carlo Tree Search: The...
Verlag: Now Publishers
Erscheinungsdatum: 2014
Einband: Softcover
Zustand: New

ZVAB ist ein Internet-Marktplatz f�r neue, gebrauchte, antiquarische und vergriffene B�cher. Bei uns finden Sie Tausende professioneller Buchh�ndler weltweit und Millionen B�cher. Einkaufen beim ZVAB ist einfach und zu 100% sicher — Suchen Sie nach Ihrem Buch, erwerben Sie es �ber unsere sichere Kaufabwicklung und erhalten Sie Ihr Buch direkt vom H�ndler.

Millionen neuer und gebrauchter B�cher bei tausenden Anbietern

Antiquarische B�cher

Von seltenen Erstausgaben bis hin zu begehrten signierten Ausgaben – beim ZVAB finden Sie eine gro�e Anzahl seltener, wertvoller B�cher und Sammlerst�cke.

ZVAB Startseite

Erstausgaben

Erstausgaben sind besondere B�cher, die den ersten Abdruck des Textes in seiner urspr�nglichen Form darstellen. Hier finden sie Erstausgaben von damals bis heute.

Erstausgaben

Gebrauchte B�cher

Ob Bestseller oder Klassiker, das ZVAB bietet Ihnen eine breite Auswahl an gebrauchten B�chern: St�bern Sie in unseren Rubriken und entdecken Sie ein Buch-Schn�ppchen.

Gebrauchte B�cher

From Bandits to Monte-Carlo Tree Search: The Optimistic Principle Applied to Optimization and Planning (Foundations and Trends(r) in Machine Learning)

Munos, R�mi

Beschreibung

Beschreibung:

Inhaltsangabe:

Bibliografische Details

Millionen neuer und gebrauchter B�cher bei tausenden Anbietern

Antiquarische B�cher

Erstausgaben

Gebrauchte B�cher

Mehr B�cher entdecken