Verwandte Artikel zu Algorithms and Data Structures for Massive Datasets

Algorithms and Data Structures for Massive Datasets - Softcover

 
9781617298035: Algorithms and Data Structures for Massive Datasets

Inhaltsangabe

Massive modern datasets make traditional data structures and algorithms grind to a halt. This fun and practical guide introduces cutting-edge techniques that can reliably handle even the largest distributed datasets.

In Algorithms and Data Structures for Massive Datasets you will learn:

Probabilistic sketching data structures for practical problems
Choosing the right database engine for your application
Evaluating and designing efficient on-disk data structures and algorithms
Understanding the algorithmic trade-offs involved in massive-scale systems
Deriving basic statistics from streaming data
Correctly sampling streaming data
Computing percentiles with limited space resources

Algorithms and Data Structures for Massive Datasets reveals a toolbox of new methods that are perfect for handling modern big data applications. You’ll explore the novel data structures and algorithms that underpin Google, Facebook, and other enterprise applications that work with truly massive amounts of data. These effective techniques can be applied to any discipline, from finance to text analysis. Graphics, illustrations, and hands-on industry examples make complex ideas practical to implement in your projects—and there’s no mathematical proofs to puzzle over. Work through this one-of-a-kind guide, and you’ll find the sweet spot of saving space without sacrificing your data’s accuracy.

Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.

About the technology


Standard algorithms and data structures may become slow—or fail altogether—when applied to large distributed datasets. Choosing algorithms designed for big data saves time, increases accuracy, and reduces processing cost. This unique book distills cutting-edge research papers into practical techniques for sketching, streaming, and organizing massive datasets on-disk and in the cloud.

About the book


Algorithms and Data Structures for Massive Datasets introduces processing and analytics techniques for large distributed data. Packed with industry stories and entertaining illustrations, this friendly guide makes even complex concepts easy to understand. You’ll explore real-world examples as you learn to map powerful algorithms like Bloom filters, Count-min sketch, HyperLogLog, and LSM-trees to your own use cases.

What's inside



Probabilistic sketching data structures
Choosing the right database engine
Designing efficient on-disk data structures and algorithms
Algorithmic tradeoffs in massive-scale systems
Computing percentiles with limited space resources

About the reader


Examples in Python, R, and pseudocode.

About the author


Dzejla Medjedovic earned her PhD in the Applied Algorithms Lab at Stony Brook University, New York. Emin Tahirovic earned his PhD in biostatistics from University of Pennsylvania. Illustrator Ines Dedovic earned her PhD at the Institute for Imaging and Computer Vision at RWTH Aachen University, Germany.

 

Table of Contents

1 Introduction
PART 1 HASH-BASED SKETCHES
2 Review of hash tables and modern hashing
3 Approximate membership: Bloom and quotient filters
4 Frequency estimation and count-min sketch
5 Cardinality estimation and HyperLogLog
PART 2 REAL-TIME ANALYTICS
6 Streaming data: Bringing everything together
7 Sampling from data streams
8 Approximate quantiles on data streams
PART 3 DATA STRUCTURES FOR DATABASES AND EXTERNAL MEMORY ALGORITHMS
9 Introducing the external memory model
10 Data structures for databases: B-trees, Be-trees, and LSM-trees
11 External memory sorting

Die Inhaltsangabe kann sich auf eine andere Ausgabe dieses Titels beziehen.

Über die Autorin bzw. den Autor

Dzejla Medjedovic earned her PhD in the Applied Algorithms Lab of the computer science department at Stony Brook University, NY in 2014. She has worked on a number of projects in algorithms for massive data, taught algorithms at various levels and also spent some time at Microsoft.

Emin Tahirovic earned his doctorate in biostatistics from UPenn in 2016, and his master's degree in theoretical computer science from Goethe University in Frankfurt in 2008. He has worked for DBahn AG as an IT consultant and he regularly consults on projects for pharma and tech companies.

Ines Dedovic earned her PhD at the Institute for Imaging and Computer Vision of the Department of Electrical Engineering at RWTH Aachen University, Germany. She has worked as a researcher at the Research Center Jülich and is currently employed as a software developer for camera systems at Jonas & Redmann, an automation company.

„Über diesen Titel“ kann sich auf eine andere Ausgabe dieses Titels beziehen.

Gratis für den Versand innerhalb von/der Deutschland

Versandziele, Kosten & Dauer

Suchergebnisse für Algorithms and Data Structures for Massive Datasets

Foto des Verkäufers

Medjedovic, Dzejla|Tahirovic, Emin
Verlag: Manning, 2022
ISBN 10: 1617298034 ISBN 13: 9781617298035
Neu Kartoniert / Broschiert

Anbieter: moluna, Greven, Deutschland

Verkäuferbewertung 5 von 5 Sternen 5 Sterne, Erfahren Sie mehr über Verkäufer-Bewertungen

Kartoniert / Broschiert. Zustand: New. Dedovic, Ines (illustrator). &Uumlber den AutorDzejla Medjedovic earned her PhD in the Applied Algorithms Lab of the computer science department at Stony Brook University, NY in 2014. She has worked on a number of projects in algorithms for massive data, tau. Artikel-Nr. 497594425

Verkäufer kontaktieren

Neu kaufen

EUR 59,60
Währung umrechnen
Versand: Gratis
Innerhalb Deutschlands
Versandziele, Kosten & Dauer

Anzahl: 5 verfügbar

In den Warenkorb

Foto des Verkäufers

Dzejla Medjedovic
ISBN 10: 1617298034 ISBN 13: 9781617298035
Neu Taschenbuch

Anbieter: AHA-BUCH GmbH, Einbeck, Deutschland

Verkäuferbewertung 5 von 5 Sternen 5 Sterne, Erfahren Sie mehr über Verkäufer-Bewertungen

Taschenbuch. Zustand: Neu. Dedovic, Ines (illustrator). Neuware - Data structures and algorithms that are great for traditional software may quickly slow or fail altogether when applied to huge datasets. Algorithmsand Data Structures for Massive Datasets introduces a toolbox of new techniques that are perfect for handling modern big data applications. You'll discover methods for reducing and sketching data so it fits in small memory without losing accuracy, and unlock the algorithms and data structures that form the backbone of a big data system. Filled with fun illustrations and examples from real-world businesses, you'll learn how each of these complex techniques can be practically applied to maximize the accuracy and through put of big data processing and analytics. Artikel-Nr. 9781617298035

Verkäufer kontaktieren

Neu kaufen

EUR 78,94
Währung umrechnen
Versand: Gratis
Innerhalb Deutschlands
Versandziele, Kosten & Dauer

Anzahl: 2 verfügbar

In den Warenkorb

Beispielbild für diese ISBN

Medjedovic, Dzejla/ Tahirovic, Emin/ Dedovic, Ines
Verlag: Manning Pubns Co, 2022
ISBN 10: 1617298034 ISBN 13: 9781617298035
Neu Paperback

Anbieter: Revaluation Books, Exeter, Vereinigtes Königreich

Verkäuferbewertung 5 von 5 Sternen 5 Sterne, Erfahren Sie mehr über Verkäufer-Bewertungen

Paperback. Zustand: Brand New. Dedovic, Ines (illustrator). 325 pages. 9.25x7.37x0.81 inches. In Stock. Artikel-Nr. x-1617298034

Verkäufer kontaktieren

Neu kaufen

EUR 75,42
Währung umrechnen
Versand: EUR 11,44
Von Vereinigtes Königreich nach Deutschland
Versandziele, Kosten & Dauer

Anzahl: 2 verfügbar

In den Warenkorb

Beispielbild für diese ISBN

Medjedovic, Dzejla
Verlag: Manning, 2022
ISBN 10: 1617298034 ISBN 13: 9781617298035
Neu Softcover

Anbieter: Kennys Bookstore, Olney, MD, USA

Verkäuferbewertung 5 von 5 Sternen 5 Sterne, Erfahren Sie mehr über Verkäufer-Bewertungen

Zustand: New. Dedovic, Ines (illustrator). 2022. Paperback. . . . . . Books ship from the US and Ireland. Artikel-Nr. V9781617298035

Verkäufer kontaktieren

Neu kaufen

EUR 85,68
Währung umrechnen
Versand: EUR 1,87
Von USA nach Deutschland
Versandziele, Kosten & Dauer

Anzahl: 6 verfügbar

In den Warenkorb