Pentaho for Big Data Analytics

3,71 durchschnittliche Bewertung
( 7 Bewertungen bei Goodreads )
9781783282159: Pentaho for Big Data Analytics

With your knowledge of Java and this guide, you can take the analysis of your big data to new levels using Pentaho. Covers all the essentials tools, techniques, tips, and tricks in one handy volume.


  • A guide to using Pentaho Business Analytics for big data analysis
  • Learn Pentaho’s visualization and reporting tools with practical examples and tips
  • Precise insights into churning big data into meaningful knowledge with Pentaho

In Detail

Pentaho accelerates the realization of value from big data with the most complete solution for big data analytics and data integration. The real power of big data analytics is the abstraction between data and analytics. Data can be distributed across the cluster in various formats, and the analytics platform should have the capability to talk to different heterogeneous data stores and fetch the filtered data to enrich its value.

Pentaho Big Data Analytics is a practical, hands-on guide that provides you with clear, step-by-step exercises for using Pentaho to take advantage of big data systems, where data beats algorithm, and gives you a good grounding in using Pentaho Business Analytics’ capabilities.

This book looks at the key ingredients of the Pentaho Business Analytics platform. We will see how to prepare the Pentaho BI environment, and get to grips with the big data ecosystem through Hadoop and Pentaho MapReduce. The book provides a clear guide to the essential tools of Pentaho Business Analytics, providing familiarity with both the various design tools for setting up reports, and the visualization tools necessary for complete data analysis.

What you will learn from this book

  • Get to grips with the Pentaho suite
  • Explore the basics of Big Data and its business context
  • Set up a Pentaho business analytics server
  • Consume Big Data on HDFS platform using Pentaho Data Integration
  • Create visualization with Pentaho's tools
  • Distinguish signal from noise with Pentaho's Data Analytics capabilities
  • Design and set up your own Pentaho dashboard
  • Move from data to analytics in just a few steps with Community Dashboard Framework (CDF)


The book is a practical guide, full of step-by-step examples that are easy to follow and implement.

Who this book is written for

This book is for developers, system administrators, and business intelligence professionals looking to learn how to get more out of their data through Pentaho. In order to best engage with the examples, some knowledge of Java will be required.

Die Inhaltsangabe kann sich auf eine andere Ausgabe dieses Titels beziehen.

About the Author:

Manoj R Patil

Manoj R Patil is the Chief Architect in Big Data at Compassites Software Solutions Pvt. Ltd. where he overlooks the overall platform architecture related to Big Data solutions, and he also has a hands-on contribution to some assignments. He has been working in the IT industry for the last 15 years. He started as a programmer and, on the way, acquired skills in architecting and designing solutions, managing projects keeping each stakeholder's interest in mind, and deploying and maintaining the solution on a cloud infrastructure. He has been working on the Pentaho-related stack for the last 5 years, providing solutions while working with employers and as a freelancer as well.

Manoj has extensive experience in JavaEE, MySQL, various frameworks, and Business Intelligence, and is keen to pursue his interest in predictive analysis.

He was also associated with TalentBeat, Inc. and Persistent Systems, and implemented interesting solutions in logistics, data masking, and data-intensive life sciences.

Feris Thia

Feris Thia is a founder of PHI-Integration, a Jakarta-based IT consulting company that focuses on data management, data warehousing and Business Intelligence solutions. As a technical consultant, he has spent the last seven years delivering solutions with Pentaho and the Microsoft Business Intelligence platform across various industries, including retail, trading, finance/banking, and telecommunication.

He is also a member and maintainer of two very active local Indonesian discussion groups related to Pentaho ( and Microsoft Excel (the Facebook group).

His current activities include research and building software based on Big Data and the data mining platform, that is, Apache Hadoop, R, and Mahout.

He would like to work on a book with a topic on analyzing customer behavior using the Apache Mahout platform.

„Über diesen Titel“ kann sich auf eine andere Ausgabe dieses Titels beziehen.

(Keine Angebote verfügbar)

Buch Finden:

Kaufgesuch aufgeben

Sie kennen Autor und Titel des Buches und finden es trotzdem nicht auf ZVAB? Dann geben Sie einen Suchauftrag auf und wir informieren Sie automatisch, sobald das Buch verfügbar ist!

Kaufgesuch aufgeben