Processing ......
FreeComputerBooks.com
Links to Free Computer, Mathematics, Technical Books all over the World
 
Big Data Analytics with Hadoop 3
🌠 Top Free Java Books - 100% Free or Open Source!
  • Title Big Data Analytics with Hadoop 3
  • Author(s) Sridhar Alla
  • Publisher: Packt Publishing (May 31, 2018); eBook (Free Edition)
  • Permission: Free eBook by the Publisher (Packt)
  • Paperback: 482 pages
  • eBook: PDF
  • Language: English
  • ISBN-10/ASIN: 1788628845
  • ISBN-13: 978-1788628846
  • Share This:  

Book Description

This book introduces you to advanced MapReduce concepts and teaches you everything from identifying the factors that affect MapReduce job performance to tuning the MapReduce configuration. Based on real-world experience, this book will help you to fully utilize your cluster's node resources to run MapReduce jobs optimally.

Starting with how MapReduce works and the factors that affect MapReduce performance, you will be given an overview of Hadoop metrics and several performance monitoring tools. Further on, you will explore performance counters that help you identify resource bottlenecks, check cluster health, and size your Hadoop cluster. You will also learn about optimizing map and reduce tasks by using Combiners and compression.

If you are a Hadoop administrator, developer, MapReduce user, or beginner, this book is the best choice available if you wish to optimize your clusters and applications. Having prior knowledge of creating MapReduce applications is not necessary, but will help you better understand the concepts and snippets of MapReduce class template code.

  • Learn about the factors that affect MapReduce performance
  • Utilize the Hadoop MapReduce performance counters to identify resource bottlenecks
  • Size your Hadoop cluster's nodes
  • Set the number of mappers and reducers correctly
  • Optimize mapper and reducer task throughput and code size using compression and Combiners
  • Understand the various tuning properties and best practices to optimize clusters
About the Authors
  • Khaled Tannir has a Bachelor's degree in Electronics, a Master's degree in System Information Architectures, in which he graduated with a professional thesis, and completed his education with a Master of Research degree.
Reviews, Ratings, and Recommendations: Related Book Categories: Read and Download Links: Similar Books:
  • Big Data Processing with Apache Spark (Srini Penchikala)

    Learn about the Apache Spark framework and develop Spark programs for use cases in big-data analysis. It covers all the libraries that are part of Spark ecosystem, which includes Spark Core, Spark SQL, Spark Streaming, Spark MLlib, and Spark GraphX.

  • Hadoop with Python (Zachary Radtka, et al)

    This book takes you through the basic concepts behind Hadoop, MapReduce, Pig, and Spark. Then, through multiple examples and use cases, you'll learn how to work with these technologies by applying various Python tools.

  • Hadoop for Windows Succinctly (Dave Vickers)

    This book provides a thorough guide to using Hadoop directly on Windows operating systems. From a conceptual overview to practical examples, Hadoop for Windows Succinctly is a valuable resource for developers.

  • Hadoop Succinctly (Elton Stoneman)

    This booky explains how Hadoop works, what goes on in the cluster, demonstrates how to move data in and out of Hadoop, and how to query it efficiently. It also walks through a Java MapReduce example, illustrates it in Python and .NET, too.

  • Hadoop Illuminated (Mark Kerzner, et al)

    This book aims to make Hadoop knowledge accessible to a wider audience, not just to the highly technical. It book introduces you to Hadoop and to concepts such as 'MapReduce', etc., which will help you get acquainted with the technology.

  • Understanding Big Data: Analytics for Hadoop and Streaming Data

    In this book, the three defining characteristics of Big Data - volume, variety, and velocity, are discussed. Industry use cases are also included in this practical guide, to deliver a robust, secure, highly available, enterprise-class Big Data platform.

  • Data-Intensive Text Processing with MapReduce (Jimmy Lin)

    This free book focuses on MapReduce algorithm design, with an emphasis on text processing algorithms common in natural language processing, information retrieval, and machine learning.

  • Engineering Agile Big-Data Systems (Kevin Feeney, et al)

    This book outlines an approach to dealing with problems in software and data engineering, describing a methodology for aligning these processes throughout product lifecycles. It discusses tools which can be used to achieve these goals.

Book Categories
:
Other Categories
Resources and Links