Analyze and optimize cloud cluster performance

Use configurable parameters to monitor and tune the performance of a cloud Hadoop cluster

From the developerWorks archives

Yu Li

Date archived: November 29, 2016 | First published: March 07, 2011

Hadoop is a popular software framework that enables distributed manipulation of large amounts of data, thus making it a perfect companion to cloud computing. In fact, Hadoop MapReduce, the programming model and software framework used to write applications to rapidly process vast amounts of data in parallel on large clusters of compute nodes, is already in play on cloud systems. This article shows you how to take full advantage of Hadoop by introducing Hadoop configurable parameters and using them to monitor, analyze, and tune the performance of your Hadoop cluster.

This content is no longer being updated or maintained. The full article is provided "as is" in a PDF file. Given the rapid evolution of technology, some steps and illustrations may have changed.



static.content.url=http://www.ibm.com/developerworks/js/artrating/
SITE_ID=1
Zone=Cloud computing, Big data and analytics
ArticleID=630591
ArticleTitle=Analyze and optimize cloud cluster performance
publish-date=03072011