This book was published earlier this year, so I was given a few free copies and had them at the IDUG conference in Phoenix. I didn’t think people at the conference would be interested in this book, so I had put them aside. But when people started noticing the book, they wanted to know how they could get a copy. On the spur of the moment, I decided that if people tweeted the message “I want a copy of the Hadoop book from Susan”, I’d give it to them. To my huge surprise, I gave out more than 20 copies of this book in this way. Of course a data-centric audience is interested in a book that teaches them how to modernize their data environment!
by Dirk deRoos and Paul Zikopoulos
At IBM Insights:
The book signing is scheduled for Monday October 27 from 4:30pm - 5:00pm at the Conference Bookstore. I’m pretty sure Dirk will be there… and I made sure it was on Paul’s calendar, but I’m sure Paul will be in huge demand the entire week, so it is possible that Paul will not be able to make it at this time.
About the book:
Big data has become big business, and companies and organizations of all sizes are struggling to find ways to retrieve valuable information from their massive data sets without becoming overwhelmed. Enter Hadoop and this easy-to-understand For Dummies guide. Hadoop For Dummies helps readers understand the value of big data, make a business case for using Hadoop, navigate the Hadoop ecosystem, and build and manage Hadoop applications and clusters.
- Explains the origins of Hadoop, its economic benefits, and its functionality and practical applications
- Helps you find your way around the Hadoop ecosystem, program MapReduce, utilize design patterns, and get your Hadoop cluster up and running quickly and easily
- Details how to use Hadoop applications for data mining, web analytics and personalization, large-scale text processing, data science, and problem-solving
- Shows you how to improve the value of your Hadoop cluster, maximize your investment in Hadoop, and avoid common pitfalls when building your Hadoop cluster
From programmers challenged with building and maintaining affordable, scalable data systems to administrators who must deal with huge volumes of information effectively and efficiently, this how-to has something to help you with Hadoop.
- Understand the value of big data and how Hadoop can help manage it
- Navigate the Hadoop 2 ecosystem and create clusters
- Use applications for data mining, problem-solving, analytics, and more
The easy-to-use, practical guide to using Hadoop for big data
With most of the world’s data created in only the past two years, Hadoop has emerged as the definitive computing paradigm to handle big data. This comprehensive guide from IBM big data experts provides a hands-on resource for those who want to dig into the details of HDFS and MapReduce to take data storage and processing to the next level.
- Get started with Hadoop — discover the origins of Hadoop, the realities of worldwide data growth, and practical use cases for this revolutionary platform
- Under the Hadoop hood — dig into Hadoop’s distributed framework, including HDFS and MapReduce and the best tools for working with data in Hadoop
- Hadoop and structured data — modernize data warehouses with Hadoop and discover data utilities like HBase, Hive, and Sqoop
- Hands on with Hadoop — get your hands dirty with details on configuring Hadoop clusters and an overview of day-to-day Hadoop administration
- Take your Hadoop knowledge to the next level — use additional Hadoop resources to understand the technology at a deeper level
Open the book and find:
- Coverage of the Hadoop 2 ecosystem and YARN
- Real-world use cases to help you get started
- Details on Hadoop distributions and cluster setup
- How to use Oozie for scheduling workflows
- How to add structure with Hive and HBase
- Details on running native SQL queries on Hive
- On-premises and cloud deployment options for Hadoop
- The challenges faced by administrators
About the Authors:
Dirk deRoos is the technical sales lead for IBM’s InfoSphere BigInsights. Paul C. Zikopoulos is the vice president of big data in the IBM Information Management division. Roman B. Melnyk, PhD, is a senior member of the DB2 Information Development team. Bruce Brown and Rafael Coss work with big data at IBM.