Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data
by Dirk deRoos, Chris Eaton, George Lapis, Paul Zikopoulos, Tom Deutsch
Big Data represents a new era in data exploration and utilization, and IBM is uniquely positioned to help clients navigate this transformation. This Flashbook reveals how IBM is leveraging open source Big Data technology to deliver a robust, secure, highly available, enterprise-class Big Data platform.
The three defining characteristics of Big Data—volume, variety, and velocity—are discussed. You’ll get a primer on Hadoop and how IBM is 'hardening' it for the enterprise, and learn when to leverage IBM InfoSphere BigInsights (Big Data at rest) and IBM InfoSphere Streams (Big Data in motion) technologies. Deployment and scaling strategies plus industry use cases are also included in this practical guide.
- Learn how IBM hardens Hadoop for enterprise-class scalability and reliability
- Gain insight into IBM's unique in-motion and at-rest Big Data analytics
- Learn tips and tricks for Big Data use cases and solutions
- Get a quick Hadoop primer
This book is about Big Data, but you already knew that. Big Data is a Big Deal! This book’s authoring team is well seasoned in traditional database technologies, and all of its members recognized one thing: Big Data is an inflection point for information management technologies. In fact, Big Data is going to change how you do things, how you gain insight, and how you make decisions (the change isn’t going to be a replacement, but rather a synergy and extension). Recognizing this inflection point, the author team decided to write this book to help you get up to speed quickly on this technology and to show you the unique things IBM is doing to turn freely available open source Big Data technology into a Big Data platform. There’s a major difference: the platform leverages the open source technologies (never forking them) and marries them to enterprise capabilities provided by a technology leader that understands the benefits a platform can provide.
By the time you are done reading this book, you’ll have a good handle on the Big Data opportunity that lies ahead, a better understanding of the requirements that ensure you have the right Big Data platform (as opposed to just technology), and a strong foundational knowledge of the business opportunities Big Data presents and of some of the technologies available.
PART I: The Big Deal about Big Data
Chapter 1 – What is Big Data? Hint: You’re a Part of it Every Day
Chapter 2 – Why Big Data is Important
Chapter 3 – Why IBM for Big Data
PART II: Big Data: From the Technology Perspective
Chapter 4 – All About Hadoop: The Big Data Lingo Chapter
Chapter 5 – IBM InfoSphere BigInsights – Analytics for “At Rest” Big Data
Chapter 6 – IBM InfoSphere Streams – Analytics for “In Motion” Big Data
Chris Eaton, B.Sc., is a worldwide technical specialist for IBM’s Information Management products, focused on Database Technology, Big Data, and Workload Optimization. Chris is also an international award-winning speaker who has presented at data management conferences across the globe, and he writes one of the most popular DB2 blogs, located on IT Toolbox at: http://it.toolbox.com/blogs/db2luw.
Dirk deRoos, B.Sc., B.A., is a member of the IBM World-Wide Technical Sales Team, specializing in the IBM Big Data Platform. Dirk joined IBM eleven years ago, and has a Bachelor of Computer Science and a Bachelor of Arts (Honors English) from the University of New Brunswick.
Thomas Deutsch, B.A., M.B.A., serves as a Program Director in IBM’s Big Data business. Tom has spent the past couple of years helping customers with Apache Hadoop, identifying architecture fit, and managing early stage projects in 200+ customer engagements.
George Lapis, MS CS, is a Big Data Solutions Architect at IBM's Silicon Valley Lab. He has worked in the database software area for more than 30 years. He was a founding member of the R* and Starburst research projects at IBM's Almaden Research Center in the valley, as well as a member of the compiler development team for several releases of DB2.
Paul C. Zikopoulos, B.A., M.B.A., is the Director of Technical Professionals for IBM Software Group’s Information Management division and additionally leads the World Wide Database Competitive and Big Data SWAT teams. Paul has written more than 300 magazine articles and 14 books on DB2 and can be reached at: firstname.lastname@example.org.