What is Big Data?
Every day, we create 2.5 quintillion bytes of data, so much that 90% of the data in the world today has been created in the last two years alone. This data comes from everywhere: sensors used to gather climate information, posts to social media sites, digital pictures and videos posted online, transaction records of online purchases, and cell phone GPS signals, to name a few. This data is big data.
The Big Data platform
IBM InfoSphere BigInsights builds on open source Apache Hadoop with unique IBM innovations, including a sophisticated text analytics module. BigInsights can ingest and analyze data in its native format without imposing a schema or structure, enabling fast ad hoc analysis.
Tivoli Workload Scheduler and Big Data
Use the IBM Tivoli Workload Scheduler Integrator for InfoSphere BigInsights to run IBM InfoSphere BigInsights jobs from Tivoli Workload Scheduler. In this way, you can take advantage of all the Tivoli Workload Scheduler scheduling functions to manage these jobs. After you install the integrator in your Tivoli Workload Scheduler environment, you can define, schedule, and monitor InfoSphere BigInsights jobs through Tivoli Workload Scheduler.
A business scenario
A large wind energy company does business in wind power generation. To succeed, the company uses one of the largest supercomputer networks worldwide together with a new big data modeling solution based on InfoSphere BigInsights. To reduce the data processing time needed to establish the optimal turbine placement, the company runs InfoSphere BigInsights jobs every day to collect and analyze data. The process of establishing a location starts with the company's wind library, which combines data from global weather systems with data collected from existing turbines. Together, this information helps the company not only to select the best site for turbine placement, but also to forecast wind and power production for its customers. Currently, an operator manually runs the process that executes the InfoSphere BigInsights jobs overnight. To reduce costs and to ensure that the SLA requirement of having analyzed data available every morning is met, the company wants to automate this process.
By using IBM Tivoli Workload Scheduler Integrator for InfoSphere BigInsights, the company can meet this objective, because the product helps automate and control the entire process.
For more information, or to download the IBM Tivoli Workload Scheduler Integrator for InfoSphere BigInsights, see the online page on ISM Library Support: TWS Integrator for InfoSphere BigInsights on ISM