IBM InfoSphere BigInsights Version 3.0

Jaql overview

Although it was designed for JSON, Jaql can also handle other data types. You can increase the extensibility of Jaql by using functions and modules.

Jaql can help you work with and manipulate large amounts of data on demand or in a production environment. Jaql is a suitable language for you to use if you want to use a scripting paradigm for your analysis. You can use it to automate tasks and increase processing control to conduct a large, comprehensive analysis of data.

Jaql, which borrows some of the best features of other scripting languages, is designed to meet the following objectives:
Semi-structured analytics
With Jaql, you can easily analyze and manipulate large-scale semi-structured data, like JSON data.
Jaql queries that process large amounts of data are able to take advantage of scaled-out architectures. For instance, Jaql uses the Hadoop MapReduce framework to process JSON data in parallel.
You can easily extend Jaql functions and data sources to facilitate plugging in computations for their data.

You can run Jaql queries in cluster mode or in local mode. Use cluster mode when you are running a Hadoop cluster and you want to analyze large amounts of data by using distributed processing in the cluster. In local mode, both Jaql queries and the processing are performed locally, so it is more suitable for sampling, testing, and working with small data sets.