Analyzing data and building models

You can use data analysis and model building tools and services in Cloud Pak for Data to find insights and train or solve AI models.

The tools that you can use in analytics projects depend on which services you have installed:

Analyzing data and building models with Watson Studio, Watson Machine Learning, and other supplemental services

You can analyze data and build models with the Watson Studio service. Supplemental services to Watson Studio, such as Watson Machine Learning, add tools and compute resources to analytics projects.

Service The Watson Studio, Watson Machine Learning, and other supplemental services are not available by default. An administrator must install these services on the IBM Cloud Pak for Data platform. To determine whether a service is installed, open the Services catalog and check whether the service is enabled.

Watson Studio usage

To start analyzing data with Watson Studio:

  1. Create or open an analytics project:
    • To create a project, choose Projects > View all projects from the main menu and then click New project on the Projects page and choose Analytics project. See creating an analytics project.
    • To open an existing project, choose Projects > View all projects from the main menu and then click the name of the analytics project.
  2. Add data to the project. Alternatively, you can add data from within a tool.
  3. Analyze data or build models. Find out how to choose the right tool in Watson Studio.

    You can use the data analytics and model building methods listed in the following table with Watson Studio plus the other listed services.

Table 1. Data analysis and model building methods with Watson Studio and supplementary services
Method Supplementary services to Watson Studio
Analyze data by writing code in Jupyter notebooks
Code notebooks and Python scripts in the JupyterLab IDE with Git integration
Visualize and prepare data in Data Refinery
Develop Shiny applications in the RStudio IDE RStudio Server with R 3.6
Visualize your data without coding with Cognos dashboards Cognos Dashboards
Run analytic workloads with Spark environments or Spark APIs Analytics Engine powered by Apache Spark
Analyze data on Apache Hadoop clusters Execution Engine for Apache Hadoop
Analyze data with SQL queries on Hadoop clusters or cloud object stores Db2 Big SQL
Build models with AutoAI Watson Machine Learning
Train models with federated learning Watson Machine Learning
Build models in notebooks Watson Machine Learning
Run deep learning experiments Watson Machine Learning
Solve Decision Optimization models Watson Machine Learning
Decision Optimization
Build models with SPSS Modeler SPSS Modeler

Analyzing data with other services

If you don't have the Watson Studio service installed, you can use the data analytics methods with the services listed in the following table.

Table 1. Data analysis methods and the required services
Method Service
Run analytic workloads with Spark APIs Analytics Engine powered by Apache Spark
Analyze data with SQL queries on Hadoop clusters or cloud object stores Db2 Big SQL