Environments (Watson Studio and IBM Knowledge Catalog)

You run assets, create jobs, and launch IDEs like RStudio or JupyterLab in a runtime. The runtime details are specified by an environment template.

Environment templates specify the hardware and software configuration of the runtimes:

  • The hardware configuration specifies the amount of processing power and available RAM.
  • The software configuration specifies the programming languages, a set of preinstalled libraries, and optional libraries or packages that you can specify.

Included environment templates

You can use the environment templates that are included in Watson Studio to quickly get started, without having to create your own environment templates. The included environment templates are listed on the project's Environments page.

The included environments for notebooks and JupyterLab are added as an affiliate of a runtime release and prefixed with Runtime followed by the release year and release version.

A runtime release specifies a list of key data science libraries and a language version, for example Python 3.11. All environments of a runtime release are built based on the library versions that are defined in the release. This ensures the consistent use of data science libraries across all data science applications.

Runtime releases

One 24.1 Runtime release exists for different versions of Python and R.

While a runtime release is supported, IBM continues to update the library versions to address security requirements. These updates will change only the <Patch> version of the libraries, not the <Major>.<Minor> versions. Therefore, your notebook assets will continue to run.

For example: A runtime release supports TensorFlow 2.12. In Cloud Pak for Data 5.0 , the runtime release contains TensorFlow 2.14.1. Although TensorFlow might be updated to version 2.14.2 or 2.14.3 in later Cloud Pak for Data 5.0.x releases, it will not be updated to version 2.15.

Library packages included in Runtimes

For specific versions of popular data science library packages that are included in Watson Studio runtimes refer to these tables:

Table 1. Packages and their versions in the 24.1 Runtime release for Python
Library Runtime 24.1 on Python 3.11
Keras 2.14.0
Lale 0.8.x
LightGBM 4.2.0
NumPy 1.26.4
ONNX 1.16
ONNX Runtime 1.16.3
OpenCV 4.8.1
pandas 2.1.4
PyArrow 15.0.1
PyTorch 2.1.2
scikit-learn 1.3.0
SciPy 1.11.4
SnapML 1.14.6
TensorFlow 2.14.1
XGBoost 2.0.3
Table 2. Packages and their versions in the 24.1 Runtime release for R
Library Runtime 24.1 on R 4.3
arrow 15.0
car 3.1
caret 6.0
catools 1.18
forecast 8.21
ggplot2 3.4
glmnet 4.1
hmisc 5.1
keras 2.13
lme4 1.1
mvtnorm 1.2
pandoc 2.12
psych 2.3
python 3.11
randomforest 4.7
reticulate 1.34
sandwich 3.0
scikit-learn 1.3
spatial 7.3
tensorflow 2.15
tidyr 1.3
xgboost 1.7

In addition to the libraries listed in the tables, runtimes include many other useful libraries. To see the full list, select the Manage tab in your project, then click Templates, select the Environments tab, and then click on one of the listed environment templates.

Getting started

Python with GPU and Execution Engine for Apache Hadoop environments are not available by default.

  • For Python with GPU environments, the Jupyter Notebooks with Python for GPU service must be installed.
  • For Execution Engine for Apache Hadoop environments, the Execution Engine for Apache Hadoop service must be installed.

After these services are installed, you must create your own environment templates to use these environments.

Use the following table to find out more about environment templates by asset type.

Note:
  • R-based runtimes for notebook editor and Data Refinery do not work on the IBM Z (s390x) platform.
  • Up to and including Cloud Pak for Data release 5.1.1, R-based runtimes for Data Refinery and notebook editor without Analytics Engine installed do not work on the IBM Power® (ppc64le) platform. From Cloud Pak for Data release 5.1.2, R-based runtimes for Data Refinery that are based on R4.3 and above work on the IBM Power® (ppc64le) platform.

Environment templates listed by operational asset type
Asset Programming language Tool Environment template type Available environment templates/compute resources
Jupyter notebook Python notebook editor Anaconda Python distribution Python environments
Jupyter notebook Python notebook editor Anaconda Python distribution with GPU GPU environments
Jupyter notebook Python notebook editor Spark Spark environments
Jupyter notebook Python notebook editor Spark Hadoop cluster
Jupyter notebook R notebook editor Anaconda R distribution R environments
Jupyter notebook R notebook editor Spark Spark environments
Jupyter notebook Python JupyterLab Anaconda Python distribution JupyterLab environments
Jupyter notebook Python Visual Studio Code editor Anaconda Python distribution JupyterLab environments
Script R RStudio Anaconda R distribution RStudio environments
Shiny app R RStudio Anaconda R distribution RStudio environments
SPSS Modeler flow N/A SPSS Modeler SPSS Modeler SPSS Modeler environments
Data Refinery flow R Data Refinery Spark Data Refinery environments
Data Refinery flow R Data Refinery Spark Hadoop cluster

Learn more

Parent topic: Projects