Overview of IBM watsonx.ai and IBM watsonx.governance software
IBM watsonx is a secure and collaborative environment where you can access your organization's trusted data, automate AI processes, and deliver AI in your applications. The IBM watsonx.ai component provides a studio of integrated tools for working with generative AI capabilities that are powered by foundation models and for building machine learning models. The IBM watsonx.governance component provides end-to-end monitoring for machine learning and generative AI models to accelerate responsible, transparent, and explainable AI workflows.
AI engineers, data scientists, and AI risk and compliance officers can accomplish the following goals with watsonx.ai and watsonx.governance:
-
Build machine learning models Build models by using open source frameworks and code-based, automated, or visual data science tools.
-
Develop with foundation models Generate prompts to generate, classify, summarize, or extract content from your input text. Choose from IBM models or open source models from Hugging Face. Tune foundation models to customize your prompt output.
-
Manage the AI lifecycle Manage and automate the full AI model lifecycle with all the integrated tools and runtimes to train, validate, and deploy AI models.
-
Govern AI Track the detailed history of AI models and evaluate model output to help ensure compliance.
Data engineers can collect, store, query, and analyze enterprise data in a lakehouse architecture with IBM watsonx.data. See IBM watsonx.data 1.1.x documentation.
The following graphic shows the capabilities of the watsonx.ai and watsonx.governance components on top of the common core functionality that provides an integrated user experience. The watsonx.data experience is separate and not shown in the graphic.
Watsonx.ai
Watsonx.ai includes tools for working with data and models, the foundation models for generative AI, and the hardware and software resources for computing and inferencing. Which tools, foundation models, and hardware and software resources that you have access to depend on which services are installed on your system.
Tools for preparing data and working with models
Your watsonx.ai tools are in collaborative workspaces called projects.
You can use watsonx.ai tools to prepare data and work with models in the following ways:
- Prepare data: Refine and visualize your data files or data tables in remote data sources or generate synthetic tabular data.
- Build machine learning models: Automatically generate model candidates, create machine learning model training flows, or write model training code in Python or R.
- Work with foundation models: Experiment with generative AI prompts, tune foundation models for your use case, or write generative AI solution code in Python.
- Automate model lifecycles: Create repeatable and scheduled flows that automate notebook, Data Refinery, and machine learning pipelines.
For a list of tools, their levels of automation, and whether you can use them to prepare data or work with models, see Analyzing data and working with models. For generative AI tools, see Developing generative AI solutions.
Deployed foundation models
IBM watsonx.ai has a range of large language models for generative AI that your administrator can deploy in your cluster. The deployed foundation models include open source models from Hugging Face and IBM foundation models. You can also deploy your own custom foundation models. You can customize foundation model behavior by tuning a foundation model.
For a list of included foundation models that your administrator can deploy for watsonx.ai, see Supported foundation models.
Usage resources for computing and inference
When you run tools or your AI solutions with watsonx.ai, you consume the following types of resources:
- Compute usage
- When you run jobs, notebooks, experiments that train or tune models, or deployments, you consume compute resources. Compute resources include the appropriate hardware and software that are specific for the workload. Your compute resources can vary depending on which services are installed on your cluster. See Runtime environments.
- Inferencing usage
- When you run inferencing against foundation models, your inferencing usage depends on the number of the tokens in the prompt input and output text. Tokens are basic units of text. Inferencing is run on GPU hardware.
Watsonx.governance
Watsonx.governance includes tools for governing, evaluating, and explaining models.
Tools for governing models
You can use watsonx.governance tools to govern models in the following ways:
-
Monitor and evaluate models: You can monitor model output and explain model predictions. Your watsonx.governance model monitoring and evaluation tools are in projects and deployment spaces.
-
Track and document AI use cases: You can view model lifecycle status, general model and deployment details, training information and metrics, and deployment metrics. Your watsonx.governance model tracking tools in AI use cases.
-
Manage governance activity from the Governance Console Optionally integrate with the watsonx Governance Console from IBM OpenPages. Sync data from factsheets with the Governance Console, and extend governance capabilities with workflows and other compliance tools.
Common core functionality
Watsonx includes the following core functionality of IBM Cloud Pak for Data as the secure and scalable foundation for your organization to collaborate efficiently:
- Connectivity
- Administration
- Storage
- Workspaces
Connectivity
You can create connections to remote data sources and import connected data. You can configure connections with personal or shared credentials. For a list of supported connectors, see Connectors.
You can share connections with others across the platform in the Platform assets catalog.
Administration
IBM watsonx contains the core of IBM Cloud Pak for Data and has the same administration features and experience. Administrators can perform the following types of tasks:
- Installing, upgrading, or migrating the software
- Backing up or restoring the software
- Monitoring the platform
- Securing the environment
- Auditing events
- Forwarding alerts, notifications, and announcements
- Setting up services
- Managing resources
- Managing users
Storage
IBM watsonx and Cloud Pak for Data require a persistent storage solution that is accessible to your Red Hat OpenShift cluster. All the assets that you create with watsonx.ai and watsonx.governance are stored in that persistent storage solution.
See Storage requirements.
Workspaces
Watsonx is organized as a set of collaborative workspaces where you can work with your team or organization. Each workspace has a set of members with roles that provide permissions to perform actions.
Most users work with assets, which are items that are created or added to workspaces by users. Assets can represent data, models, or other types of code or information. Data assets contain metadata that represents data. Assets that you create in tools, such as models, run code to work with data. You can also create assets that contain information about other assets, such as model use cases that contain metadata, history, and reports about models. See Asset types and properties.
You can work in these types of workspaces in watsonx:
- Projects
- Deployment spaces
- Platform connections
- AI use cases
You can search for assets across all workspaces that you belong to.
Projects
Projects are where your data science and model builder teams work with data to create assets, such as, saved prompts, notebooks, models, or pipelines.
The following image shows what the Overview page of a project might look like.
Deployment spaces
Deployment spaces are where your ModelOps team deploys models and other deployable assets to production and then tests and manages deployments in production. After you build models and deployable assets in projects, you promote them to deployment spaces. See Deployment spaces.
The following image shows what the Overview page of a deployment space might look like.
Platform connections
Platform connections is a view of the Platform assets catalog that lists connection assets. You can access platform connections in any project or deployment space.
The following image shows what the Connections page of the Platform connections might look like.
AI use cases
AI use cases is a view of the Platform assets catalog that lists model use cases. Use cases track the details for AI assets in factsheets.
The following image shows what the AI use cases page might look like.