Enterprise capabilities, essential for achieving strategic goals and operating requirements, are outlined in the Generative AI Architecture capability model. It includes six major categories, showcasing unique and supporting capabilities needed for effective generative AI implementation, with comprehensive documentation available in other architectures.
The remaining capability groups are supporting capabilities for generative AI. The capabilities are not unique to generative AI but must be present to support it as an enterprise capability. These groups are:
Data Management is a group of capabilities to store, manage, and transform data to forms that make it suitable for tuning and training of generative AI models. Also included in this category are capabilities to log and rate model responses for auditing purposes, and as input to further model tuning and refinement.
Supporting Capabilities is a catch-all grouping of application, integration, and IT operations capabilities required to successfully deploy and manage generative AI solutions with an enterprise.
GenAI Resources captures the hardware and platform capabilities necessary to efficiently and effectively develop, tune, deploy, and manage generative AI models and solutions.
Each capability category is made up of one or more capability groups. This section highlights groups and capabilities key to generative AI.
Model Hub capability group encapsulates the capabilities necessary to manage imported models as well as models tuned or trained by the enterprise. These capabilities enable enterprises to manage the models and data sets available for use within the enterprise, and to limit access to models and data sets to specific users or groups within the enterprise. Model importing and Data importing are key capabilities for enterprises to gate the intake of models from the growing number of public model repositories such as Hugging Face.
Model Hosting Model Hosting offers capabilities for deploying general and tuned models as API-enabled services within an enterprise, optimizing resource utilization, allowing independent refinement and replacement, and simplifying governance. Key to this is Model Access Policy Management, ensuring model access is restricted to authorized users and groups, preventing unauthorized usage.
Model Customization is a group of capabilities that enable an enterprise to tune and train generative AI models for specific business needs. Typically this capability will be realized using a cloud platform as the cloud's pay-as-you-go model is well-suited to the 'bursty' nature of tuning and training resource demands.
Model and Data Governance is a critical set of capabilities for an enterprise to make use of generative AI models on a wide scale. Specifically, these capabilities provide enterprises with the insights they need to monitor and manage model risks such as the introduction of bias in model responses, and to help address regulatory and compliance requirements for model transparency and fairness.
Model Monitoring is the operational analogue to Model Governance; where Model Governance deals with long-term model and risk management, the capabilities in Model Monitoring enable enterprises to monitor and management model operations in real time. Model Monitoring is comprised of several key capabilities, including:
GenAI Compliance Management is a category of capabilities is about enabling the controls needed to “secure the usage” of AI through the application stack and “secure the applications” themselves. Adhering to ethical standards and guidelines to ensure that AI systems respect human values and rights.
AI Application Security Management. This category of capabilities is about enabling the controls needed to “secure the usage” of AI through the application stack and “secure the applications” themselves. Adhering to ethical standards and guidelines to ensure that AI systems respect human values and rights.
AI Model Security Management. This category of capabilities is about enabling the controls needed to “secure the model” layer as well as securing the usage of models. Implementing best practices for model training, validation, and evaluation to enhance performance and reliability.
AI Data Security Management. This category of capabilities enables controls to “secure the data” layer. Establishing clear guidelines for data collection, storage, and usage to ensure data quality and mitigate bias. While data security is not unique to GenAI, we'll focus only on areas where GenAI require particular attention from a data standpoint.
Agentic AI is a group of capabilities required to create and deploy agentic AI applications. These include core capabilities like Routing and Orchestration, and Tool Management and Tool Calling.
GenAI Tuning is a group of capabilities necessary to 'customize' a general generative model to the needs of the enterprise. Models are trained on a broad base of knowledge and will lack knowledge of specific industry jargon and processes. Thus, most enterprises will need to make use of capabilities like Prompt Engineering, Prompt Tuning, and Model Fine-tuning to create a model that understands the terms and processes of the enterprise's business.
GenAI Application Capabilities enable enterprises to develop advanced generative AI applications. Capabilities include the ability to dynamically generate functions to respond to user queries; conversational memory, which enables generative AI applications to retain and reference prior interactions in a conversational manner; and model routing, which enables applications to dynamically route queries to a model best suited to respond.