Skip to main content

By clicking Submit, you agree to the developerWorks terms of use.

The first time you sign into developerWorks, a profile is created for you. Select information in your developerWorks profile is displayed to the public, but you may edit the information at any time. Your first name, last name (unless you choose to hide them), and display name will accompany the content that you post.

All information submitted is secure.

  • Close [x]

The first time you sign in to developerWorks, a profile is created for you, so you need to choose a display name. Your display name accompanies the content you post on developerworks.

Please choose a display name between 3-31 characters. Your display name must be unique in the developerWorks community and should not be your email address for privacy reasons.

By clicking Submit, you agree to the developerWorks terms of use.

All information submitted is secure.

  • Close [x]

Use the IBM Industry Model Information Insurance Warehouse to define smart and mature data models

Introduction to the data warehousing development method for insurance

Alexander Tarabrin (tarabrin@de.ibm.com), Advisory IT Architect, IBM
Photo of Alexander Tarabrin
Alexander Tarabrin has 14 years of professional experience in the IT domain. He is an Advisory IT Architect in IBM Software Group Services in IBM Germany Alexander is working in the areas of data modeling, data management, industry models, and customer data integration.
Hermann Voellinger (VGR@de.ibm.com), Executive IT Architect, IBM
Photo of Hermann Voellinger
Hermann Voellinger is an Executive IT Architect at IBM Software Group Services. For the last 12 years, he has been the responsible IT architect for big data warehouse (DWH) projects in Germany. The 10 years before working on DWH projects, Hermann worked in the German Development Lab as the lead developer and architect for Text and Data-Mining solutions and tools. His key skills and main focus areas are currently the architecture of data population processes (ETL), data modeling concepts, and the strategy and architecture of DWH solutions.

Summary:  In this tutorial, understand the method for developing data models for data warehouse projects using the IBM Industry Model Insurance Information Warehouse (IIW), which is part of the IBM Industry Models product defined for the domain of insurance. The tutorial shows the best approach to develop core data warehouse (CDW) models and data mart (DM) models. The tutorial also introduces the recommended data warehousing development method (DWDM) to deal with the IIW model pattern framework to architect DWH solutions for insurance companies.

Date:  23 Dec 2010
Level:  Intermediate PDF:  A4 and Letter (743 KB | 32 pages)Get Adobe® Reader®

Activity:  23996 views
Comments:  

Before you start

Introduction

Data warehousing design and data modeling is a well-known, significant blend of computer science and IT. The technology grew up in the early 1990s using several approaches developed during that time. The most significant methods were defined by Ralph Kimball (top-down) and W. H. Inmon (bottom-up) (see Resources).

Commercial data modeling products are valuable because of their content-specific knowledge, which is based on practical experience and business expertise. IBM offers an intellectual capital product family in this space called IBM Industry Models. The IBM Industry Models products consist of mature and well-tested pattern frameworks for data modeling (relational and multi-dimensional), packaged for several industries. This article presents an overview of the Information Insurance Warehouse (IIW), which is a part of the IBM Industry Models product defined for the insurance industry.

This tutorial introduces the method to develop data models for data warehouse (DWH) using the IBM Industry Model IIW. The tutorial illustrates the approach for the development of the core data warehouse (CDW) models (highly normalized data models that hold the atomic data elements) and the data mart (DM) models (de-normalized data models that implement the structure of multi-dimensional data models). Multi-dimensional data models are characterized by the definition of measures, which are stored in fact tables, and by the definition of dimension tables, which defines the axes or dimensions of the analysis.

The method described in this tutorial is the IIW roadmap for developing data models. The IIW roadmap is based on the top-down approach, which starts with the capture of business requirements and the definition of the business model (in terms of the IIW known as the analysis data model). Defining the business requirements is the prerequisite for all further work. Ideally, this work should be carried out as a joint effort of the data modeler and experts from the business departments. When the business departments create and approve the model, the phase to create logical models starts.

The design of logical models consists of two steps: designing the DWH logical model (CDW) followed by designing the DM logical model. It is important to follow this sequential approach. Starting succeeding phases before finishing predecessor phases might yield undesirable results. Therefore the IIW roadmap's structure, and this tutorial, is divided into the following four phases:

  1. Phase 1: Capturing IIW business requirements
  2. Phase 2: Defining the analysis data model
  3. Phase 3: Designing the data warehouse logical model
  4. Phase 4: Designing the data mart

These four phases fulfill different goals and offer different deliverables:

Phase 1: Capturing IIW business requirements
A complete description of the business requirements that the BI project should solve. The deliverables are a conceptual model and an analytical requirements model.
Conceptual model
A model of all the concepts and business terms to be used across the organization
Analytical requirements model
Predefined models of business requirements that address specific industry issues. Models are expressed as measures and dimensions
Phase 2: Defining the analysis data model
A conceptual model that represents an ideal picture of the business concepts and how these concepts relate to each other. This model is platform independent and does not require physical aspects of the implementation. The deliverable is the analysis data model.
Analysis data model
A data model that specifies the normalized data structures required to represent the concepts defined in the conceptual model.
DWH and DM design phases
The business concepts mapped on an entity-relationship (ER) logical model (DWH) and on a multi-dimensional (MD) logical model. These models are the basis for the physical structure of the data in the database. The deliverables are DW design data models and DM design data models.
DW design data models
Data models that represent the enterprise-wide repository of atomic and analytical data used for informational processing
DM design data models
Dimensional models that implement analytical requirements and are structured to allow specific dimensional analyses

Figure 1 summarizes these deliverables.


Figure 1. Deliverables of the four IIW phases
Diagram: conceptual model feeds into analytical requirements and analysis data model. Analytical requirements feed into DW design data models. Analyis data model feeds into DW design data models.  DW design data models feed into data mart design data models.

IIW also defines three model layers:

  • The foundation layer contains the conceptual and analytical requirements models.
  • The analysis layer covers the analysis data model.
  • The design layer contains the DW design and the DM design models.

The diagram in Figure 2 depicts these layers.


Figure 2. IIW model layers
Shows the 3 foundation layers, with the supporting models layer above containing schemas for HIPAA, Sarbanes Oxley, Basel II, IFRS/IAS, and Solvency II

The following sections of the tutorial describe the four phases with examples of each phase using InfoSphere™ Data Architect (IDA). The examples use IBM IIW Model Version 8.2. The IIW model content is imported in IDA with the help of the Enterprise Model Extender (EME) tool. EME is a set of plug-in extensions to the IBM InfoSphere Data Architect product. To follow along with the tutorial, you will need these products installed.

1 of 8 | Next

Comments



Help: Update or add to My dW interests

What's this?

This little timesaver lets you update your My developerWorks profile with just one click! The general subject of this content (AIX and UNIX, Information Management, Lotus, Rational, Tivoli, WebSphere, Java, Linux, Open source, SOA and Web services, Web development, or XML) will be added to the interests section of your profile, if it's not there already. You only need to be logged in to My developerWorks.

And what's the point of adding your interests to your profile? That's how you find other users with the same interests as yours, and see what they're reading and contributing to the community. Your interests also help us recommend relevant developerWorks content to you.

View your My developerWorks profile

Return from help

Help: Remove from My dW interests

What's this?

Removing this interest does not alter your profile, but rather removes this piece of content from a list of all content for which you've indicated interest. In a future enhancement to My developerWorks, you'll be able to see a record of that content.

View your My developerWorks profile

Return from help

static.content.url=http://www.ibm.com/developerworks/js/artrating/
SITE_ID=1
Zone=Information Management
ArticleID=604381
TutorialTitle=Use the IBM Industry Model Information Insurance Warehouse to define smart and mature data models
publish-date=12232010
author1-email=tarabrin@de.ibm.com
author1-email-cc=
author2-email=VGR@de.ibm.com
author2-email-cc=

Tags

Help
Use the search field to find all types of content in My developerWorks with that tag.

Use the slider bar to see more or fewer tags.

Popular tags shows the top tags for this particular content zone (for example, Java technology, Linux, WebSphere).

My tags shows your tags for this particular content zone (for example, Java technology, Linux, WebSphere).

Use the search field to find all types of content in My developerWorks with that tag. Popular tags shows the top tags for this particular content zone (for example, Java technology, Linux, WebSphere). My tags shows your tags for this particular content zone (for example, Java technology, Linux, WebSphere).

Try IBM PureSystems. No charge.