Skip to main content

By clicking Submit, you agree to the developerWorks terms of use.

The first time you sign into developerWorks, a profile is created for you. Select information in your profile (name, country/region, and company) is displayed to the public and will accompany any content you post. You may update your IBM account at any time.

All information submitted is secure.

  • Close [x]

The first time you sign in to developerWorks, a profile is created for you, so you need to choose a display name. Your display name accompanies the content you post on developerworks.

Please choose a display name between 3-31 characters. Your display name must be unique in the developerWorks community and should not be your email address for privacy reasons.

By clicking Submit, you agree to the developerWorks terms of use.

All information submitted is secure.

  • Close [x]

Use the IBM Industry Model Information Insurance Warehouse to define smart and mature data models

Introduction to the data warehousing development method for insurance

Alexander Tarabrin (tarabrin@de.ibm.com), Advisory IT Architect, I.B.M.
Photo of Alexander Tarabrin
Alexander Tarabrin has 14 years of professional experience in the IT domain. He is an Advisory IT Architect in IBM Software Group Services in IBM Germany Alexander is working in the areas of data modeling, data management, industry models, and customer data integration.
Hermann Voellinger (VGR@de.ibm.com), Executive IT Architect, IBM
Photo of Hermann Voellinger
Hermann Voellinger is an Executive IT Architect at IBM Software Group Services. For the last 12 years, he has been the responsible IT architect for big data warehouse (DWH) projects in Germany. The 10 years before working on DWH projects, Hermann worked in the German Development Lab as the lead developer and architect for Text and Data-Mining solutions and tools. His key skills and main focus areas are currently the architecture of data population processes (ETL), data modeling concepts, and the strategy and architecture of DWH solutions.

Summary:  In this tutorial, understand the method for developing data models for data warehouse projects using the IBM Industry Model Insurance Information Warehouse (IIW), which is part of the IBM Industry Models product defined for the domain of insurance. The tutorial shows the best approach to develop core data warehouse (CDW) models and data mart (DM) models. The tutorial also introduces the recommended data warehousing development method (DWDM) to deal with the IIW model pattern framework to architect DWH solutions for insurance companies.

Date:  23 Dec 2010
Level:  Intermediate PDF:  A4 and Letter (1012 KB | 30 pages)Get Adobe® Reader®

Activity:  34697 views
Comments:  

Phase 3: Designing the data warehouse logical model

Business users complete the phases described in the first two phases of this tutorial. Designing IIW logical models is rather IT-oriented, and it is better-suited for IT users (data designers) to complete. The design layer of IIW consists of the atomic part (the enterprise model) and of the analytical part (conformed dimensions model and data mart models). This section describes the IIW enterprise model.

The structure of the enterprise model is similar to the analysis data model. The model is packaged into several packages. Each package consists of a set of artifacts (entities, attributes, and relationships) and some diagrams for visualization needs, as shown in Figure 13. This is the standard structure of logical models in IBM InfoSphere Data Architect.


Figure 13. Sample cut-out from the enterprise model structure
Shows explorer view of the hierarchy

Enterprise model artifacts

The data warehouse model is a data model that represents the enterprise-wide repository of atomic data used for informational processing. This model includes the history of the value changes of business information that may vary over time. You might want to keep track of this history for analytical purposes.

The data warehouse model defines the following attribute types:

Fundamental entity
Contains atomic business information. A fundamental entity is either versioned or non-versioned. It requires either an anchor entity if versioned or a root entity if non-versioned to maintain its versions
Anchor entity
Acts as an anchor to maintain different versions of an entity instance. Anchors are also used as time-invariant keys (TIKs) in a data warehouse environment.
Root entity
Acts as super-type for non-versioned fundamental entities.
Associative entity
Serves as relationship between two root or anchor entities.
Population characteristic entity
Contains information regarding ETL jobs.
Classification entity
Instantiates specific semantic attributes of data type enumeration.

The entities are connected using relationships of different types as follows, defined for the enterprise model:

Design relationship
Connects two entities, usually one-to-many, for design purposes (as navigation or better performance).
Anchor relationship
Connects the fundamental entity with the related anchor entity.

The entities have attributes. The attribute types defined in the enterprise model are as follows:

Basic attribute
Contains business data.
Candidate key
Contains the business key that uniquely identifies an entity instance.
Relationship attribute
Contains a foreign key attribute.
Derived attribute
Contains a value derived (or duplicated) from the value of one or many of the other attributes.

Figure 14 shows a cutout from the data diagram of the claim domain. The attributes are omitted to improve the readability.


Figure 14. Cutout from the enterprise model for claim domain
shows parent-child relationships for claim domain

Customization approach

The approach of the enterprise model customization is described in detail in the online help installed with Enterprise Model Extender. There are specific techniques to add or modify an entity, relationship, or attribute of any type. The techniques are figured out in the form of eight transformation rules. The rules describe the customization approach by adding semantic association entities and semantic associations; transforming super-semantic entities and sub-semantic entities; and coping with derived and duplicate attributes.

When customizing the enterprise model, the data designer should keep in mind that the elements of the enterprise model should be linked to the appropriate elements of the analysis data model.


Towards a physical model

Unlike the analysis data model, the enterprise model is IT-oriented. The enterprise model contains the following features:

Data versioning
One of the most important aspects of the data warehousing design. The fundamental entities of the enterprise model (EM) are featured with the following four attributes:
  • EFFECTIVE_FROM
  • EFFECTIVE_TO
  • VALID_FROM
  • VALID_TO
The EFFECTIVE_FROM/TO attributes rely on the period of semantic validity. The VALID_FROM/TO attributes rely on the timestamps when the data have been populated from the source system and represent the period of technical validity.
Primary keys
The primary keys defined in the EM are 4-bit integers. They are surrogate keys without business semantic. They are not derived from the primary keys of the source systems.
Data population
The entities contain the data population attributes that describe the information about the ETL process.

Summary of phase 3

This section described the enterprise model and the approach to customizing it. You can transfer the enterprise model to the physical model using the InfoSphere Data Architect wizard.

4 of 8 | Previous | Next

Comments



static.content.url=http://www.ibm.com/developerworks/js/artrating/
SITE_ID=1
Zone=Information Management, Industries
ArticleID=604381
TutorialTitle=Use the IBM Industry Model Information Insurance Warehouse to define smart and mature data models
publish-date=12232010
author1-email=tarabrin@de.ibm.com
author1-email-cc=
author2-email=VGR@de.ibm.com
author2-email-cc=