Skip to main content

By clicking Submit, you agree to the developerWorks terms of use.

The first time you sign into developerWorks, a profile is created for you. Select information in your developerWorks profile is displayed to the public, but you may edit the information at any time. Your first name, last name (unless you choose to hide them), and display name will accompany the content that you post.

All information submitted is secure.

  • Close [x]

The first time you sign in to developerWorks, a profile is created for you, so you need to choose a display name. Your display name accompanies the content you post on developerworks.

Please choose a display name between 3-31 characters. Your display name must be unique in the developerWorks community and should not be your email address for privacy reasons.

By clicking Submit, you agree to the developerWorks terms of use.

All information submitted is secure.

  • Close [x]

Develop mapping models with IBM InfoSphere Data Architect

Photo of Jorge Machado
Jorge Machado is an IT Architect with IBM Software Group Services in Germany. Having covered various roles within his 10 year history of assignments within IBM, his main focus area now is the planning, design, and development of data warehouse solutions as well as business intelligence strategies for customers in central Europe.
Hermann Voellinger (VGR@de.ibm.com), IT Architect, IBM
Photo of Hermann Voellinger
Hermann Voellinger is an Executive IT Architect at IBM Software Group Services. For the last 12 years, he has been the responsible IT architect for big data warehouse (DWH) projects in Germany. The 10 years before working on DWH projects, Hermann worked in the German Development Lab as the lead developer and architect for Text and Data-Mining solutions and tools. His key skills and main focus areas are currently the architecture of data population processes (ETL), data modeling concepts, and the strategy and architecture of DWH solutions.

Summary:  Designing the mappings for an extract, transform, and load (ETL) process is a critical step in a data warehouse project. Mappings must be easy to modify, capable of version control, easily reported, and easily exported to other formats. This tutorial illustrates how to develop a complete source-to-target mapping model using InfoSphere™ Data Architect. You will also learn about the reporting functions that InfoSphere Data Architect provides.

Date:  27 Jan 2011
Level:  Intermediate PDF:  A4 and Letter (1777 KB | 43 pages)Get Adobe® Reader®

Activity:  16535 views
Comments:  

Before you start

Introduction

The goal of this tutorial is to introduce a method for developing mapping models for data warehouse (DWH) projects using the functionality available in InfoSphere Data Architect (IDA), Version 7.5.2. The design of the mappings for the extract, transform, and load (ETL) process is an absolutely critical part of each data warehouse project. This design of mappings should be stored in a format that fulfills several requirements, including the following:

  • The mapping can easily be changed.
  • Changes can be versioned.
  • It is easy to report the mappings.
  • It can be exported to other formats.

You will learn in this tutorial how these goals can be reached with IDA. The tutorial builds describes all the necessary steps to build your first mapping model and to create a report for this model. The high-level steps are:

  1. Create a project in IDA
  2. Create the source data model (using the DB2 SAMPLE database)
  3. Create the target data model
  4. Create mappings between source and target tables
  5. Analyze your model using the reporting functions

This tutorial describes each of these five steps in detail. At the end, you will have your first meaningful mapping report in PDF style by using the inherited functionality of the IDA tool.

Product overview and prerequisites

The IBM InfoSphere Data Architect (previously called Rational Data Architect) is an enterprise data modeling and integration design tool that provides a collaborative data design solution that you can use to discover, model, relate, and standardize diverse and distributed data assets.

IDA is based on an open-standard Eclipse integration.

In order to work through this tutorial you need to install the following:

  • InfoSphere Data Architect, Version 7.5.2
  • IBM DB2 9.7 Enterprise Server Edition with its sample database
  • An instance of IBM DB2 Enterprise Edition and the standard sample database installed and accessible on the same computer

See Resources for links to trial versions of both DB2 and IDA.

1 of 9 | Next

Comments



Help: Update or add to My dW interests

What's this?

This little timesaver lets you update your My developerWorks profile with just one click! The general subject of this content (AIX and UNIX, Information Management, Lotus, Rational, Tivoli, WebSphere, Java, Linux, Open source, SOA and Web services, Web development, or XML) will be added to the interests section of your profile, if it's not there already. You only need to be logged in to My developerWorks.

And what's the point of adding your interests to your profile? That's how you find other users with the same interests as yours, and see what they're reading and contributing to the community. Your interests also help us recommend relevant developerWorks content to you.

View your My developerWorks profile

Return from help

Help: Remove from My dW interests

What's this?

Removing this interest does not alter your profile, but rather removes this piece of content from a list of all content for which you've indicated interest. In a future enhancement to My developerWorks, you'll be able to see a record of that content.

View your My developerWorks profile

Return from help

static.content.url=http://www.ibm.com/developerworks/js/artrating/
SITE_ID=1
Zone=Information Management
ArticleID=618702
TutorialTitle=Develop mapping models with IBM InfoSphere Data Architect
publish-date=01272011
author1-email=jorge_machado@de.ibm.com
author1-email-cc=
author2-email=VGR@de.ibm.com
author2-email-cc=

Tags

Help
Use the search field to find all types of content in My developerWorks with that tag.

Use the slider bar to see more or fewer tags.

Popular tags shows the top tags for this particular content zone (for example, Java technology, Linux, WebSphere).

My tags shows your tags for this particular content zone (for example, Java technology, Linux, WebSphere).

Use the search field to find all types of content in My developerWorks with that tag. Popular tags shows the top tags for this particular content zone (for example, Java technology, Linux, WebSphere). My tags shows your tags for this particular content zone (for example, Java technology, Linux, WebSphere).

Try IBM PureSystems. No charge.