Collecting data warehouse data with the Data Collection Component
Use the Data Collection Component to extract data from the data warehouse. Set up the data collection jobs to extract the data that you need.
About this task
Data Collection Component is designed to improve performance with easier setup, configuration and deployment.
If you want to administer Data Collection Component in a production environment, you must use enterprise-level products, such as Db2® as the database, and either WebSphere Application Server Liberty Profile or WebSphere Application Server as the application server.
Procedure
-
Start the application server, which hosts the Jazz®
Team Server and
Data Collection Component. For details, see Deploying and starting the server topic.
For the application server, you have the following options:
- WebSphere Application Server Liberty Profile
- Starting in version 6.0.1, you can quickly get started with running data collection jobs by
running WebSphere Application Server
Liberty Profile
application server that is packaged with the Jazz
Team Server. The
Data Collection Component
by default occupies the same ports as the Jazz
Team Server:
9080
for HTTP and9443
for HTTPS. If you need to change the ports, backup your server.xml file located in<JTSInstallDir>\JazzTeamServer\server\liberty\servers\clm
directory and edit the appropriate port numbers in the server.xml file. This server.xml file is created only after the first-time you start the Liberty Profile server.WebSphere Application Server Liberty Profile example:- To start the WebSphere Application Server
Liberty Profile server
that is packaged with the Jazz
Team Server:
- In a command prompt, start the server from
[JTSInstallDir]\JazzTeamServer\server\server.startup.bat
Server clm started
.Tip:Here is an example of the console output when the Liberty Profile server is first created and started:On Windows operating system, the Liberty Profile server does not run in a visible console window, like Apache Tomcat server does.
Server clm created. 4547 File(s) copied Starting server clm. Server clm started.
- In a command prompt, start the server from
- If you want to later stop Liberty Profile that
is packaged with the Jazz
Team Server:
- In a command prompt, stop the server from
[JTSInstallDir]\JazzTeamServer\server\server.shutdown.bat
Server clm stopped
. - In a command prompt, stop the server from
- To start the WebSphere Application Server
Liberty Profile server
that is packaged with the Jazz
Team Server:
- Apache Tomcat
- Starting in version 6.0.1, Apache Tomcat is not provided as a default application server.
However, Apache Tomcat is supported and you can use your own installed copy. For details to install
and configure the Apache Tomcat with Rational® solution
for Collaborative Lifecycle Management (CLM), use the
Interactive installation guide.Tip: By default, all the Apache Tomcat port numbers are in the 8000 range. The Data Collection Component by default occupies the same ports as the Jazz Team Server:
9080
for HTTP and9443
for HTTPS. If you want to change the default 8080 (HTTP) and 8443 (HTTPS) port numbers, complete these steps:- Go to
<JTSInstallDir>\JazzTeamServer\server\tomcat\conf
directory and open the server.xml file for editing. - Change all instances of
port="8080"
andredirectPort="8443"
toport="9080"
andredirectPort="9443"
. - Save the file.
Apache Tomcat example:- To start the Apache Tomcat server:
- In a command prompt, start the server from
[JTSInstallDir]\JazzTeamServer\server\server.startup.bat -tomcat
Tip: A separate Apache Tomcat console window opens. If you close this window, the server stops. You need to wait a moment for the Tomcat server to start. After the server is started, the startup time in millisecond is displayed in the Tomcat window. For example, Server startup in 26964 ms. - In a command prompt, start the server from
- If you want to later stop the Apache Tomcat server:
- In a command prompt, stop the server from
[JTSInstallDir]\JazzTeamServer\server\server.shutdown.bat -tomcat
Tip: You can determine the server stops completely, when the pop-up window that displays the console output for the Tomcat server disappears and is no longer available. - In a command prompt, stop the server from
- Go to
- WebSphere Application Server
- If you want to administer the Data Collection Component
application in an enterprise environment, you can install and configure a new or existing WebSphere Application
Server. For
details on configuring Data Collection Component
application in WebSphere Application Server, see Deploying and
starting WebSphere Application Server topic.WebSphere Application Server example:
- To start the WebSphere Application
Server by running
the following
command:
For example:<WASInstallDir>\profiles\<profile_name>\bin\StartServer.bat server1
C:\Program Files\ibm\WebSphere\AppServer\profiles\DCCProfile\bin\StartServer.bat server1
- If you want to later stop the WebSphere Application
Server, you can
run the following
command:
For example:<WASInstallDir>\profiles\<profile_name>\bin\StopServer.bat server1
C:\Program Files\ibm\WebSphere\AppServer\profiles\DCCProfile\bin\StopServer.bat server1
- To start the WebSphere Application
Server by running
the following
command:
-
Open a web browser to the dedicated Data Collection Component
location.
The URL is
https://<server>:<port>/dcc/web
, where <server> is the fully qualified domain name orlocalhost
and the context root is/dcc/web
.Example:In a web browser, type the address:
https://localhost:9443/dcc/web
-
When you are prompted, provide your user ID and password to authenticate the Data Collection Component
application set up on the Jazz
Team Server.
The user ID must be a member in the JazzAdmins group.
Example:- Default user ID (case-sensitive):
ADMIN
- Default password (case-sensitive):
ADMIN
- Click Log In.
Tip: If the Data Collection Component is not registered with a Jazz Team Server, one of the following error messages displays in the web browser:Error! Data Collection Component could not be loaded due to a syntax error or missing dependency. Page ID: com.ibm.rational.datacollection.web.ui.internal.pages.DataCollectionManagementPage
HTTP Status 503 - CRJAZ1173W The com.ibm.rational.datacollection.service.internal.web.IWebRedirector service is not available.
For details on how to register Data Collection Component with the Jazz Team Server, see Running the setup by using Custom setup in the setup wizard. - Default user ID (case-sensitive):
-
If this is your first time opening the Data Collection Component
application, you might get the following error message:
Failed to load Licenses Data Collection resource. Error 500: CRRCD9002E The Data Collection Component is still initializing. Refresh the page a few minutes later.
You must wait a few minutes as the application is initializing and goes into a loading state. You can determine the initialization is complete, when you refresh the web browser and the pages in the Data Collection Component are populated. For example, in the Data Collection Configuration page, wait for the sections under Data Warehouse Properties and Data Collection Properties to populate.Learn more about Data Collection Configuration and Administration pages:The Data Collection Configuration page displays and lets you manage the essential settings for the Data Collection Component.
Advanced settings that are common settings amongst the administration pages for the CLM applications are available for Data collection Component. To go the administration page for Data Collection Component, select one of the following options:- When the Data Collection Component page
(for example
https://localhost:9443/dcc/web
) is open in your web browser, you can select from the toolbarAdministration > Manage Application. The administration page for Data Collection Component page opens in the web browser with the page title: Application Administration - Data Collection Component (/dcc)
- Alternatively, open a web browser to
https://<server>:<port>/dcc/admin
. For example,https://localhost:9443/dcc/admin
- When the Data Collection Component page
(for example
-
Test the data warehouse connection.
Example:
- On the left-pane of the Data Collection Component application, under the Configuration section, click the Data Collection Configuration link.
- On the Data Collection Configuration page, click the Test Connection button.
- A successful data warehouse connection displays the message: Connection tested
successfully.
A failed data warehouse connection displays the message: Failed to connect to the data warehouse. For more details, click the show details link.
-
Configure resource groups.
Important: This step is required if you want to collect data from any of the following sources:
- Rational solution for Collaborative Lifecycle Management (CLM) applications on a remote Jazz Team Server
- Rational DOORS®
- Rational ClearQuest®
When you run Data Collection Component on a local Jazz Team Server, resource groups are automatically setup for the CLM applications and data warehouse that are registered on this same Jazz Team Server. You can continue to the next step as you are ready to collect data from applications on the local Jazz Team Server.
Resource Groups are the applications containing the data that you want to collect and register in Data Collection Component to run data collection jobs against.
Data Collection Component supports collecting data from the following Rational solution for CLM applications:- Jazz Team Server
- Rational Quality Manager
- Rational Team Concert®
- Rational DOORS Next Generation
- Rational DOORS
- Rational
ClearQuestImportant: Before you can collect data from Rational ClearQuest, you must install and configure the data adapter as part of ALM Cognos Connector. See, Configuring the Rational ClearQuest data adapter.
To extract data from Rational ClearQuest and load it into the CLM data warehouse, see https://jazz.net/wiki/bin/view/Main/MigratingClearQuestETLFromDataManagerToDCC.
Before you can collect data from Rational DOORS, you must prepare the data in Rational DOORS. See Configuring Rational DOORS data for external applications.
Before running data collection jobs, applications must be registered in Data Collection Component as resource groups. Resource groups are automatically configured for CLM applications and data warehouse that are registered on same Jazz Team Server as Data Collection Component. For CLM applications, you can discover applications that are registered with a local or external Jazz Team Server or you can manually add the resource groups. For non-CLM applications, you must manually add the resource groups.
Example:- On the left-pane of the Data Collection Component application, under the Configuration section, select the Resource Group Configuration link.
- To automatically detect applications that are registered with your Jazz
Team Server, on the
Resource Group Configuration page, click Discover. You have the option to
discover CLM
applications on either a local or remote Jazz
Team Server one at a
time, however you can repeat this step to register a mixture of local and remote resource groups:
- To discover CLM
applications on a local Jazz
Team Server:
- In the Add Resource Group window, you can view the list of CLM applications local to your Jazz Team Server where Data Collection Component is also registered. By default, under the Application Instance column, the check boxes are selected because the local CLM applications have been automatically added as resource groups, and ready for running data collection jobs against them.
- To discover CLM
applications on a remote Jazz
Team Server:
- On the Add Resource Group window, select the Use an external JTS to discover applications check box. This check box allows you to discover CLM applications registered on a remote machine hosted in an external Jazz Team Server.
- In the JTS Root Uri field, specify the location of the external JTS:
https://<hostname>:<port>/jts
. For example:https://my.remote.host.com:9443/jts
- Click the Discover button to display a list of CLM applications registered on the external Jazz Team Server on a remote machine.
- For example, under the Application Instance column, select the following
list of check boxes to the CLM
applications on the remote machine to add as a resource group for collecting and running data
collection jobs:
/rm
/qm
my.remote.host.com:9443/jts
/ccm
- Click OK.
- Back on the Resource Group Configuration page, under each of the product groups you will see the resource groups that have been added.
- To discover CLM
applications on a local Jazz
Team Server:
- To manually detect applications that are registered with your Jazz
Team Server:Tip: If you want to add only CLM applications as resource groups, you can skip this step if you have already automatically detected and discovered the applications from the previous step. However, you must complete this step if you want to add non-CLM applications as resource groups.
- On the Resource Group Configuration page, choose and expand one of the product groups that you want to add as a resource group.
- Click Add. A resource group entry is added under the selected product group.
- Expand the resource group entry that was added under the product group.
- You must configure the location of the resource group by specifying the URL to the application
registered on your Jazz
Team Server. For example:
- Jazz Team Server
https://<server>:<port>/jts
- Quality Management (QM)
https://<server>:<port>/qm
- Change and Configuration Management (CCM)
https://<server>:<port>/ccm
- Rational DOORS Next Generation
https:/<server>:<port>/rm
- Rational DOORS
https:/<server>:<port>/dwa-rrtranslator
- Rational ClearQuest
https:/<server>:<port>/DataServices/ClearQuest/<connection>/<schema-repository>
- For each of these resource group that was added, specify the additional configuration details,
such as the authentication type, version, and unique identifier:
- In the Authentication type list, select the authentication setting of the
resource group. The valid values are Username and Password,
OAuth-JTS or Jazz Security Architecture SSO:
- Username and Password
- When you select Username and Password as the authentication type, the Username and Password fields are displayed. In the Username field, specify the user name of the resource group. And in the Password field, specify the password of the resource group.
- OAuth-JTS
- When you select OAuth-JTS as the authentication type, the
Consumer key and Secret fields are displayed. In the
Consumer key field, specify the consumer key obtained from the Jazz
Team Server. And in the
Secret field, specify the secret of the consumer key.Tip: If the resource group is local to your Jazz Team Server where Data Collection Component is also registered, the Consumer key and Secret fields are automatically complete with the values configured when the application was registered and finalized during the Jazz Team Server setup.
- Jazz Security Architecture SSO
- This option is available, if you enabled Jazz Security Architecture single sign-on (SSO) authentication on all Jazz applications.
- In the Version field, specify the version-level of the resource group.
For example, type
6.0
or5.0
for the release of Rational solution for Collaborative Lifecycle Management (CLM),8.0
for the release of Rational ClearQuest or9.6.1.6
for the release of Rational DOORS. - In the Unique Identifier field, you must specify a globally unique
identifier that should never be changed in the future. This identifier is used to fill in the
SOURCE_ID
column of most data warehouse tables. The Rational Insight® XDC (XML Data Configuration) tool generates this identifier automatically for each resource group that is added to a XDC file. When migrating from a Rational Insight ETL (extract, transform, load process) to Data Collection Component, ensure that you re-use the same identifiers from the XDC files. You can find the identifier by opening the XDC file using a text editor and looking for the following XML text:
In this example, the identifier is<ResourceGroupCategory guid="_UerUAI0FEd2cD8t22EkwSQ" name="ClearQuest ALM" description=""> <ResourceGroup guid="_8_QnwBLoEd2rmrYM7OkLOA" name="CQALM" description="Use Only for ALM" version="7.1" url="http://localhost:9080/DataServices/ClearQuest/7.0.0/SAMPL" ...
"_8_QnwBLoEd2rmrYM7OkLOA"
. For reports that are not migrated from Rational Insight, you can use a GUID generator such as www.guidgenerator.com to generate a unique identifier to copy and paste into that field.
Remember: Under the Rational Reporting Data Warehouse Database > A Relational Database DataSource section, there is a special resource group which is the data warehouse. This data warehouse is needed as a data source to create the fact and dimension tables for the data-mart. You must configure the warehouse by specifying the username, password and URL to the data warehouse. For example for a Derby data warehouse, specify the following configuration settings:- Relational Database Type: Derby Client
- URL:
//localhost:1527/conf/jts/derby/warehouseDB
Restriction: If you use Apache Derby as your data warehouse and the application server (which hosts the Jazz Team Server and Data Collection Component) is an Apache Tomcat server that runs as a Windows service, you must specify an absolute path for the database alias part of the URL to the relational database data source. For example, change //localhost:1527/conf/jts/derby/warehouseDB to:
Otherwise, for this particular environment a relative path specified in the URL causes some of the data collection jobs to fail.//localhost:1527/C:/PROGRA~1/IBM/JazzTeamServer/server/conf/jts/derby/warehouseDB
- For the User name and Password, keep the default
value to
none
. - For the Version, select the version-level of the resource group is from
6.0
or5.0
release of Rational solution for Collaborative Lifecycle Management (CLM).
- In the Authentication type list, select the authentication setting of the
resource group. The valid values are Username and Password,
OAuth-JTS or Jazz Security Architecture SSO:
- Click the Test Connection link for each of the resource group that was added.
- A successful connection to the resource group displays the following message: Successfully connected to the resource group.
- A failed connection to the resource group displays the following message: Failed to connect to the resource group. For more details, click the show details link.
- Remember to click Save button in the upper right-hand corner of the Resource Group Configuration page to avoid losing your newly added settings.
-
Specify the load type for the next data collection job.
Full loads occur during the initial run of the data collection jobs, which builds all the data warehouse tables from scratch. These jobs take longer than typical data collection jobs that run delta builds, which only collect changes from the last time the job ran or a specified date. Any subsequent loads after the initial load or after running a job with a changed load type configuration, automatically defaults back to the Delta load since previous run setting.
Remember: You must reserve full loads for populating a new or empty data warehouse database, and run delta loads for a data warehouse that is already populated with data.A full load can take a long period of time to complete, such as several days, depending on the amount of data to process. If you are required to do a full load, keep in mind that to generate complete and accurate reports, a full load needs to completely pull in all the data that exists in the configured resource group, for example a specified CLM application. Otherwise if a full load is interrupted, this interruption can cause your reports to contain incomplete or inaccurate data generated from a not entirely loaded data warehouse. In addition, an interrupted full load can cause the data collection jobs to fail.
Example:You can change the next load configuration settings or keep the default load configuration at Delta load since previous run setting:- On the left-pane of the Data Collection Component application, under the Configuration section, select the Delta Load Configuration link.
- Under the Enabled column, select the check box of the resource group that you want to change the load configuration.
- Under the Load Type column, select one of the following load type option
from the drop-down menu:
- Full load
- This load job rebuilds all the data warehouse tables from scratch.
- Delta load since previous run
- This load job only loads the data that has changed or been added since the last successful load. This is the default setting.
- Delta load since date
- This load job requires you to specify a date to load the data that has changed or been added since the selected date.
- To apply the changes, click Save.
Remember: When you change the next load job to a Full load or Delta load since date option, after the job runs with this load type, the load type setting automatically defaults back to the Delta load since previous run setting. -
Decide which data collection jobs you want to run by loading, selecting, or removing any
predefined or custom jobs.
Custom jobs are jobs that you developed from a new or existing ETL (Extract, Transform, and Load) .ttl file. For details on creating custom jobs, see https://jazz.net/wiki/bin/view/Main/DataCollectionCustomizationUsingDCC
Predefined jobs, also referred to as out-of-box jobs, are jobs packaged with the product. You can use these predefined jobs as samples when developing your own custom jobs. The predefined jobs are located inside the com.ibm.rational.datacollection.service.jar file in the following directory:-
<installation directory>/server/conf/dcc/sites/dcc-updatesite/plugins/
For example:
C:\Program Files\ibm\JazzTeamServer\server\conf\dcc\sites\dcc-updatesite\plugins\com.ibm.rational.datacollection.service_<version>.jar
If you want to view the predefined jobs, copy the com.ibm.rational.datacollection.service.jar to a temporary location outside of the product, and use an extraction tool to decompress the .jar file. You can find the predefined jobs in the /mapping directory inside the .jar file.
Loading jobs:To load all the predefined (out-of-box) jobs and any jobs in the Data Collection Component mapping directory:- If you want to run a job that you developed (custom job), you must add the custom .ttl job files in the Data Collection Component mapping directory. This directory is located at <installation directory>/server/conf/dcc/mapping. For example, place your custom .ttl job files into the following directory: C:\Program Files\ibm\JazzTeamServer\server\conf\dcc\mapping.
- In the Data Collection Jobs page, click the Load Jobs button.
- You must wait a few minutes for the jobs to load. You can refresh the web browser multiple times until all your jobs display in the Data Collection Jobs page.
Remember: Each time the Load Jobs button is clicked, by default all the predefined jobs gets loaded with its default settings. In addition, any jobs in the Data Collection Component mapping directory also gets loaded.Removing jobs:To remove jobs that you do not want to run in the next data collection session:- In the Data Collection Jobs page, locate the job that you want to remove.
- Under the Schedule column, hover and click the Delete data
collection job icon (
).
Remember: If you click the Load Jobs button, your job removal selection may not persist. The Load Jobs action causes by default all the predefined jobs to get loaded with its default settings. In addition, any jobs in the Data Collection Component mapping directory are also loaded. The mapping directory is located in the <installation directory>/server/conf/dcc/mapping directory, for example: C:\Program Files\ibm\JazzTeamServer\server\conf\dcc\mapping.Selecting jobs:To select jobs that you want to run:- In the Data Collection Jobs page, select the check box for each data collection job that you want to run.
- If you select a check box for a data collection job that is by default clear, a window pops up
confirming if you are sure that you want to enable the job. The reason for the confirmation is
because running this job can impact the performance of the corresponding applications and the
overall time to collect data.Tip: Limit scheduling the Data-mart data collection jobs to run as infrequently as possible, for example at most once a day. In addition, jobs in this collection must never run individually. Processing Data-mart data collection jobs can take a significant amount of time because Data-mart jobs are not delta jobs and typically insert a lot of data in the data warehouse.
Remember: If you click the Load Jobs button, your job selection may not persist. The Load Jobs action causes by default all the predefined jobs to get loaded with its default settings. In addition, any jobs in the Data Collection Component mapping directory are also loaded. The mapping directory is located in the <installation directory>/server/conf/dcc/mapping directory, for example: C:\Program Files\ibm\JazzTeamServer\server\conf\dcc\mapping. -
-
Run the data collection jobs.
You can choose to run all the data collection jobs, run a specific data collection, or run a specific data collection job.
On the Data Collection Jobs page, you can see that the Data Collection Component categorize the jobs into data collections, such as Operational Data Store (ODS), Data-Mart, and Licenses.
You no longer need to worry about running the jobs in a strict and proper sequence. The Data Collection Component offering is designed to improve performance by using parallel and concurrent processing. Jobs in a data collection, for example in an ODS data collection run in parallel concurrent process. But before the jobs in the Data-Mart data collection can start it needs to wait for the jobs in the ODS data collection to complete. The reason for this is because the Data-Mart data collection extracts data from the ODS data collection, transforms it using a set of rules, then loads it into the metrics tables as sets of point-in-time information and relationships. This process is cumulative, meaning that a new set of information is added each time the Data-Mart data collection job is run, resulting in a collection of many sets of point-in-time metrics and relationships. This type of data can be used for reports that show metrics, trends, aggregations, and other relationships among the data.Tip: Limit scheduling the Data-mart data collection jobs to run as infrequently as possible, for example at most once a day. In addition, jobs in this collection must never run individually. Processing Data-mart data collection jobs can take a significant amount of time because Data-mart jobs are not delta jobs and typically insert a lot of data in the data warehouse.Select your scope for running data collection jobs:
Running all the data collection jobs:You can trigger the data collection jobs for all the registered applications at once. The jobs within an Operational Data Store data collection run in parallel concurrent process. However, before the jobs in the Data-Mart data collection can start it needs to wait for the jobs in the Operational Data Store data collection to complete.
- On the Data Collection Jobs page, click the Run all data warehouse collection jobs link.
Running a specific data collection:You can trigger all the jobs in a data collection. The data collection groups are Operational Data Store (ODS), Data-Mart, and Licenses.
- On the Data Collection Jobs page, choose a data collection that you want
to run:
- ODS Data Collection
- Data-Mart Data Collection
- Licenses Data Collection
- Click the corresponding Run link available in the section heading of the data collection that you chose in the previous step.
Running a specific data collection job:The status of each data collection job can change to one of these states:You can trigger a particular data collection job to run.
- On the Data Collection Jobs page, locate the job that you want to run.
- Under the Schedule column, hover and click the Run data
collection job icon (
).
- Idle
- The data collection job is inactive.
- Running
- The data collection job is currently in progress.
- Stopping
- The data collection job is terminating.
-
Monitoring the data collection jobs.
On the Data Collection Jobs Status page, you can view your collection status. You can check what jobs have completed or currently running, and find the job logs.
Example:- Open the Data Collection Jobs Status page by going to the left-pane of the Data Collection Component application, under the Data Collection section, click the Data Collection Jobs Status link.
- To refresh and see what jobs are still running or completed, click the
Refresh button (
) next to the Data Collection Jobs Status link. The Data Collection Jobs Status link is available on the left-pane of the Data Collection Component application, under the Data Collection section.
- You can view the list of jobs that are running under the Currently Running
Jobs section, which displays the following details:
- Job Name
- Displays the name of the data collection job that is running.
- Start Time
- Displays a
Waiting to start
status when a job is issued but has not yet started. Displays the date and 24-hour clock timestamp when a job is started, for example:2014-04-23 15:53:56
- Running Time
- Displays the length of time the job has been running, since you last refreshed the Data
Collection Jobs Status page. For example,
0 hours, 0 minutes, 12 seconds
.
- You can view the list of jobs that have completed under the Job History
section, which displays the following details:
- Job Name
- Displays the name of the data collection job that is completed.
- Status
- Displays if the job completed is
Success
,Failed
, orCancelled
. - Start Time
- Display the date and 24-hour clock timestamp when a job is started, for example:
2014-04-23 16:02:53
- End time
- Display the date and 24-hour clock timestamp when a job is completed, for example:
2014-04-23 16:02:55
- Time Taken
- Displays the total length of time to complete the job, for example
0 hours, 0 minutes, 2 seconds
- When you expand a job under the Job History section, you get the
following additional details:
- Resource Group
- Displays the registered application that the job is collecting data from.
- Status
- Displays if the job completed is
Success
,Failed
, orCancelled
. - Start Time
- Displays the date and 24-hour clock timestamp when a job is started, for example:
2014-04-23 16:02:53
- End Time
- Displays the date and 24-hour clock timestamp when a job is completed, for example:
2014-04-23 16:02:55
- Running Time
- Displays the total length of time to complete the job, for example
0 hours, 0 minutes, 2 seconds
- Processed Resources
- Displays the number of records processed.
- Delivered Rows
- Displays the number of records that were delivered to the data warehouse.
- Failed Resources
- Displays the number of records that failed to deliver to the data warehouse.
- Log
- A Log link that you can click to download and view the log file.
- You can limit the number of records display in the job history. Under the Job History section, in the Number of previous jobs to display data for field, type the number of records you want to display in the job history, for example 20 and then click the Run button.
- You can delete the job history. Click the Delete History link for the
following drop-down options:
- Delete All
- A Delete Job History window opens. Click OK to delete all job history. Or click Cancel.
- Delete Before
- You can select a date to delete all job history on or before the selected date.
- Delete After
- You can select a date to delete all job history on or after the selected date.
- Delete Between
- You can select two dates to delete all job history inclusively between the two selected dates.
-
Schedule data collection jobs.
After you are satisfied with the initial run of your data collection job, you have the option to schedule your data collection jobs to run in multiple intervals or times throughout the day. You can trigger to run a data collection job using either of the two schedule options:
- Interval schedule
- Runs a data collection job at every X amount of minutes of the day.
- Daily schedule
- Runs a data collection job at particular times of the day.
Example:- Open the Data Collection Jobs page by going to the left-pane of the Data Collection Component application, under the Data Collection section, click the Data Collection Jobs link.
- On the Data Collection Jobs page, choose a data collection group that you want to schedule the jobs:
- ODS Data Collection
- Data-Mart Data Collection
- Licenses Data Collection
- Click the Schedule link available on the right-side of your selected data collection group.
- In the Edit schedule window, use the drop-down menu on the upper right to
select one of the following schedule options:
- Interval Schedule
-
- By default the Enabled check box is selected to turn on the schedule. However, if you need to temporarily turn off the schedule, clear the Enabled check box, such as when performing a server rename. For details, see Impact of server rename on the Data Collection Component topic.
- Under the Interval group, you can specify to collect data every X amount of minutes, by typing a number in the Current time interval field. The minimum amount of minutes is 5.
- Under the Week Days group, you can specify to collect data on certain
days.
- To select the multiple days of the week, press and hold down the Ctrl key and click the days of the week you want to collect data. For example, Monday, Wednesday, and Friday.
- To select all the days of the week to collect data, click the All button.
- After you specify the interval schedule, click the OK button in the
Edit schedule window.
- Daily Schedule
-
- By default the Enabled check box is selected to turn on the schedule. However, if you need to temporarily turn off the schedule, clear the Enabled check box, such as when performing a server rename. For details, see Impact of server rename on the Data Collection Component topic.
- Under the Times group, you can specify more than one particular time of
the day that you want to collect data. For example, run a 9:00 am, noon, and before end of day 4:00
pm data collection job throughout the day.
- Click the Create Time button each time you want to add a time of the day entry.
- In the first drop-down menu, specify the hour of the day in a 24-hour clock.
- In the second drop-down menu, specify the minutes of the selected hour.
- To remove a time, click the Remove time icon that is next to the time
entry that you want to remove.
- Under the Week Days group, you can specify to collect data on certain
days at the previously specified times of the day.
- To select the multiple days of the week, press and hold down the Ctrl key and click the days of the week you want to collect data. For example, Monday, through Friday.
- To select all the days of the week to collect data, click the All button.
- After you specify the daily schedule, click the OK button in the
Edit schedule window.
- To apply schedule settings, you must click the Save button available on the upper-right hand corner of the Data Collection Jobs page.
- To remove a schedule setting:
- On the Data Collection Jobs page, choose a data collection group that you want to remove the
schedule setting:
- ODS Data Collection
- Data-Mart Data Collection
- Licenses Data Collection
- Click the Unschedule link available on the right-side of your selected data collection group.
- To apply the removal of the schedule settings, you must click the Save button available on the upper-right hand corner of the Data Collection Jobs page.
- On the Data Collection Jobs page, choose a data collection group that you want to remove the
schedule setting:
-
Get email notifications when a data collection job fails.
Remember: Before you can receive email notifications from Data Collection Component, you must verify the Jazz Team Server is enabled to send email notifications by properly configured to integrate with an email server. For details, see Configuring email settings topic.Example:
- On the left-pane of the Data Collection Component application, under the Configuration section, select the Data Collection Configuration link.
- In the Data Collection Properties section, click the Edit link.
- In the ETL Administrator E-mail List field, type an email address for subscribing to notifications when a data collection job fails. Multiple email addresses can also be specified by using a comma-separated list.
- Click the Save button to update your changes.
Data collection jobs failed for: https://<server>:<port>/dcc/web#action=com.ibm.rational.datacollection.manageResources.dataCollectionJobStatus [MapGroup]Name:WORKITEM_ODS_DB | Status: Failed | Start Time: Fri Sep 25 15:25:28 EDT 2015 | End Time: Fri Sep 25 15:25:30 EDT 2015 | Duration Time (milliseconds):1326 | Retreived Resources: 0 | Processed Resources:0 | Failed Resources:0 | Inserted Rows:0 | Updated Rows:0 [ResourceGroup]Name:RIDW | Status: Failed | Start Time: Fri Sep 25 15:25:28 EDT 2015 | End Time: Fri Sep 25 15:25:30 EDT 2015 | Duration Time (milliseconds):1139 | Retreived Resources: 0 | Processed Resources:0 | Failed Resources:0 | Inserted Rows:0 | Updated Rows:0