Creating and using the sample data sets

Many of the examples in this document refer to the sample data sets SORT.SAMPIN, SORT.SAMPADD, SORT.BRANCH and SORT.SAMPOUT. Appendix A, "Creating the Sample Data Sets" shows you how to create your own copies of these data sets, using a program called ICESAMP shipped with DFSORT, if you want to try the examples in this document that use them.

Note: Some of the examples use data sets other than SORT.SAMPIN, SORT.SAMPOUT, SORT.SAMPADD, and SORT.BRANCH. You can either create data sets from scratch to match the ones used in the text, or else perform a similar exercise on data sets you already have.

Before you begin, turn to Descriptions of the sample data sets. Many of the examples in this document refer to the sample bookstore data sets as the input data sets, so you should become familiar with them. The input data sets contain the data that you want arranged or sorted. You must specify an input data set for every DFSORT job you run. The sample bookstore data set is named SORT.SAMPIN and the additional bookstore data set is named SORT.SAMPADD.

Each record in the bookstore data sets has 12 fields (book title, author’s last name, and so on). A record can be represented by one horizontal row on the page. A field can be represented by one vertical column on the page.

To sort a data set, you choose one or more fields that you want to use to order the records (arrange in ascending or descending order). These fields are called control fields (or, in COBOL, keys).

As you work through the exercises on the following pages, remember that each entire record is sorted, not just the control field. However, for the sake of simplicity, the figures in the text show only the control fields being discussed. The sorted records actually contain all of the fields, but one page is not wide enough to show them. Descriptions of the sample data sets, shows all of the fields in each record. It is also arranged with headings and numbers that show the byte positions of each field. The numeric fields are in binary format (see Table 2) and therefore will not appear on most displays as they do in this document. Methods you can use to arrange and view the data are explained in the chapters on DFSORT functions that follow.

Table 1 shows an example of sorted fields. Notice the line of numbers above the sorted fields. These numbers represent the byte positions of those fields. You use byte positions to identify fields to DFSORT. The examples show the byte positions to help you while you are learning to use DFSORT. The byte positions do not actually appear in any of your processed data sets.

In Table 1, the first two records, which show nothing in the course department fields, are general purpose books not required for a particular course. For this example, the control field is the Course Department field.

Table 1. Sample Bookstore Data Set Sorted by Course Department in Ascending Order
Book Title Course Department Price
1           75
110  114
170  173
LIVING WELL ON A SMALL BUDGET
PICK'S POCKET DICTIONARY
INTRODUCTION TO BIOLOGY
SUPPLYING THE DEMAND
STRATEGIC MARKETING
COMPUTER LANGUAGES
VIDEO GAME DESIGN
COMPUTERS: AN INTRODUCTION
NUMBERING SYSTEMS
SYSTEM PROGRAMMING
INKLINGS: AN ANTHOLOGY OF YOUNG POETS
EDITING SOFTWARE MANUALS
MODERN ANTHOLOGY OF WOMEN POETS
THE COMPLETE PROOFREADER
SHORT STORIES AND TALL TALES
THE INDUSTRIAL REVOLUTION
EIGHTEENTH CENTURY EUROPE
CRISIS OF THE MIDDLE AGES
INTRODUCTION TO PSYCHOLOGY
ADVANCED TOPICS IN PSYCHOANALYSIS
 
 
BIOL
BUSIN
BUSIN
COMP
COMP
COMP
COMP
COMP
ENGL
ENGL
ENGL
ENGL
ENGL
HIST
HIST
HIST
PSYCH
PSYCH
    9900
     295
    2350
    1925
    2350
    2600
    2199
    1899
     360
    3195
     595
    1450
     450
     625
    1520
     795
    1790
    1200
    2200
    2600

Also notice that records in Table 1 with equally collating control fields (in this case,the same department) appear in their original order. For example, within the Computer Science department (COMP), the title Video Game Design still appears before Computers: An Introduction.

You can control whether records with equally collating control fields appear in their original order or whether DFSORT orders them randomly. The system programmer sets defaults at installation time that you can change with some DFSORT options at run time. The examples in this document assume that the default is for records with equally collating control fields to appear in their original order.

Summary
So far in Getting Started you covered the following concepts:
  • You can sort, copy, or merge data sets using DFSORT.
  • You can write JCL and DFSORT program control statements to create and process DFSORT jobs.
  • You can run DFSORT jobs directly or call DFSORT from a program.

In addition, this chapter covered how to use and read the sample data sets provided with DFSORT. Now continue with tutorials on how to write DFSORT control statements.