Overview (DATASET ACTIVATE command)
The DATASET commands (DATASET NAME, DATASET ACTIVATE, DATASET DECLARE, DATASET COPY, DATASET CLOSE) provide the ability to have multiple data sources open at the same time and control which open data source is active at any point in the session. Using defined dataset names, you can then:
- Merge data (for example, MATCH FILES, ADD FILES, UPDATE) from multiple different source types (for example, text data, database, spreadsheet) without saving each one as an external IBM® SPSS® Statistics data file first.
- Create new datasets that are subsets of open data sources (for example, males in one subset, females in another, people under a certain age in another, or original data in one set and transformed/computed values in another subset).
- Copy and paste variables, cases, and/or variable properties between two or more open data sources in the Data Editor.
The DATASET ACTIVATE command makes the named dataset the active dataset in the session.
- If the previous active dataset does not have a defined dataset name, it is no longer available in the session.
- If the previous active dataset has a defined dataset name, it remains available for subsequent use in its current state.
- If the named dataset does not exist, an error occurs, and the command is not executed.
- DATASET ACTIVATE cannot be used within transformation structures such as DO IF, DO REPEAT, or LOOP.
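The rules above can be illustrated with a minimal sketch. The file paths and the dataset name survey are hypothetical and are used only for illustration.
GET FILE='/data/survey.sav'.
DATASET NAME survey.
GET FILE='/data/scratch.sav'.
* The scratch data is now the active dataset and has no dataset name.
DATASET ACTIVATE survey.
* The survey dataset is active again, and the unnamed scratch data is no longer available.
Because survey was given a dataset name before the second file was opened, it stayed available. Assigning a name to the scratch data with DATASET NAME before the DATASET ACTIVATE command would have kept it available as well.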
Basic Specification
The basic specification for DATASET ACTIVATE is the command name followed by the name of a previously defined dataset. See the topic DATASET NAME for more information.
WINDOW keyword
The WINDOW keyword controls the state of the Data Editor window associated with the dataset.
ASIS. The Data Editor window containing the dataset is not affected. This is the default.
FRONT. The Data Editor window containing the dataset is brought to the front and the dataset becomes the active dataset for dialog boxes.
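As a short sketch, assuming a dataset named file1 has already been defined with DATASET NAME, the following activates it and brings its Data Editor window to the front:
DATASET ACTIVATE file1 WINDOW=FRONT.
Omitting the keyword is equivalent to specifying WINDOW=ASIS, since ASIS is the default.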
Operations
- Commands operate on the active dataset. The active dataset is the data source most recently opened (for example, by commands such as GET DATA, GET SAS, GET STATA, GET TRANSLATE) or most recently activated by a DATASET ACTIVATE command.
  Note: The active dataset can also be changed by clicking anywhere in the Data Editor window of an open data source or selecting a dataset from the list of available datasets in a syntax window toolbar.
- Variables from one dataset are not available when another dataset is the active dataset.
- Transformations to the active dataset--before or after defining a dataset name--are preserved with the named dataset during the session, and any pending transformations to the active dataset are automatically executed whenever a different data source becomes the active dataset.
- Dataset names can be used in most commands that can contain references to IBM SPSS Statistics data files.
- For commands that can create a new dataset or overwrite an existing dataset, you cannot use the dataset name of the active dataset to overwrite the active dataset. For example, if the active dataset is mydata, a command with the subcommand /OUTFILE=mydata will result in an error. To overwrite a named active dataset, use an asterisk instead of the dataset name, as in: /OUTFILE=*.
- Wherever a dataset name, file handle (defined by the FILE HANDLE command), or filename can be used to refer to IBM SPSS Statistics data files, defined dataset names take precedence over file handles, which take precedence over filenames. For example, if file1 exists as both a dataset name and a file handle, FILE=file1 in the MATCH FILES command will be interpreted as referring to the dataset named file1, not the file handle.
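The overwrite rule can be sketched with the AGGREGATE command, chosen here only as one example of a command that accepts an OUTFILE subcommand. The file path and the variables region and income are hypothetical.
GET FILE='/data/mydata.sav'.
DATASET NAME mydata.
* Specifying /OUTFILE=mydata below would result in an error, because mydata is the active dataset.
AGGREGATE
  /OUTFILE=*
  /BREAK=region
  /AvgIncome=MEAN(income).
The asterisk refers to the active dataset, so the aggregated results replace its contents.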
Example
GET FILE='/data/mydata.sav'.
DATASET NAME file1.
COMPUTE AvgIncome=income/famsize.
GET DATA /TYPE=XLS
/FILE='/data/exceldata.xls'.
COMPUTE TotIncome=SUM(income1, income2, income3).
DATASET NAME file2.
DATASET ACTIVATE file1.
- Reading a new data source automatically changes the active dataset; so the GET DATA command changes the active dataset to the data read from the Excel worksheet.
- Since the previous active dataset has a defined dataset name associated with it, it is preserved in its current state for subsequent use in the session. The "current state" includes the new variable AvgIncome generated by the COMPUTE command, since pending transformations are automatically executed before the Excel worksheet becomes the active dataset.
- When the dataset file1 is activated again, any pending transformations associated with dataset file2 are automatically executed; so the new variable TotIncome is preserved with the dataset.