TARGET_OPTIONS Subcommand (GENLINMIXED command)
The TARGET_OPTIONS
subcommand specifies the reference
category, distribution, and link function for the target.
- This subcommand is optional. If the
TARGET_OPTIONS
subcommand is not specified, thenGENLINMIXED
assigns a default distribution and link function based on the measurement level of the target. A continuous target hasNORMAL
distribution andIDENTITY
link. A categorical target hasMULTINOMIAL
distribution andLOGIT
link.
REFERENCE. For categorical targets, this is the target
value to use as the reference category for parameter estimation. No
model parameters are assigned to the reference category. Specify a
single value. Put the value inside a pair of quotes if it is formatted
(such as date or time) or if the target is of string storage type.
The REFERENCE
value must be found in the dataset.
If REFERENCE
is not specified, the last category
in ascending order is used as the reference.
- If the target has ordinal measurement level, the
REFERENCE
keyword is ignored.DISTRIBUTION Keyword
The
DISTRIBUTION
keyword specifies the probability distribution of the target.- The default probability distribution depends on the measurement
level of the target. If the target is continuous, the default distribution
is
NORMAL
. If the target is categorical, then the default isMULTINOMIAL
. If an events/trials specification is used, then the default distribution isBINOMIAL
. - Caution must be exercised when the
LINK
but not theDISTRIBUTION
keyword is used. The discussion of theLINK
keyword below gives details about proper and improper combinations ofDISTRIBUTION
andLINK
settings.
BINOMIAL. Binomial probability distribution. If the target is specified as a single field, then it can be numeric or string and it can have only two distinct valid data values.
If the events and trials options are specified, then the procedure automatically computes the ratio of the events field over the trials field or number. The events field—and the trials field if specified—must be numeric. Data values for the events field must be integers greater than or equal to zero. Data values for the trials field must be integers greater than zero. For each record, the trials value must be greater than or equal to the events value. If an events value is noninteger, less than zero, or missing, then the corresponding record is not used in the analysis. If a trials value is noninteger, less than or equal to zero, less than the events value, or missing, then the corresponding record is not used in the analysis.
If the trials option specifies a number, then it must be a positive integer, and it must be greater than or equal to the events value for each record. Records with invalid values are not used in the analysis.
This is the default probability distribution if the target is specified using events/trials format.
GAMMA. Gamma probability distribution. The target must be numeric, with data values greater than zero. If a data value is less than or equal to zero, or missing, then the corresponding record is not used in the analysis.
INVERSE_GAUSSIAN. Inverse Gaussian probability distribution. The target must be numeric, with data values greater than zero. If a data value is less than or equal to zero, or missing, then the corresponding record is not used in the analysis.
MULTINOMIAL. Multinomial probability distribution. The target can be numeric or string, and it must have at least two distinct, valid data values. This is the default probability distribution if the target is categorical. Note that the form of the model will depend on the measurement level of the target.
- A nominal target will result in a nominal multinomial model in which a separate set of model parameters are estimated for each category of the target (except the reference category). The parameter estimates for a given predictor show the relationship between that predictor and the likelihood of each category of the target, relative to the reference category.
- An ordinal target will result in an ordinal multinomial model in which the traditional intercept term is replaced with a set of threshold parameters that relate to the cumulative probability of the target categories.
NEGATIVE_BINOMIAL. Negative binomial probability distribution. The target must be numeric, with data values that are integers greater than or equal to zero. If a data value is noninteger, less than zero, or missing, then the corresponding record is not used in the analysis.
NORMAL. Normal probability distribution. The target must be numeric. This is the default probability distribution if the target is continuous.
POISSON. Poisson probability distribution. The target must be numeric, with data values that are integers greater than or equal to zero. If a data value is noninteger, less than zero, or missing, then the corresponding record is not used in the analysis.
LINK Keyword
The
LINK
keyword specifies the link function. The following link functions are available.IDENTITY. Identity link function. f(x)=x. This link function is valid with any distribution except the multinomial.
CAUCHIT. Cauchit link function. f(x) = tan(π (x − 0.5)). This is only valid with ordinal multinomial models.
CLOGLOG. Complementary log-log link function. f(x)=ln(−ln(1−x)). This valid with binomial and ordinal multinomial models.
LOG. Log link function. f(x)=ln(x). This link function is valid with any distribution except the multinomial.
LOGC. Log complement link function. f(x)=ln(1−x). This is only valid with the binomial distribution.
LOGIT. Logit link function. f(x)=ln(x / (1−x)). This is valid with the binomial and multinomial distributions.
NLOGLOG. Negative log-log link function. f(x)=−ln(−ln(x)). This valid with binomial and ordinal multinomial models.
POWER. Power link function. f(x)=x α, if α≠0. f(x)=ln(x), if α=0. α is a required number specified on the
LINKPARAMETER
keyword and must be a real number. If |α| < 2.2e-16, α is treated as 0. This link function is valid with any distribution except the multinomial.PROBIT. Probit link function. f(x)=Φ−1(x), where Φ−1 is the inverse standard normal cumulative distribution function. This valid with binomial and ordinal multinomial models.
- The default probability distribution depends on the measurement
level of the target. If the target is continuous, the default distribution
is
- The default setting for
LINK
depends on theDISTRIBUTION
setting, whetherDISTRIBUTION
is set explicitly or is the default, as shown in the following table.Table 1. Default link function for each distribution DISTRIBUTION Setting Default LINK Setting NORMAL IDENTITY BINOMIAL LOGIT GAMMA POWER(−1) INVERSE_GAUSSIAN POWER(−2) MULTINOMIAL LOGIT NEGATIVE_BINOMIAL LOG POISSON LOG - The procedure will fit a model if a permissible combination of
LINK
andDISTRIBUTION
specifications is given. The table below indicates the permissibleLINK
andDISTRIBUTION
combinations. Specifying an improper combination will yield an error message. - Note that the default setting for
DISTRIBUTION
is irrespective of theLINK
specification, and that not allLINK
specifications are valid for the defaultDISTRIBUTION
. Thus, ifLINK = CLOGLOG
is specified for a continuous target butDISTRIBUTION
is not, then the defaultDISTRIBUTION = NORMAL
yields an improper combination ofDISTRIBUTION
andLINK
settings.
Link | NORMAL | BI-NOMIAL | GAMMA | INVERSE_ GAUSSIAN | Nominal MULTI- NOMIAL | Ordinal MULTI- NOMIAL | NEGATIVE_ BINOMIAL | POISSON |
---|---|---|---|---|---|---|---|---|
IDENTITY | X | X | X | X | X | X | ||
CAUCHIT | X | |||||||
CLOGLOG | X | X | ||||||
LOG | X | X | X | X | X | X | ||
LOGC | X | |||||||
LOGIT | X | X | X | |||||
NLOGLOG | X | X | ||||||
PROBIT | X | X | ||||||
POWER | X | X | X | X | X | X |
LINKPARAMETER Keyword
The LINKPARAMETER
keyword specifies the value
of α for the POWER
link.
- Specify a number. There is no default.