# Summarize

This feature requires the Statistics Base option.

The Summarize procedure calculates subgroup statistics for variables
within categories of one or more grouping variables. All levels of
the grouping variable are crosstabulated. You can choose the order
in which the statistics are displayed. Summary statistics for each
variable across all categories are also displayed. Data values in
each category can be listed or suppressed. With large datasets, you
can choose to list only the first *n* cases.

**Example.** What is the average product sales amount by region
and customer industry? You might discover that the average sales amount
is slightly higher in the western region than in other regions, with
corporate customers in the western region yielding the highest average
sales amount.

**Statistics.** Sum, number of cases, mean, median, grouped
median, standard error of the mean, minimum, maximum, range, variable
value of the first category of the grouping variable, variable value
of the last category of the grouping variable, standard deviation,
variance, kurtosis, standard error of kurtosis, skewness, standard
error of skewness, percentage of total sum, percentage of total *N*,
percentage of sum in, percentage of *N* in, geometric mean, and
harmonic mean.

Summarize Data Considerations

**Data.** Grouping variables are categorical variables whose
values can be numeric or string. The number of categories should be
reasonably small. The other variables should be able to be ranked.

**Assumptions.** Some of the optional subgroup statistics,
such as the mean and standard deviation, are based on normal theory
and are appropriate for quantitative variables with symmetric distributions.
Robust statistics, such as the median and the range, are appropriate
for quantitative variables that may or may not meet the assumption
of normality.

To Obtain Case Summaries

This feature requires the Statistics Base option.

- From the menus choose:
- Select one or more variables.

Optionally, you can:

- Select one or more grouping variables to divide your data into subgroups.
- Click Options to change the output title, add a caption below the output, or exclude cases with missing values.
- Click Statistics for optional statistics.
- Select Display cases to list the cases
in each subgroup. By default, the system lists only the first 100
cases in your file. You can raise or lower the value for Limit
cases to first
*n*or deselect that item to list all cases.