Residual scatterplots

The plot of residuals by predicted values clearly shows the two most underperforming vehicles. Additionally, you can see that the Breeze and SW are quite close to the majority of cases. This suggests that the apparent underperformance of the Breeze and SW could be due to random chance. What is of greater concern in this plot are the clusters of cases far to the left of the general cluster of cases. While the vehicles in these clusters do not have large residuals, their distance from the general cluster may have given these cases undue influence in determining the regression coefficients.
- To check the residuals by factor score 1,
from the menus choose:
Figure 2. Chart Builder - In the Scatter/Dot gallery, select Simple Scatter.
- Select Standardized Residual as the y variable and REGR factor score 1 for analysis 1 as the x variable.
- Click the Groups/Point ID tab and select Point ID Label.
- Select Model as the variable to label cases by.
- Click OK.

The resulting scatterplot reveals that the unusual grouping of points noted in the residuals by predicted values scatterplot have large values for factor score 1; that is, they are high-priced vehicles. Since the distribution of prices is right-skewed, it might be a good idea to use log-transformed prices in future analyses. By recalling the Chart Builder, you can produce similar for the other factor scores.

The charts for factor scores 2 and 3 don't reveal anything interesting, but the residuals by factor score 5 reveal that the Metro may be an influential point because it has a much higher fuel efficiency than any other vehicle in the dataset and lies far outside the main cluster of points.

The residuals by factor score 5 chart reveals that the Viper may also be an influential point because it has an unusually large engine size and lies outside the main cluster of points.