Bulk regression analysis report

The bulk regression analysis report performs regression analysis on weather data for all active accounts that are included in the report selection range.

The regression analysis result and the reporting outcome are included in the CSV report that is sent by email. The following list describes the main columns in the report:
Result

The outcome of running the report:

  • To Be Applied: The regression result meets the R2 acceptance criteria and will be saved into the platform.
  • Not Applied: The regression result does not meet the R2 acceptance criteria and will not be saved.
  • Skipped: The item didn't go through the regression analysis process because it was deliberately excluded, for example, it already has an existing model and the report was run for creating new models only, or the report was run for using an existing location's base HDD and CDD values but they have not yet been configured in the system.
  • Invalid: An invalid regression result was returned for the item.

When running the report in Commit To Save mode, the To Be Applied items will be marked as Applied instead.

Active R2, Active_HDD_Base, Active_CDD_Base
The existing regression R2 result, Base HDD and Base CDD values that are associated with the account, if any. The R2 value is a setting of the account while Base HDD and Base CDD values are settings of its parent Location.
Model R2, Model_HDD_Base, Model_CDD_Base
The new proposed regression R2 result for the best fit Base HDD and Base CDD values that are associated with the account after going through the regression analysis.
Status

The result of the regression analysis:

  • Strong Model: R2 value is more than 0.75.
  • Weak Model: R2 value is between 0.5 and 0.75.
  • Invalid Model: R2 value is less than 0.5, or there is not sufficient data to perform the regression analysis.
Conflicting Location HDD_CDD Base Values

A warning message is displayed in this column if the accounts in the same location have different best fit Base HDD and Base CDD values to each other, or if any of the values is different from its location's existing Base HDD and Base CDD values.

If the report is run in Commit To Save mode, the Model_HDD_Base and Model_CDD_Base values that are associated with the account that has the highest R2 result among all accounts in the same location will be saved into the location settings, and it will be used for normalization reporting for all accounts in the same location.

Sample_Of_Data
Number of months in which consumption data is present and is used for the regression analysis.
Message

Some additional information about the regression result, such as number of nonworking days, unit of temperature, linked weather station and so on. A preliminary assessment of the HDD and CDD t-statistic result will also be displayed in this column if applicable.

The same message is displayed if you run the same regression analysis by using the dashboard.