Selecting Data

Based upon the initial data collection conducted in the previous CRISP-DM phase, you are ready to begin selecting the data relevant to your data mining goals. Generally, there are two ways to select data:

  • Selecting items (rows) involves making decisions such as which accounts, products, or customers to include.
  • Selecting attributes or characteristics (columns) involves making decisions about the use of characteristics such as transaction amount or household income.