History

The tab shows the historical data operations. It consists of three sections:

  • Basic information on the dataset: its type, path, the time when it was synced for the first and last time, and the number of rows.
  • The filters with which you can narrow down the displayed operations according to:
    • Selected time frame
    • Operation type (read or write)
    • Pipeline
    • Issue type reported for the operation (for example, failed operation or schema changes)
  • The list of operations performed on the dataset. It gives you the following information:
    Status

    The operation status in the form of an icon. Hover your cursor over the icon to display the status.

    Type

    The operation type (read or write).

    Issue type

    The issue type reported for the operation (if any), for example failed operations or schema changes.

    Schema

    The detailed data schema (if any). Click the link to display more data, for example the dataset size, the number of records, or the names of the columns.

    Records

    The number of records that are read or written by the operation.

    Origin

    The origin of the operation metadata. This can be a specific pipeline run, a query ID from your data warehouse, or both. When the origin is a pipeline run monitored by Databand, clicking the link will take you to that run's page where you can view more details.

    Time

    The operation start time.

Previewing the data and schema

If a dataframe operation was logged through the Python or Java SDK, you might have the options to preview the dataframe's data and column statistics (if the preview is enabled - for more information go to Datasets).

To see these previews:

  1. Click the More options icon More options.
  2. Select the relevant option.

You can also display the schema by clicking the link in the Schema column.