Create an image dataset for vector output

Create a dataset from images for vector output.

Procedure

  1. From the cluster management console, select Workload > Spark > Deep Learning.
  2. Select the Datasets tab.
  3. Click New.
  4. Choose to create the dataset from Images for Vector Output.
  5. Provide a dataset name.
  6. Specify a Spark instance group.
  7. Choose one of the following options to specify how the training images are selected.
    • Specify percentages of training images. For this option, provide the following.
      1. Provide the percentage of images for validation.
      2. Provide the percentage of images for testing.
    • Specify folder locations. For this option, provide the following.
      1. Provide a validation folder.
      2. Provide a testing folder.
  8. Specify a TensorFlow model graph.
    The graph must be a .pb file.
  9. Specify the data input tensor name.
  10. Specify the vector output tensor name.
  11. Click Create.
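
The percentage option in step 7 can be pictured with a short Python sketch. This is only an illustration of the idea, not the product's actual selection logic, which is not described here. The function name split_by_percentage, its parameters, the fixed seed, and the rounding behavior are all assumptions made for the example: the validation and testing percentages are taken from the full image set, and whatever remains is used for training.

```python
import random


def split_by_percentage(image_paths, validation_pct, testing_pct, seed=0):
    """Split image paths into training, validation, and testing lists.

    Hypothetical helper for illustration only; the product may select
    images differently.
    """
    if validation_pct < 0 or testing_pct < 0:
        raise ValueError("percentages must be non-negative")
    if validation_pct + testing_pct >= 100:
        raise ValueError("validation and testing must leave images for training")

    # Sort first so the shuffle is reproducible for a given seed.
    paths = sorted(image_paths)
    random.Random(seed).shuffle(paths)

    # Round down; any leftover images go to the training set.
    n_val = int(len(paths) * validation_pct / 100)
    n_test = int(len(paths) * testing_pct / 100)

    validation = paths[:n_val]
    testing = paths[n_val:n_val + n_test]
    training = paths[n_val + n_test:]
    return training, validation, testing


if __name__ == "__main__":
    images = [f"img_{i:03d}.jpg" for i in range(100)]
    train, val, test = split_by_percentage(images, validation_pct=20, testing_pct=10)
    print(len(train), len(val), len(test))  # prints: 70 20 10
```

With 100 images, 20 percent for validation, and 10 percent for testing, the sketch leaves 70 images for training. The folder option skips this step because the validation and testing images are already separated on disk.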

Results

Dataset creation is complete when the dataset is in the Created state. If creation fails, check the driver and executor logs in the Spark Applications tab.

What to do next

To view details about the dataset, click the dataset name. To use the dataset in a training run, either create a training model or start a training run.