Create a TensorFlow record dataset
Create a dataset from TensorFlow records (TFRecords). A TFRecord dataset can be used to train TensorFlow models.
Procedure
- From the cluster management console, select .
- Select the Datasets tab.
- Click New.
- Create a dataset from TensorFlow Records.
- Provide a dataset name.
- Specify a Spark instance group.
- Provide a training folder. The full absolute path to the training folder must be provided.The folder must contain an TFRecord file.
- Provide a validation folder. The full absolute path to the validation folder can be provided. To use this dataset for validation, you must specify a validation folder. Otherwise, this dataset cannot be used to validate a training model.The folder must contain an TFRecord file.
- Provide a testing folder. The full absolute path to the training folder must be provided.The folder must contain an TFRecord file.
- Provide a label file.
- Click Create.
Results
The dataset is created once it is in Created state. If creation failed, see the driver and executor logs in the Spark Applications tab.
What to do next
To view details about the dataset, click the dataset name. To use the dataset in a training run, either create a training model or start a training run.