This article is reproduced from stackoverflow.com.
tensorflow2.0 kubeflow kubeflow-pipelines

Error occurred when finalizing GeneratorDataset iterator: Cancelled: Operation was cancelled

Published on 2020-03-27 15:43:43

When I run a Kubeflow pipeline whose code uses TensorFlow 2.0, the following warning is displayed at the end of each epoch:

W tensorflow/core/kernels/data/generator_dataset_op.cc:103] Error occurred when finalizing GeneratorDataset iterator: Cancelled: Operation was cancelled

Also, after some epochs, the step stops printing logs and fails with this error:

This step is in Failed state with this message: The node was low on resource: memory. Container main was using 100213872Ki, which exceeds its request of 0. Container wait was using 25056Ki, which exceeds its request of 0.

Questioner: Radhi
Viewed: 1.3k
Radhi 2020-02-25 13:20

This was due to incompatible CUDA and TensorFlow versions. The following versions work well with each other:

tensorflow-gpu==2.0.0

tensorflow-addons==0.6.0

nvidia/cuda:10.0-cudnn7-runtime
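
To confirm that the container image actually has the pinned Python packages, a small check like the one below can run at the start of the pipeline step. This is only a sketch: the names `PINNED`, `find_mismatches` and `installed_versions` are hypothetical helpers, not part of TensorFlow or Kubeflow, and the check covers only the two pip packages listed above. The CUDA/cuDNN version comes from the base image and is not verified here.

```python
# Pinned versions taken from the answer above.
# All names in this snippet are hypothetical helpers for illustration.
PINNED = {"tensorflow-gpu": "2.0.0", "tensorflow-addons": "0.6.0"}


def find_mismatches(installed, pinned=PINNED):
    """Return {package: (expected, found)} for every pin that is not met."""
    mismatches = {}
    for name, expected in pinned.items():
        found = installed.get(name)  # None when the package is missing
        if found != expected:
            mismatches[name] = (expected, found)
    return mismatches


def installed_versions(names):
    """Look up the installed version of each package, or None if absent."""
    try:
        # Standard library on Python 3.8+
        from importlib.metadata import version, PackageNotFoundError
    except ImportError:
        # Assumption: the importlib_metadata backport is installed on older Pythons
        from importlib_metadata import version, PackageNotFoundError
    result = {}
    for name in names:
        try:
            result[name] = version(name)
        except PackageNotFoundError:
            result[name] = None
    return result


if __name__ == "__main__":
    problems = find_mismatches(installed_versions(PINNED))
    for name, (expected, found) in problems.items():
        print("{}: expected {}, found {}".format(name, expected, found))
```

If the script prints nothing, both packages match the pins. Any printed line names a package whose version differs from the combination reported to work.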