Tensorflow dataset.shuffle seems not shuffle without repeat()

nessuno 2019-07-03 23:13

Yes, you should call .shuffle during the inner loop. Moreover, it is better to do not mix python code and TensorFlow code when pure tf.* method equivalent to the Python statements are available.

import tensorflow as tf

dataset = tf.data.Dataset.from_tensor_slices(["a", "b", "c", "d"])
# dataset = dataset.shuffle(2)


@tf.function
def loop():
    for epoch in tf.range(10):
        for d in dataset.shuffle(2):
            tf.print(d)


loop()

The loop call produces the different values every time (and tf.print prints the content of the tf.Tensor, differently from print that prints the object).

Related issues

Set values in row to zero before index value of row [NumPy or Tensorflow]

My deep learning model is not training. How do I make it train?

Can YOLO pictures have a bounded box that covering the whole picture?

ValueError: Shapes () and (150, 5) are incompatible Tenosrflow

Save histogram during evaluation with estimator api

Temporarily merge the batch dimension in Keras

TensorFlow: Convert GRUCell weights from compat.v1 to tensorflow 2

How can I customize the gradient computation at training time in keras?

TensorFlow layer that converts a 2D matrix to a vector of certain length

ValueError while installing and running Tensorflow