Helper function to transform the data type in the data set.
Helper function to transform the data type in the data set.
Trigger the computation of this dataset and cache it in memory.
Trigger the computation of this dataset and cache it in memory.
Get a sequence of data
Get a sequence of data
if the data is used in train. If yes, the data sequence is a looped endless sequence, or it has a limited length.
data sequence
Check if rdd is cached.
Check if rdd is cached.
Get the 'origin' RDD of the dataset.
Get the 'origin' RDD of the dataset.
Change the order of the data sequence from the data set
Change the order of the data sequence from the data set
Total size of the data set
Convert current DataSet to a distributed DataSet, in which we use a RDD to represent the data sequence.
Convert current DataSet to a distributed DataSet, in which we use a RDD to represent the data sequence.
Convert current DataSet to a local DataSet, in which we use an iterator to represent the data sequence.
Convert current DataSet to a local DataSet, in which we use an iterator to represent the data sequence.
Helper function to transform the data type in the data set.
Helper function to transform the data type in the data set.
Unpersist rdd.
Unpersist rdd.
Wrap a RDD as a DataSet.