Get a sequence of data
Get a sequence of data
if the data is used in train. If yes, the data sequence is a looped endless sequence, or it has a limited length.
data sequence
Get the 'origin' RDD of the dataset.
Get the 'origin' RDD of the dataset.
Change the order of the data sequence from the data set
Change the order of the data sequence from the data set
Total size of the data set
Helper function to transform the data type in the data set.
Helper function to transform the data type in the data set.
Trigger the computation of this dataset and cache it in memory.
Check if rdd is cached.
Convert current DataSet to a distributed DataSet, in which we use a RDD to represent the data sequence.
Convert current DataSet to a distributed DataSet, in which we use a RDD to represent the data sequence.
Convert current DataSet to a local DataSet, in which we use an iterator to represent the data sequence.
Convert current DataSet to a local DataSet, in which we use an iterator to represent the data sequence.
Helper function to transform the data type in the data set.
Helper function to transform the data type in the data set.
Unpersist rdd.
Represent a distributed data. Use RDD to go through all data.