Get a sequence of data
Whether the data is used for training. If true, the data sequence is an endless looped sequence; otherwise it has a limited length.
data sequence
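The difference between the two modes can be sketched as follows. This is a minimal Python stand-in, not the library's API: the function name `data` only mirrors the data() method described here, and `samples` is a hypothetical in-memory collection.

```python
from itertools import cycle, islice

def data(samples, train):
    """Return an iterator over samples (hypothetical stand-in for data()).

    train=True  -> endless looped sequence
    train=False -> limited-length sequence (one pass)
    """
    return cycle(samples) if train else iter(samples)

samples = ["a", "b", "c"]

# A training sequence never ends, so the caller decides how much to consume.
train_head = list(islice(data(samples, train=True), 7))

# A validation sequence stops after a single pass over the data.
valid_all = list(data(samples, train=False))
```

Because the training sequence is endless, a consumer must bound it explicitly (here with `islice`), while the validation sequence can be exhausted safely.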
Change the order of the data sequence produced by the data set
Total size of the data set
Helper function to transform the data type in the data set.
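One way to picture such a transform helper is a method that maps every element to a new type and returns a new data set. The sketch below is an assumption-laden illustration in Python: `SketchDataSet`, its `transform` signature, and the eager mapping are all hypothetical and do not describe how the library implements it.

```python
from dataclasses import dataclass
from typing import Callable, Generic, Iterator, List, TypeVar

D = TypeVar("D")  # element type before the transform
C = TypeVar("C")  # element type after the transform

@dataclass
class SketchDataSet(Generic[D]):
    """Hypothetical in-memory data set used only for illustration."""
    items: List[D]

    def data(self) -> Iterator[D]:
        return iter(self.items)

    def size(self) -> int:
        return len(self.items)

    def transform(self, fn: Callable[[D], C]) -> "SketchDataSet[C]":
        # Eagerly map each element; the element type changes from D to C.
        return SketchDataSet([fn(x) for x in self.items])

raw = SketchDataSet(["1", "2", "3"])   # data set of strings
ints = raw.transform(int)              # data set of integers
```

The transform changes the element type but leaves the size of the data set unchanged in this sketch.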
Convert the current DataSet to a distributed DataSet, in which an RDD represents the data sequence.
Convert the current DataSet to a local DataSet, in which an iterator represents the data sequence.
A set of data used in the model optimization process. The data set can be accessed as a randomly sampled sequence of data. During training, the data sequence is an endless looped sequence; during validation, it is a sequence of limited length. Use the data() method to get the data sequence.
The order of the data sequence is not fixed. It can be changed with the shuffle() method.
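The effect of shuffling on the sequence order can be sketched like this. The class `ShuffleSketch` and its `seed` parameter are hypothetical and exist only to make the example reproducible; only the method names shuffle() and data() come from the description above.

```python
import random

class ShuffleSketch:
    """Hypothetical in-memory data set illustrating shuffle()."""

    def __init__(self, items, seed=0):
        self._items = list(items)
        self._rng = random.Random(seed)  # seeded only for reproducibility

    def shuffle(self):
        # Reorder the elements in place; the contents stay the same.
        self._rng.shuffle(self._items)

    def data(self):
        return iter(self._items)

    def size(self):
        return len(self._items)

ds = ShuffleSketch(range(10), seed=42)
before = list(ds.data())
ds.shuffle()
after = list(ds.data())
```

Shuffling changes only the order in which data() yields elements: `after` is a permutation of `before`, and the size is unchanged.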
A data set can be created from an RDD, an array, a folder, and so on. The DataSet object provides many factory methods for this.
Data type
Represents a sequence of data