com.intel.analytics.bigdl.dataset.text
if oneHot = true: Transform labeled sentences to one-hot format samples e.g. sentence._data: [0, 2, 3] sentence._label: [2, 3, 1] vocabLength: 4
else: The model will use LookupTable for word embedding.
length of dictionary
optional parameter for fixed length of input data
optional parameter for fixed length of labels
Apply this transformer to rdd
if oneHot = true: Transform labeled sentences to one-hot format samples e.g. sentence._data: [0, 2, 3] sentence._label: [2, 3, 1] vocabLength: 4
> input: 0, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1 target: [3, 4, 2]
else: The model will use LookupTable for word embedding.
> input: [1, 2, 3]
> label: [2, 3, 4] The input is an iterator of LabeledSentence class The output is an iterator of Sample class