com.intel.analytics.bigdl.dataset

package text

Type Members

  1. class Dictionary extends Serializable

    Class that helps build a dictionary, either from tokenized text or from a saved dictionary

  2. class LabeledSentence[T] extends Sentence[T]

    Represents a labeled sentence

  3. class LabeledSentenceToSample[T] extends Transformer[LabeledSentence[T], Sample[T]]

    If oneHot = true, transforms labeled sentences into one-hot format samples.

  4. class SentenceBiPadding extends Transformer[String, String]

    Pads a sentence with start and end tokens: x => ["start", x, "end"]

  5. class SentenceSplitter extends Transformer[String, Array[String]]

    Takes an input string and splits it into sentences.

  6. class SentenceTokenizer extends Transformer[String, Array[String]]

    Transformer that tokenizes a Document (article) into a Seq[Seq[String]]

  7. class TextToLabeledSentence[T] extends Transformer[Array[String], LabeledSentence[T]]

    Transforms a tokenized sentence (an array of strings) into a LabeledSentence.
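
The type members above suggest a text preprocessing chain: pad, tokenize, index through a dictionary, label, and optionally one-hot encode. The following is a minimal plain-Scala sketch of that idea only. It does not call any BigDL class; every name in it (TextPipelineSketch, biPad, tokenize, buildDictionary, oneHot, the marker strings) is hypothetical, and the next-word labeling scheme is an assumption rather than the documented behavior of TextToLabeledSentence.

```scala
// Plain-Scala sketch of the concepts listed above; none of these helpers are BigDL APIs.
object TextPipelineSketch {
  // Hypothetical marker tokens; the strings used by the real library may differ.
  val Start = "SENTENCE_START"
  val End = "SENTENCE_END"

  // Bi-padding: x => [start, x, end]
  def biPad(sentence: String): String = s"$Start $sentence $End"

  // Naive whitespace tokenizer (assumption; a real tokenizer also handles punctuation).
  def tokenize(sentence: String): Array[String] =
    sentence.split("\\s+").filter(_.nonEmpty)

  // Word -> index map built from tokenized text, indexed in order of first appearance.
  def buildDictionary(sentences: Seq[Array[String]]): Map[String, Int] =
    sentences.flatMap(_.toSeq).distinct.zipWithIndex.toMap

  // One-hot vector of length `size` with a single 1 at position `index`.
  def oneHot(index: Int, size: Int): Array[Float] = {
    val v = new Array[Float](size)
    v(index) = 1.0f
    v
  }

  def main(args: Array[String]): Unit = {
    val tokens = tokenize(biPad("the cat sat"))
    val dict = buildDictionary(Seq(tokens))
    val ids = tokens.map(dict)
    // Assumed next-word labeling: data drops the last token, label drops the first.
    val data = ids.init
    val label = ids.tail
    println(s"data:  ${data.mkString(",")}")
    println(s"label: ${label.mkString(",")}")
    println(s"one-hot of data(1): ${oneHot(data(1), dict.size).mkString(",")}")
  }
}
```

In this sketch each stage is a pure function, which loosely mirrors how the Transformer classes above each map one input type to one output type.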

Value Members

  1. object Dictionary extends Serializable

  2. object LabeledSentenceToSample extends Serializable

  3. object SentenceBiPadding extends Serializable

  4. object SentenceSplitter extends Serializable

  5. object SentenceTokenizer extends Serializable

  6. object TextToLabeledSentence extends Serializable

  7. package utils