Class that help build a dictionary either from tokenized text or from saved dictionary
Represent a sentence
Transform labeled sentences to one-hot format samples e.
x => ["start", x, "end"]
Input a sequence of string, cut it into sentences.
Transformer that tokenizes a Document (article) into a Seq[Seq[String]]
Transform a string of sentence to LabeledSentence.