com.intel.analytics.bigdl.dataset.text
Transformer that tokenizes a Document (article) into a Seq[Seq[String]]
Apply this transformer to rdd
Transformer that tokenizes a Document (article) into a Seq[Seq[String]]