R/topic_modeling_core.R
Dtm2Lexicon.RdRepresents a document term matrix as a list.
Dtm2Lexicon(dtm, ...)A document term matrix (or term co-occurrence matrix) of class
dgCMatrix.
Other arguments to be passed to TmParallelApply.
Returns a list. Each element of the list represents a row of the input matrix. Each list element contains a numeric vector with as many entries as tokens in the original document. The entries are the column index for that token, minus 1.
if (FALSE) {
# Load pre-formatted data for use
data(nih_sample_dtm)
result <- Dtm2Lexicon(dtm = nih_sample_dtm,
cpus = 2)
}