Represents a document term matrix as a list.

Dtm2Lexicon(dtm, ...)

Arguments

dtm

A document term matrix (or term co-occurrence matrix) of class dgCMatrix.

...

Other arguments to be passed to TmParallelApply.

Value

Returns a list. Each element of the list represents a row of the input matrix. Each list element contains a numeric vector with as many entries as tokens in the original document. The entries are the column index for that token, minus 1.

Examples

# NOT RUN {
# Load pre-formatted data for use
data(nih_sample_dtm)

result <- Dtm2Lexicon(dtm = nih_sample_dtm,
                      cpus = 2)
# }