Counting tokens – the document-term matrix