Chapter 3 Analyzing word and document frequency: tf-idf