3.1 What are stop-words

  • Previous chapter discuss tokenization.

  • It turns out not all words carry the same amount of information.

  • So, we need to pre-process our data and remove those words with little or no information

The process of converting data to something a computer can understand is referred to as pre-processing. One of the major forms of pre-processing is to filter out useless data. In natural language processing, useless words (data), are referred to as stop words.