Stop words

Stop words are common words that are filtered out before or after natural language text is processed.
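
As an illustration of the idea only, the following minimal Python sketch filters stop words out of a piece of text. The function name, the whitespace tokenization, and the sample word set are assumptions made for this example; they do not describe how the search engine itself is implemented.

```python
def remove_stop_words(text, stop_words):
    """Return the lowercase tokens of `text` that are not stop words."""
    # Assumption: simple whitespace tokenization, for illustration only.
    tokens = text.lower().split()
    return [token for token in tokens if token not in stop_words]


# Hypothetical sample set; real lists live in the <language>_stopwords.txt files.
sample_stop_words = {"the", "a", "of", "in"}
print(remove_stop_words("The history of search in a nutshell", sample_stop_words))
# prints ['history', 'search', 'nutshell']
```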

Each supported language has a corresponding stop words file, <language>_stopwords.txt, in the search config folder:

  • en_stopwords.txt
  • fr_stopwords.txt
  • pt_stopwords.txt
  • de_stopwords.txt

These files contain the stop words used for each language during indexing. To customize a file, add words that should not be indexed, or remove words that should be indexed. Keep to the following file format:

  • One word per line.
  • Lowercase words.
  • Words must be UTF-8 encoded.
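
Before deploying an edited file, it may help to check it against the format rules above. The following is a minimal sketch, not a tool shipped with the product: the function name check_stop_words_file is hypothetical, and skipping blank lines is an assumption, since the rules above do not say how blank lines are handled.

```python
from pathlib import Path


def check_stop_words_file(path):
    """Return a list of (line_number, problem) tuples for a stop words file."""
    try:
        # Strict decoding raises UnicodeDecodeError if the file is not UTF-8.
        text = Path(path).read_text(encoding="utf-8")
    except UnicodeDecodeError:
        return [(0, "file is not valid UTF-8")]

    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        word = line.strip()
        if not word:
            continue  # Assumption: blank lines are ignored.
        if len(word.split()) > 1:
            problems.append((number, "more than one word on the line"))
        if word != word.lower():
            problems.append((number, "word is not lowercase"))
    return problems
```

Calling check_stop_words_file("en_stopwords.txt") on a well-formed file should return an empty list; otherwise each tuple names the offending line number and the rule it breaks.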

Tip: Stop words will be added to the search log.
