Stemming Support

The current stemming support is based on the Snowball stemmer library, which supports most European languages, as well as Arabic and other. We hope to include more languages soon (if you need a specicif langauge support, please open an issue).

For further details see the .

  • arabic
  • danish
  • dutch
  • finnish
  • french
  • german
  • hungarian
  • italian
  • norwegian
  • romanian
  • russian
  • spanish
  • swedish
  • tamil
  • turkish

Chinese support

Indexing a Chinese document is different than indexing a document in most other languages because of how tokens are extracted. While most languages can have their tokens distinguished by separation characters and whitespace, this is not common in Chinese.

Chinese tokenization is done by scanning the input text and checking every character or sequence of characters against a dictionary of predefined terms and determining the most likely (based on the surrounding terms and characters) match.

Using custom dictionaries

If you wish to use a custom dictionary, you can do so at the module level when loading the module. The setting can point to the location of a file which contains the relevant settings and paths to the dictionary files.

Note that there is no “default” friso.ini file location. Redis Search comes with its own and dictionary files which are compiled into the module binary at build-time.