Tokenizers

The tokenizers module gathers the library’s various tokenizers.

Tokenizers are algorithms that take raw text and split it into tokens.

Splitting a text into a list of sentences, or a sentence into a list of words, are both examples of what a tokenizer does.
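To make the two cases concrete, here is a minimal sketch of a sentence tokenizer and a word tokenizer. The function names `naiveSentences` and `naiveWords` are hypothetical and are not part of Talisman; they rely on simple regular expressions and will mishandle abbreviations, decimal numbers, and similar cases that real tokenizers are designed to handle.

```javascript
// Illustration only: these helpers are hypothetical, not Talisman's API.

// Split raw text into sentences: each sentence is a run of characters
// that are not '.', '!' or '?', followed by any closing punctuation.
function naiveSentences(text) {
  const matches = text.match(/[^.!?]+[.!?]*/g) || [];

  return matches
    .map(sentence => sentence.trim())
    .filter(sentence => sentence.length > 0);
}

// Split a sentence into words: runs of letters, digits, or apostrophes.
function naiveWords(sentence) {
  return sentence.match(/[\p{L}\p{N}']+/gu) || [];
}

const sentences = naiveSentences('Hello world! How are you?');
// sentences is ['Hello world!', 'How are you?']

const words = sentences.map(naiveWords);
// words is [['Hello', 'world'], ['How', 'are', 'you']]
```

Both functions have the same shape: a string goes in and an array of tokens comes out. Only the definition of a token changes.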

Summary

Modules under the talisman/tokenizers namespace: