Class Token

Constructors

Properties

byteEnd: number

Index of the last byte of the original lemma.

byteLen: number

The length in bytes of the normalized lemma.

byteStart: number

Index of the first byte of the original lemma.

charCount: number

The count of characters of the normalized lemma.

charEnd: number

Index of the last character of the original lemma.

charStart: number

Index of the first character of the original lemma.

isSeparator: boolean

Whether the token is a separator.

isStopWord: boolean

Whether the token is a stop word.

isWord: boolean

Whether the token is a word. A token is considered as a word if it's not a separator nor a stop word.

kind: TokenKind

Kind of the Token assigned by the classifier.

language: Language

Language of the Token.

lemma: string

The normalized lemma.

originalByteLen: number

The length in bytes of the original lemma.

originalCharCount: number

The count of characters of the original lemma.

script: Script

Script of the Token.

separatorKind: SeparatorKind

The kind of the separator if the token is a separator.

Methods

  • Returns void