Uses of Package com.yahoo.language.process (linguistics 8.126.20 API)

Packages that use com.yahoo.language.process

Package

Description

com.yahoo.language

com.yahoo.language.process

com.yahoo.language.simple

Classes in com.yahoo.language.process used by com.yahoo.language

Class

Description

CharacterClasses

Determines the class of a given character.

GramSplitter

A class which splits consecutive word character sequences into overlapping character n-grams.

Normalizer

This interface provides NFKC normalization of Strings through the underlying linguistics library.

Segmenter

Interface providing segmentation.

Stemmer

Interface providing stemming of single words.

Tokenizer

Language-sensitive tokenization of a text string.

Transformer

Interface for providers of text transformations such as accent removal.
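
The GramSplitter description above (overlapping character n-grams over consecutive word character sequences) can be illustrated with a small standalone sketch. It is not the library's implementation and uses none of its classes: NGramSketch and its nested Gram are hypothetical names, "word character" is assumed to mean Character.isLetterOrDigit, and emitting a word shorter than n as a single shorter gram is an assumption of this sketch.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative sketch only; this is not the GramSplitter implementation. */
final class NGramSketch {

    /** A start index and length pair into the input string (hypothetical stand-in). */
    static final class Gram {
        final int start;
        final int length;

        Gram(int start, int length) {
            this.start = start;
            this.length = length;
        }

        String extractFrom(String input) {
            return input.substring(start, start + length);
        }
    }

    /** Splits each run of word characters into overlapping grams of size n. */
    static List<Gram> split(String input, int n) {
        if (n < 1) throw new IllegalArgumentException("n must be at least 1");
        List<Gram> grams = new ArrayList<>();
        int i = 0;
        while (i < input.length()) {
            // Assumption: a "word character" is a letter or a digit.
            if (!Character.isLetterOrDigit(input.charAt(i))) {
                i++;
                continue;
            }
            int wordStart = i;
            while (i < input.length() && Character.isLetterOrDigit(input.charAt(i))) i++;
            int wordLength = i - wordStart;
            if (wordLength <= n) {
                // Assumption: a short word becomes one gram covering the whole word.
                grams.add(new Gram(wordStart, wordLength));
            } else {
                // Slide a window of size n one character at a time.
                for (int s = wordStart; s + n <= i; s++) grams.add(new Gram(s, n));
            }
        }
        return grams;
    }

    /** Convenience helper that materializes each gram as a string. */
    static List<String> splitToStrings(String input, int n) {
        List<String> result = new ArrayList<>();
        for (Gram gram : split(input, n)) result.add(gram.extractFrom(input));
        return result;
    }

    public static void main(String[] args) {
        System.out.println(splitToStrings("hello world", 3));
    }
}
```

Representing each gram as a start index and length, rather than a copied substring, matches the GramSplitter.Gram description on this page and avoids allocating strings until they are needed. The sketch indexes by UTF-16 char, so characters outside the Basic Multilingual Plane would need code point handling.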

Classes in com.yahoo.language.process used by com.yahoo.language.process

Class

Description

CharacterClasses

Determines the class of a given character.

Embedder

An embedder converts a text string to a tensor.

Embedder.Context

GramSplitter.Gram

An immutable start index and length pair.

GramSplitter.GramSplitterIterator

Segmenter

Interface providing segmentation.

SpecialTokens

An immutable list of special tokens - strings which should override the normal tokenizer semantics and be tokenized into a single token.

SpecialTokens.Token

An immutable special token.

StemList

A list of strings which does not allow for duplicate elements.

Stemmer

Interface providing stemming of single words.

StemMode

An enum of the stemming modes which can be requested.

Token

A single token produced by the tokenizer.

Tokenizer

Language-sensitive tokenization of a text string.

TokenScript

List of token scripts.

TokenType

An enumeration of token types.
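
As a rough illustration of the StemList description above (a list of strings which does not allow duplicate elements), the following standalone sketch builds such a list on java.util.AbstractList. UniqueStringList is a hypothetical name, and silently ignoring a duplicate on insertion is an assumption of this sketch, not a statement about how StemList behaves.

```java
import java.util.AbstractList;
import java.util.ArrayList;
import java.util.List;

/** Illustrative sketch only; this is not the StemList implementation. */
final class UniqueStringList extends AbstractList<String> {

    private final List<String> items = new ArrayList<>();

    @Override
    public String get(int index) {
        return items.get(index);
    }

    @Override
    public int size() {
        return items.size();
    }

    /** Appends the element unless it is already present; returns whether the list changed. */
    @Override
    public boolean add(String element) {
        if (items.contains(element)) return false;
        items.add(element);
        return true;
    }

    /** Inserts the element at the given index unless it is already present (assumption). */
    @Override
    public void add(int index, String element) {
        if (!items.contains(element)) items.add(index, element);
    }

    public static void main(String[] args) {
        UniqueStringList stems = new UniqueStringList();
        stems.add("run");
        stems.add("running");
        stems.add("run"); // duplicate, ignored
        System.out.println(stems);
    }
}
```

A list is used instead of a set here so that insertion order and positional access are preserved. The contains check is linear, which is acceptable for the handful of stems a single word typically yields.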

Classes in com.yahoo.language.process used by com.yahoo.language.simple

Class

Description

CharacterClasses

Determines the class of a given character.

GramSplitter

A class which splits consecutive word character sequences into overlapping character n-grams.

Normalizer

This interface provides NFKC normalization of Strings through the underlying linguistics library.

Segmenter

Interface providing segmentation.

SpecialTokenRegistry

Immutable named lists of "special tokens" - strings which should override the normal tokenizer semantics and be tokenized into a single token.

Stemmer

Interface providing stemming of single words.

StemMode

An enum of the stemming modes which can be requested.

Token

A single token produced by the tokenizer.

Tokenizer

Language-sensitive tokenization of a text string.

TokenScript

List of token scripts.

TokenType

An enumeration of token types.

Transformer

Interface for providers of text transformations such as accent removal.
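
The Normalizer and Transformer descriptions above mention NFKC normalization and accent removal. Both can be sketched with the JDK's own java.text.Normalizer, independently of this package. TextTransformSketch and its method names are hypothetical, and stripping combining marks after NFD decomposition is one common technique for accent removal, assumed here for illustration rather than taken from the library.

```java
import java.text.Normalizer;

/** Illustrative sketch only; uses the JDK, not the linguistics library. */
final class TextTransformSketch {

    /** Applies Unicode NFKC normalization (compatibility decomposition, then composition). */
    static String nfkc(String input) {
        return Normalizer.normalize(input, Normalizer.Form.NFKC);
    }

    /** Removes accents by decomposing to NFD and dropping combining marks. */
    static String removeAccents(String input) {
        String decomposed = Normalizer.normalize(input, Normalizer.Form.NFD);
        // \p{M} matches Unicode combining marks, such as the acute accent in a decomposed e-acute.
        return decomposed.replaceAll("\\p{M}+", "");
    }

    public static void main(String[] args) {
        // U+FB01 is the single-character "fi" ligature; NFKC expands it to two letters.
        System.out.println(nfkc("\uFB01nd"));
        System.out.println(removeAccents("caf\u00E9 na\u00EFve"));
    }
}
```

NFKC folds compatibility variants (ligatures, fullwidth forms) into their plain equivalents, which is useful before indexing or matching. It does not remove accents, which is why the second method is separate.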
