A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 
All Classes All Packages

A

ABKHAZIAN - com.yahoo.language.Language
Language tag "ab".
AbstractDetector - Class in com.yahoo.language.detect
 
AbstractDetector() - Constructor for class com.yahoo.language.detect.AbstractDetector
 
accentDrop(String, Language) - Method in interface com.yahoo.language.process.Transformer
Remove accents from input text.
add(int, String) - Method in class com.yahoo.language.process.StemList
 
AFAR - com.yahoo.language.Language
Language tag "aa".
AFRIKAANS - com.yahoo.language.Language
Language tag "af".
ALBANIAN - com.yahoo.language.Language
Language tag "sq".
ALL - com.yahoo.language.process.StemMode
 
ALPHABETIC - com.yahoo.language.process.TokenType
 
AMHARIC - com.yahoo.language.Language
Language tag "am".
ARABIC - com.yahoo.language.Language
Language tag "ar".
ARABIC - com.yahoo.language.process.TokenScript
 
ARMENIAN - com.yahoo.language.Language
Language tag "hy".
ARMENIAN - com.yahoo.language.process.TokenScript
 
ASCII - com.yahoo.language.process.TokenScript
 
ASSAMESE - com.yahoo.language.Language
Language tag "as".
AYMARA - com.yahoo.language.Language
Language tag "ay".
AZERBAIJANI - com.yahoo.language.Language
Language tag "az".
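The accentDrop(String, Language) entry in this section only says that accents are removed from the input text. As a rough illustration of what that can mean, the sketch below uses nothing but the JDK: it decomposes the text to NFD and strips the combining marks. This is an assumption about one common technique, not the Transformer implementation; the class name AccentDropSketch and the method dropAccents are hypothetical, and the Language argument is left out.

```java
import java.text.Normalizer;

final class AccentDropSketch {

    // Hypothetical helper, not part of com.yahoo.language.
    // Decomposes each character to base letter + combining marks (NFD),
    // then removes every character in the Unicode "Mark" category.
    static String dropAccents(String input) {
        String decomposed = Normalizer.normalize(input, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}+", "");
    }

    public static void main(String[] args) {
        // The escapes spell "creme brulee" with its three accented letters.
        System.out.println(dropAccents("cr\u00e8me br\u00fbl\u00e9e")); // creme brulee
    }
}
```

Letters that have no canonical decomposition, such as the Danish "ø", pass through this sketch unchanged.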

B

BASHKIR - com.yahoo.language.Language
Language tag "ba".
BASQUE - com.yahoo.language.Language
Language tag "eu".
BENGALI - com.yahoo.language.Language
Language tag "bn".
BENGALI - com.yahoo.language.process.TokenScript
 
BEST - com.yahoo.language.process.StemMode
 
BHUTANI - com.yahoo.language.Language
Language tag "dz".
BIHARI - com.yahoo.language.Language
Language tag "bh".
BISLAMA - com.yahoo.language.Language
Language tag "bi".
BRAILLE - com.yahoo.language.process.TokenScript
 
BRETON - com.yahoo.language.Language
Language tag "br".
BUGINESE - com.yahoo.language.Language
Language tag "bug".
BUGINESE - com.yahoo.language.process.TokenScript
 
BUHID - com.yahoo.language.process.TokenScript
 
BULGARIAN - com.yahoo.language.Language
Language tag "bg".
BURMESE - com.yahoo.language.Language
Language tag "my".
BYELORUSSIAN - com.yahoo.language.Language
Language tag "be".

C

CAMBODIAN - com.yahoo.language.Language
Language tag "km".
CANADIAN - com.yahoo.language.process.TokenScript
 
CATALAN - com.yahoo.language.Language
Language tag "ca".
CHARACTER_CLASSES - com.yahoo.language.Linguistics.Component
 
CharacterClasses - Class in com.yahoo.language.process
Determines the class of a given character.
CharacterClasses() - Constructor for class com.yahoo.language.process.CharacterClasses
 
CHEROKEE - com.yahoo.language.Language
Language tag "chr".
CHEROKEE - com.yahoo.language.process.TokenScript
 
CHINESE - com.yahoo.language.process.TokenScript
 
CHINESE_SIMPLIFIED - com.yahoo.language.Language
Language tag "zh-hans".
CHINESE_TRADITIONAL - com.yahoo.language.Language
Language tag "zh-hant".
com.yahoo.language - package com.yahoo.language
 
com.yahoo.language.detect - package com.yahoo.language.detect
 
com.yahoo.language.process - package com.yahoo.language.process
 
COMMON - com.yahoo.language.process.TokenScript
 
COPTIC - com.yahoo.language.Language
Language tag "cop".
COPTIC - com.yahoo.language.process.TokenScript
 
CORSICAN - com.yahoo.language.Language
Language tag "co".
CROATIAN - com.yahoo.language.Language
Language tag "hr".
CYPRIOT - com.yahoo.language.process.TokenScript
 
CYRILLIC - com.yahoo.language.process.TokenScript
 
CZECH - com.yahoo.language.Language
Language tag "cs".

D

DANISH - com.yahoo.language.Language
Language tag "da".
DEFAULT - com.yahoo.language.process.StemMode
 
DESERET - com.yahoo.language.process.TokenScript
 
detect(byte[], int, int, Hint) - Method in interface com.yahoo.language.detect.Detector
Detects language and encoding of the supplied byte array, possibly using a language/encoding hint.
detect(String, Hint) - Method in class com.yahoo.language.detect.AbstractDetector
 
detect(String, Hint) - Method in interface com.yahoo.language.detect.Detector
Detects language of the supplied String, possibly using a language hint.
detect(ByteBuffer, Hint) - Method in class com.yahoo.language.detect.AbstractDetector
 
detect(ByteBuffer, Hint) - Method in interface com.yahoo.language.detect.Detector
Detects language and encoding of the supplied ByteBuffer, possibly using a language/encoding hint.
Detection - Class in com.yahoo.language.detect
 
Detection(Language, String, boolean) - Constructor for class com.yahoo.language.detect.Detection
 
DetectionException - Exception in com.yahoo.language.detect
Exception that is thrown when detection fails.
DetectionException(String) - Constructor for exception com.yahoo.language.detect.DetectionException
 
Detector - Interface in com.yahoo.language.detect
Common interface of all Detectors used for language and encoding detection.
DETECTOR - com.yahoo.language.Linguistics.Component
 
DEVANAGARI - com.yahoo.language.process.TokenScript
 
DIVEHI - com.yahoo.language.Language
Language tag "div".
DUTCH - com.yahoo.language.Language
Language tag "nl".

E

ENGLISH - com.yahoo.language.Language
Language tag "en".
equals(Object) - Method in class com.yahoo.language.process.GramSplitter.Gram
 
ESPERANTO - com.yahoo.language.Language
Language tag "eo".
ESTONIAN - com.yahoo.language.Language
Language tag "et".
ETHIOPIC - com.yahoo.language.process.TokenScript
 
extractFrom(GramSplitter.UnicodeString) - Method in class com.yahoo.language.process.GramSplitter.Gram
Returns this gram as a string from the input string
extractFrom(String) - Method in class com.yahoo.language.process.GramSplitter.Gram
Returns this gram as a string from the input string
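The extractFrom entries above, together with the GramSplitter.Gram and split(String, int) entries elsewhere in this index, describe a gram as a start index and length pair that is resolved against the input string. The sketch below illustrates that idea under simplifying assumptions: it treats the whole input as a single word, uses plain char indices, and ignores character classes, which the real GramSplitter does not. The names GramSketch, Span and grams are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

final class GramSketch {

    // Hypothetical stand-in for a gram: an immutable start index and length.
    static final class Span {
        final int start;
        final int length;

        Span(int start, int length) {
            this.start = start;
            this.length = length;
        }

        // Resolves this span against the string it was computed from.
        String extractFrom(String input) {
            return input.substring(start, start + length);
        }
    }

    // Returns overlapping n-grams over the whole input as [start, length] spans.
    // Assumption: a non-empty input shorter than n yields one span covering it.
    static List<Span> grams(String input, int n) {
        if (n < 1) throw new IllegalArgumentException("n must be at least 1");
        List<Span> spans = new ArrayList<>();
        if (input.isEmpty()) return spans;
        if (input.length() <= n) {
            spans.add(new Span(0, input.length()));
            return spans;
        }
        for (int start = 0; start + n <= input.length(); start++) {
            spans.add(new Span(start, n));
        }
        return spans;
    }

    public static void main(String[] args) {
        for (Span span : grams("hello", 3)) {
            System.out.println(span.extractFrom("hello")); // hel, ell, llo
        }
    }
}
```

Keeping grams as index pairs rather than strings avoids allocating substrings until a caller actually asks for the text.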

F

FAROESE - com.yahoo.language.Language
Language tag "fo".
FIJI - com.yahoo.language.Language
Language tag "fj".
FINNISH - com.yahoo.language.Language
Language tag "fi".
FRENCH - com.yahoo.language.Language
Language tag "fr".
FRISIAN - com.yahoo.language.Language
Language tag "fy".
fromEncoding(String) - Static method in enum com.yahoo.language.Language
Returns the language from an encoding, or Language.UNKNOWN if it cannot be determined.
fromLanguageTag(String) - Static method in enum com.yahoo.language.Language
Convenience method for calling fromLocale(LocaleFactory.fromLanguageTag(languageTag)).
fromLanguageTag(String) - Static method in class com.yahoo.language.LocaleFactory
Implements a simple parser for RFC5646 language tags.
fromLocale(Locale) - Static method in enum com.yahoo.language.Language
Returns the Language whose Language.languageCode() is equal to locale.getLanguage(), with the following additions:

G

GALICIAN - com.yahoo.language.Language
Language tag "gl".
GEORGIAN - com.yahoo.language.Language
Language tag "ka".
GEORGIAN - com.yahoo.language.process.TokenScript
 
GERMAN - com.yahoo.language.Language
Language tag "de".
get(int) - Method in class com.yahoo.language.process.StemList
 
getCharacterClasses() - Method in interface com.yahoo.language.Linguistics
Returns a thread-unsafe character classes instance.
getCodePointCount() - Method in class com.yahoo.language.process.GramSplitter.Gram
 
getComponent(int) - Method in interface com.yahoo.language.process.Token
Returns a component token of this
getCountry() - Method in class com.yahoo.language.detect.Hint
 
getDetector() - Method in interface com.yahoo.language.Linguistics
Returns a thread-unsafe detector.
getEncoding() - Method in class com.yahoo.language.detect.Detection
 
getEncodingName() - Method in class com.yahoo.language.detect.Detection
 
getGramSplitter() - Method in interface com.yahoo.language.Linguistics
Returns a thread-unsafe gram splitter.
getLanguage() - Method in class com.yahoo.language.detect.Detection
 
getMarket() - Method in class com.yahoo.language.detect.Hint
 
getNormalizer() - Method in interface com.yahoo.language.Linguistics
Returns a thread-unsafe normalizer.
getNumComponents() - Method in interface com.yahoo.language.process.Token
Returns the number of components, if this token is a compound word (e.g.
getNumStems() - Method in interface com.yahoo.language.process.Token
Returns the number of stem forms available for this token.
getOffset() - Method in interface com.yahoo.language.process.Token
Returns the offset position of this token
getOrig() - Method in interface com.yahoo.language.process.Token
Returns the original form of this token
getReplacementTerm(String) - Method in interface com.yahoo.language.process.Tokenizer
Return a replacement for an input token string.
getScript() - Method in interface com.yahoo.language.process.Token
Returns the script of this token
getSegmenter() - Method in interface com.yahoo.language.Linguistics
Returns a thread-unsafe segmenter.
getStart() - Method in class com.yahoo.language.process.GramSplitter.Gram
 
getStem(int) - Method in interface com.yahoo.language.process.Token
Returns the stem at position i
getStemmer() - Method in interface com.yahoo.language.Linguistics
Returns a thread-unsafe stemmer or lemmatizer.
getTokenizer() - Method in interface com.yahoo.language.Linguistics
Returns a thread-unsafe tokenizer.
getTokenString() - Method in interface com.yahoo.language.process.Token
Returns token string in a form suitable for indexing: The most lowercased variant of the most processed token form available.
getTransformer() - Method in interface com.yahoo.language.Linguistics
Returns a thread-unsafe transformer.
getType() - Method in interface com.yahoo.language.process.Token
Returns the type of this token - word, space or punctuation etc.
getValue() - Method in enum com.yahoo.language.process.TokenType
Returns an int code for this type
GLAGOLITIC - com.yahoo.language.process.TokenScript
 
GOTHIC - com.yahoo.language.Language
Language tag "got".
GOTHIC - com.yahoo.language.process.TokenScript
 
Gram(int, int) - Constructor for class com.yahoo.language.process.GramSplitter.Gram
 
GRAM_SPLITTER - com.yahoo.language.Linguistics.Component
 
GramSplitter - Class in com.yahoo.language.process
A class which splits consecutive word character sequences into overlapping character n-grams.
GramSplitter(CharacterClasses) - Constructor for class com.yahoo.language.process.GramSplitter
 
GramSplitter.Gram - Class in com.yahoo.language.process
An immutable start index and length pair
GramSplitter.GramSplitterIterator - Class in com.yahoo.language.process
 
GramSplitterIterator(String, int, CharacterClasses) - Constructor for class com.yahoo.language.process.GramSplitter.GramSplitterIterator
 
GREEK - com.yahoo.language.Language
Language tag "el".
GREEK - com.yahoo.language.process.TokenScript
 
GREENLANDIC - com.yahoo.language.Language
Language tag "kl".
GUARANI - com.yahoo.language.Language
Language tag "gn".
GUJARATI - com.yahoo.language.Language
Language tag "gu".
GUJARATI - com.yahoo.language.process.TokenScript
 
GURMUKHI - com.yahoo.language.process.TokenScript
 

H

HAN - com.yahoo.language.process.TokenScript
 
HANGUL - com.yahoo.language.process.TokenScript
 
HANUNOO - com.yahoo.language.process.TokenScript
 
hashCode() - Method in class com.yahoo.language.process.GramSplitter.Gram
 
hasNext() - Method in class com.yahoo.language.process.GramSplitter.GramSplitterIterator
 
HAUSA - com.yahoo.language.Language
Language tag "ha".
HEBREW - com.yahoo.language.Language
Language tag "he".
HEBREW - com.yahoo.language.process.TokenScript
 
HINDI - com.yahoo.language.Language
Language tag "hi".
Hint - Class in com.yahoo.language.detect
A hint that can be given to a Detector.
HIRAGANA - com.yahoo.language.process.TokenScript
 
HUNGARIAN - com.yahoo.language.Language
Language tag "hu".

I

ICELANDIC - com.yahoo.language.Language
Language tag "is".
INDONESIAN - com.yahoo.language.Language
Language tag "id".
INHERITED - com.yahoo.language.process.TokenScript
 
INTERLINGUA - com.yahoo.language.Language
Language tag "ia".
INTERLINGUE - com.yahoo.language.Language
Language tag "ie".
INUKTITUT - com.yahoo.language.Language
Language tag "iu".
INUPIAK - com.yahoo.language.Language
Language tag "ik".
IRISH - com.yahoo.language.Language
Language tag "ga".
isCjk() - Method in enum com.yahoo.language.Language
Returns whether this is a "cjk" language.
isDigit(int) - Method in class com.yahoo.language.process.CharacterClasses
Returns true for code points which should be considered digits - same as java.lang.Character.isDigit
isIndexable() - Method in interface com.yahoo.language.process.Token
Whether this token should be indexed
isIndexable() - Method in enum com.yahoo.language.process.TokenType
Marker for whether this type of token can be indexed for search.
isLatin(int) - Method in class com.yahoo.language.process.CharacterClasses
Returns true if this is a Latin character
isLatinDigit(int) - Method in class com.yahoo.language.process.CharacterClasses
Returns true if this is a Latin digit (other digits are not consistently parsed into numbers by Java)
isLetter(int) - Method in class com.yahoo.language.process.CharacterClasses
Returns true for code points which are letters in Unicode 3 or 4, plus some additional characters which are useful to view as letters even though not defined as such in Unicode.
isLetterOrDigit(int) - Method in class com.yahoo.language.process.CharacterClasses
Convenience, returns isLetter(c) || isDigit(c)
isLocal() - Method in class com.yahoo.language.detect.Detection
 
isSpecialToken() - Method in interface com.yahoo.language.process.Token
Returns whether this is an instance of a declared special token (e.g.
ITALIAN - com.yahoo.language.Language
Language tag "it".
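The CharacterClasses entries in this section describe isDigit as equivalent to java.lang.Character.isDigit and isLetterOrDigit as isLetter(c) || isDigit(c). The sketch below mirrors only those relationships with JDK calls. It does not reproduce the additional characters that the real isLetter accepts, and the ASCII range used for isLatinDigit is an assumption made for illustration. The class name CharClassSketch is hypothetical.

```java
final class CharClassSketch {

    // Simplification: the real class also accepts some extra characters as letters.
    static boolean isLetter(int codePoint) {
        return Character.isLetter(codePoint);
    }

    static boolean isDigit(int codePoint) {
        return Character.isDigit(codePoint);
    }

    // Convenience combination, as described in the index entry.
    static boolean isLetterOrDigit(int codePoint) {
        return isLetter(codePoint) || isDigit(codePoint);
    }

    // Assumption: "Latin digit" is taken to mean the ASCII digits 0 through 9.
    static boolean isLatinDigit(int codePoint) {
        return codePoint >= '0' && codePoint <= '9';
    }

    public static void main(String[] args) {
        int arabicIndicFive = 0x0665; // ARABIC-INDIC DIGIT FIVE
        System.out.println(isDigit(arabicIndicFive));      // true
        System.out.println(isLatinDigit(arabicIndicFive)); // false
        System.out.println(isLetterOrDigit('_'));          // false
    }
}
```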

J

JAPANESE - com.yahoo.language.Language
Language tag "ja".
JAVANESE - com.yahoo.language.Language
Language tag "jw".

K

KANNADA - com.yahoo.language.Language
Language tag "kn".
KANNADA - com.yahoo.language.process.TokenScript
 
KASHMIRI - com.yahoo.language.Language
Language tag "ks".
KATAKANA - com.yahoo.language.process.TokenScript
 
KAZAKH - com.yahoo.language.Language
Language tag "kk".
KHAROSHTHI - com.yahoo.language.process.TokenScript
 
KHMER - com.yahoo.language.process.TokenScript
 
KINYARWANDA - com.yahoo.language.Language
Language tag "rw".
KIRGHIZ - com.yahoo.language.Language
Language tag "ky".
KIRUNDI - com.yahoo.language.Language
Language tag "rn".
KOREAN - com.yahoo.language.Language
Language tag "ko".
KURDISH - com.yahoo.language.Language
Language tag "ku".

L

Language - Enum in com.yahoo.language
 
languageCode() - Method in enum com.yahoo.language.Language
 
LAO - com.yahoo.language.process.TokenScript
 
LAOTHIAN - com.yahoo.language.Language
Language tag "lo".
LATIN - com.yahoo.language.Language
Language tag "la".
LATIN - com.yahoo.language.process.TokenScript
 
LATVIAN - com.yahoo.language.Language
Language tag "lv".
LIMBU - com.yahoo.language.process.TokenScript
 
LINEARB - com.yahoo.language.process.TokenScript
 
LINGALA - com.yahoo.language.Language
Language tag "ln".
Linguistics - Interface in com.yahoo.language
Factory of linguistic processors.
Linguistics.Component - Enum in com.yahoo.language
 
LinguisticsCase - Class in com.yahoo.language
This class provides a case normalization operation to be used e.g.
LinguisticsCase() - Constructor for class com.yahoo.language.LinguisticsCase
 
LITHUANIAN - com.yahoo.language.Language
Language tag "lt".
LocaleFactory - Class in com.yahoo.language
 

M

MACEDONIAN - com.yahoo.language.Language
Language tag "mk".
MALAGASY - com.yahoo.language.Language
Language tag "mg".
MALAY - com.yahoo.language.Language
Language tag "ms".
MALAYALAM - com.yahoo.language.Language
Language tag "ml".
MALAYALAM - com.yahoo.language.process.TokenScript
 
MALTESE - com.yahoo.language.Language
Language tag "mt".
MANIPURI - com.yahoo.language.Language
Language tag "mni".
MAORI - com.yahoo.language.Language
Language tag "mi".
MARATHI - com.yahoo.language.Language
Language tag "mr".
MARKER - com.yahoo.language.process.TokenType
 
MOLDAVIAN - com.yahoo.language.Language
Language tag "mo".
MONGOLIAN - com.yahoo.language.Language
Language tag "mn".
MONGOLIAN - com.yahoo.language.process.TokenScript
 
MUNDA - com.yahoo.language.Language
Language tag "mun".
MYANMAR - com.yahoo.language.process.TokenScript
 

N

NAURU - com.yahoo.language.Language
Language tag "na".
NEPALI - com.yahoo.language.Language
Language tag "ne".
newCountryHint(String) - Static method in class com.yahoo.language.detect.Hint
 
newInstance(String, String) - Static method in class com.yahoo.language.detect.Hint
 
newMarketHint(String) - Static method in class com.yahoo.language.detect.Hint
 
next() - Method in class com.yahoo.language.process.GramSplitter.GramSplitterIterator
 
NONE - com.yahoo.language.process.StemMode
 
normalize(String) - Method in interface com.yahoo.language.process.Normalizer
NFKC normalizes a String.
Normalizer - Interface in com.yahoo.language.process
This interface provides NFKC normalization of Strings through the underlying linguistics library.
NORMALIZER - com.yahoo.language.Linguistics.Component
 
NORWEGIAN_BOKMAL - com.yahoo.language.Language
Language tag "nb".
NORWEGIAN_NYNORSK - com.yahoo.language.Language
Language tag "nn".
NUMERIC - com.yahoo.language.process.TokenType
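The normalize(String) and Normalizer entries in this section refer to NFKC normalization. To show what NFKC does to a string, the sketch below uses the JDK class java.text.Normalizer, which is a different type from the com.yahoo.language.process.Normalizer interface listed here. The class name NfkcSketch and the method nfkc are hypothetical, and nothing is implied about how the linguistics library performs the normalization internally.

```java
import java.text.Normalizer;

final class NfkcSketch {

    // Applies Unicode compatibility decomposition followed by canonical composition.
    static String nfkc(String input) {
        return Normalizer.normalize(input, Normalizer.Form.NFKC);
    }

    public static void main(String[] args) {
        // U+FB01 is the single-character "fi" ligature.
        System.out.println(nfkc("\uFB01nd")); // find
        // U+FF21 is FULLWIDTH LATIN CAPITAL LETTER A.
        System.out.println(nfkc("\uFF21"));   // A
    }
}
```

Compatibility normalization of this kind lets visually equivalent forms, such as ligatures and fullwidth letters, match their plain counterparts.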
 

O

OCCITAN - com.yahoo.language.Language
Language tag "oc".
OGHAM - com.yahoo.language.process.TokenScript
 
OLDITALIC - com.yahoo.language.process.TokenScript
 
OLDPERSIAN - com.yahoo.language.process.TokenScript
 
ORIYA - com.yahoo.language.Language
Language tag "or".
ORIYA - com.yahoo.language.process.TokenScript
 
OROMO - com.yahoo.language.Language
Language tag "om".
OSMANYA - com.yahoo.language.process.TokenScript
 

P

PASHTO - com.yahoo.language.Language
Language tag "ps".
PERSIAN - com.yahoo.language.Language
Language tag "fa".
POLISH - com.yahoo.language.Language
Language tag "pl".
PORTUGUESE - com.yahoo.language.Language
Language tag "pt".
ProcessingException - Exception in com.yahoo.language.process
Exception class indicating that a fatal error occurred during linguistic processing.
ProcessingException(String) - Constructor for exception com.yahoo.language.process.ProcessingException
 
ProcessingException(String, Throwable) - Constructor for exception com.yahoo.language.process.ProcessingException
 
PUNCTUATION - com.yahoo.language.process.TokenType
 
PUNJABI - com.yahoo.language.Language
Language tag "pa".

Q

QUECHUA - com.yahoo.language.Language
Language tag "qu".

R

remove() - Method in class com.yahoo.language.process.GramSplitter.GramSplitterIterator
 
remove(int) - Method in class com.yahoo.language.process.StemList
 
RHAETO_ROMANCE - com.yahoo.language.Language
Language tag "rm".
ROMANIAN - com.yahoo.language.Language
Language tag "ro".
RUNIC - com.yahoo.language.process.TokenScript
 
RUSSIAN - com.yahoo.language.Language
Language tag "ru".

S

SAMOAN - com.yahoo.language.Language
Language tag "sm".
SANGHO - com.yahoo.language.Language
Language tag "sg".
SANSKRIT - com.yahoo.language.Language
Language tag "sa".
SCOTS_GAELIC - com.yahoo.language.Language
Language tag "gd".
segment(String, Language) - Method in interface com.yahoo.language.process.Segmenter
Split input-string into tokens, and return a list of tokens in unprocessed form (i.e.
segment(String, Language) - Method in class com.yahoo.language.process.SegmenterImpl
 
Segmenter - Interface in com.yahoo.language.process
Interface providing segmentation, i.e.
SEGMENTER - com.yahoo.language.Linguistics.Component
 
SegmenterImpl - Class in com.yahoo.language.process
 
SegmenterImpl(Tokenizer) - Constructor for class com.yahoo.language.process.SegmenterImpl
 
SERBIAN - com.yahoo.language.Language
Language tag "sr".
SERBO_CROATIAN - com.yahoo.language.Language
Language tag "sh".
SESOTHO - com.yahoo.language.Language
Language tag "st".
set(int, String) - Method in class com.yahoo.language.process.StemList
 
SETSWANA - com.yahoo.language.Language
Language tag "tn".
SHAVIAN - com.yahoo.language.process.TokenScript
 
SHONA - com.yahoo.language.Language
Language tag "sn".
SHORTEST - com.yahoo.language.process.StemMode
 
SICHUAN_YI - com.yahoo.language.Language
Language tag "ii".
SINDHI - com.yahoo.language.Language
Language tag "sd".
SINHALA - com.yahoo.language.process.TokenScript
 
SINHALESE - com.yahoo.language.Language
Language tag "si".
SISWATI - com.yahoo.language.Language
Language tag "ss".
size() - Method in class com.yahoo.language.process.StemList
 
SLOVAK - com.yahoo.language.Language
Language tag "sk".
SLOVENIAN - com.yahoo.language.Language
Language tag "sl".
SOMALI - com.yahoo.language.Language
Language tag "so".
SPACE - com.yahoo.language.process.TokenType
 
SPANISH - com.yahoo.language.Language
Language tag "es".
split(String, int) - Method in class com.yahoo.language.process.GramSplitter
Splits the input into grams of size n and returns an iterator over grams represented as [start index,length] pairs into the input string.
stem(String, StemMode, Language) - Method in interface com.yahoo.language.process.Stemmer
Stem input according to specified stemming mode.
stem(String, StemMode, Language) - Method in class com.yahoo.language.process.StemmerImpl
 
StemList - Class in com.yahoo.language.process
A list of strings which does not allow for duplicate elements.
StemList() - Constructor for class com.yahoo.language.process.StemList
 
StemList(String...) - Constructor for class com.yahoo.language.process.StemList
 
Stemmer - Interface in com.yahoo.language.process
Interface providing stemming of single words.
STEMMER - com.yahoo.language.Linguistics.Component
 
StemmerImpl - Class in com.yahoo.language.process
 
StemmerImpl(Tokenizer) - Constructor for class com.yahoo.language.process.StemmerImpl
 
StemMode - Enum in com.yahoo.language.process
An enum of the stemming modes which can be requested.
SUNDANESE - com.yahoo.language.Language
Language tag "su".
SWAHILI - com.yahoo.language.Language
Language tag "sw".
SWEDISH - com.yahoo.language.Language
Language tag "sv".
SYLOTINAGRI - com.yahoo.language.process.TokenScript
 
SYMBOL - com.yahoo.language.process.TokenType
 
SYRIAC - com.yahoo.language.Language
Language tag "syr".
SYRIAC - com.yahoo.language.process.TokenScript
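The StemList entry in this section describes a list of strings which does not allow duplicate elements. One way to get that behavior on top of java.util.AbstractList is sketched below. The choice to silently skip a duplicate instead of throwing is an assumption, as is everything about the internals; only add, get and size are shown, and the class name UniqueListSketch is hypothetical.

```java
import java.util.AbstractList;
import java.util.ArrayList;
import java.util.List;

final class UniqueListSketch extends AbstractList<String> {

    private final List<String> items = new ArrayList<>();

    @Override
    public String get(int index) {
        return items.get(index);
    }

    @Override
    public int size() {
        return items.size();
    }

    // Assumption: an element that is already present is skipped, not rejected.
    // AbstractList.add(E) delegates here with index == size().
    @Override
    public void add(int index, String element) {
        if (!items.contains(element)) {
            items.add(index, element);
        }
    }

    public static void main(String[] args) {
        UniqueListSketch stems = new UniqueListSketch();
        stems.add("run");
        stems.add("running");
        stems.add("run"); // duplicate, skipped
        System.out.println(stems); // [run, running]
    }
}
```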
 

T

TAGALOG - com.yahoo.language.Language
Language tag "fil".
TAGALOG - com.yahoo.language.process.TokenScript
 
TAGBANWA - com.yahoo.language.process.TokenScript
 
TAILE - com.yahoo.language.process.TokenScript
 
TAILUE - com.yahoo.language.process.TokenScript
 
TAJIK - com.yahoo.language.Language
Language tag "tg".
TAMIL - com.yahoo.language.Language
Language tag "ta".
TAMIL - com.yahoo.language.process.TokenScript
 
TATAR - com.yahoo.language.Language
Language tag "tt".
TELUGU - com.yahoo.language.Language
Language tag "te".
TELUGU - com.yahoo.language.process.TokenScript
 
THAANA - com.yahoo.language.process.TokenScript
 
THAI - com.yahoo.language.Language
Language tag "th".
THAI - com.yahoo.language.process.TokenScript
 
TIBETAN - com.yahoo.language.Language
Language tag "bo".
TIBETAN - com.yahoo.language.process.TokenScript
 
TIFINAGH - com.yahoo.language.process.TokenScript
 
TIGRINYA - com.yahoo.language.Language
Language tag "ti".
toExtractedList() - Method in class com.yahoo.language.process.GramSplitter.GramSplitterIterator
Convenience list which splits the remaining items in this iterator into a list of gram strings
Token - Interface in com.yahoo.language.process
A single token produced by the tokenizer.
tokenize(String, Language, StemMode, boolean) - Method in interface com.yahoo.language.process.Tokenizer
Returns the tokens produced from an input string under the rules of the given Language and additional options
Tokenizer - Interface in com.yahoo.language.process
Language-sensitive tokenization of a text string.
TOKENIZER - com.yahoo.language.Linguistics.Component
 
TokenScript - Enum in com.yahoo.language.process
List of token scripts (e.g.
TokenType - Enum in com.yahoo.language.process
An enumeration of token types.
toLowerCase(String) - Static method in class com.yahoo.language.LinguisticsCase
The lower casing method to use in Vespa when doing language independent processing of natural language data.
TONGA - com.yahoo.language.Language
Language tag "to".
Transformer - Interface in com.yahoo.language.process
Interface for providers of text transformations such as accent removal.
TRANSFORMER - com.yahoo.language.Linguistics.Component
 
TSONGA - com.yahoo.language.Language
Language tag "ts".
TURKISH - com.yahoo.language.Language
Language tag "tr".
TURKMEN - com.yahoo.language.Language
Language tag "tk".
TWI - com.yahoo.language.Language
Language tag "tw".
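The toLowerCase(String) entry in this section describes lowercasing for language independent processing. The sketch below shows why that matters: String.toLowerCase depends on the locale it is given, so a fixed locale is needed to get the same result everywhere. Using Locale.ROOT is an assumption made for this illustration and is not taken from LinguisticsCase; the class name LowercaseSketch is hypothetical.

```java
import java.util.Locale;

final class LowercaseSketch {

    // Assumption: a fixed locale stands in for "language independent" lowercasing.
    static String toLowerCase(String input) {
        return input.toLowerCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        Locale turkish = Locale.forLanguageTag("tr");
        // Under Turkish rules, capital I lowercases to dotless i (U+0131).
        System.out.println("TITLE".toLowerCase(turkish).equals("title")); // false
        // With a fixed locale the result does not depend on the environment.
        System.out.println(toLowerCase("TITLE")); // title
    }
}
```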

U

UGARITIC - com.yahoo.language.Language
Language tag "uga".
UGARITIC - com.yahoo.language.process.TokenScript
 
UIGHUR - com.yahoo.language.Language
Language tag "ug".
UKRAINIAN - com.yahoo.language.Language
Language tag "uk".
UNKNOWN - com.yahoo.language.Language
Language tag "un".
UNKNOWN - com.yahoo.language.process.TokenScript
 
UNKNOWN - com.yahoo.language.process.TokenType
 
URDU - com.yahoo.language.Language
Language tag "ur".
UZBEK - com.yahoo.language.Language
Language tag "uz".

V

valueOf(int) - Static method in enum com.yahoo.language.process.TokenType
Translates this from the int code representation returned from TokenType.getValue()
valueOf(String) - Static method in enum com.yahoo.language.Language
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum com.yahoo.language.Linguistics.Component
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum com.yahoo.language.process.StemMode
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum com.yahoo.language.process.TokenScript
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum com.yahoo.language.process.TokenType
Returns the enum constant of this type with the specified name.
values() - Static method in enum com.yahoo.language.Language
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum com.yahoo.language.Linguistics.Component
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum com.yahoo.language.process.StemMode
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum com.yahoo.language.process.TokenScript
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum com.yahoo.language.process.TokenType
Returns an array containing the constants of this enum type, in the order they are declared.
VIETNAMESE - com.yahoo.language.Language
Language tag "vi".
VIETNAMESE - com.yahoo.language.process.TokenScript
 
VOLAPUK - com.yahoo.language.Language
Language tag "vo".
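The valueOf(int) entry in this section translates the int code returned from TokenType.getValue() back into an enum constant. The sketch below shows that round trip on a small hypothetical enum. The constant names are borrowed from this index, but the numeric codes and the fallback to UNKNOWN for unrecognized codes are made up for illustration and are not the real TokenType values.

```java
final class TokenTypeSketch {

    // Hypothetical enum; the int codes below are invented for this example.
    enum Kind {
        UNKNOWN(0), SPACE(1), NUMERIC(2);

        private final int value;

        Kind(int value) {
            this.value = value;
        }

        // Returns the int code for this constant.
        int getValue() {
            return value;
        }

        // Overload that sits next to the compiler-generated valueOf(String).
        // Assumption: codes that match no constant map to UNKNOWN.
        static Kind valueOf(int value) {
            for (Kind kind : values()) {
                if (kind.value == value) return kind;
            }
            return UNKNOWN;
        }
    }

    public static void main(String[] args) {
        int code = Kind.NUMERIC.getValue();
        System.out.println(Kind.valueOf(code)); // NUMERIC
        System.out.println(Kind.valueOf(99));   // UNKNOWN
    }
}
```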

W

WELSH - com.yahoo.language.Language
Language tag "cy".
WOLOF - com.yahoo.language.Language
Language tag "wo".

X

XHOSA - com.yahoo.language.Language
Language tag "xh".

Y

YI - com.yahoo.language.process.TokenScript
 
YIDDISH - com.yahoo.language.Language
Language tag "yi".
YORUBA - com.yahoo.language.Language
Language tag "yo".

Z

ZHUANG - com.yahoo.language.Language
Language tag "za".
ZULU - com.yahoo.language.Language
Language tag "zu".
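Every Language constant in this index carries a language tag, and the fromLanguageTag(String) entries mention parsing of RFC5646 language tags. To show how such a tag breaks down into parts, the sketch below uses the JDK parser Locale.forLanguageTag as a stand-in. It is not LocaleFactory, whose parser is described here only as "simple" and may behave differently. The class name LanguageTagSketch and the method describe are hypothetical.

```java
import java.util.Locale;

final class LanguageTagSketch {

    // Splits a language tag into its language, script and country subtags
    // using the JDK parser, which normalizes the case of each subtag.
    static String describe(String tag) {
        Locale locale = Locale.forLanguageTag(tag);
        return "language=" + locale.getLanguage()
                + " script=" + locale.getScript()
                + " country=" + locale.getCountry();
    }

    public static void main(String[] args) {
        System.out.println(describe("zh-hans")); // language=zh script=Hans country=
        System.out.println(describe("en-US"));   // language=en script= country=US
    }
}
```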