A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
All Classes All Packages
All Classes All Packages
All Classes All Packages
A
- ABKHAZIAN - com.yahoo.language.Language
-
Language tag "ab".
- AbstractDetector - Class in com.yahoo.language.detect
- AbstractDetector() - Constructor for class com.yahoo.language.detect.AbstractDetector
- accentDrop(String, Language) - Method in interface com.yahoo.language.process.Transformer
-
Remove accents from input text.
- add(int, String) - Method in class com.yahoo.language.process.StemList
- AFAR - com.yahoo.language.Language
-
Language tag "aa".
- AFRIKAANS - com.yahoo.language.Language
-
Language tag "af".
- ALBANIAN - com.yahoo.language.Language
-
Language tag "sq".
- ALL - com.yahoo.language.process.StemMode
- ALPHABETIC - com.yahoo.language.process.TokenType
- AMHARIC - com.yahoo.language.Language
-
Language tag "am".
- ARABIC - com.yahoo.language.Language
-
Language tag "ar".
- ARABIC - com.yahoo.language.process.TokenScript
- ARMENIAN - com.yahoo.language.Language
-
Language tag "hy".
- ARMENIAN - com.yahoo.language.process.TokenScript
- ASCII - com.yahoo.language.process.TokenScript
- ASSAMESE - com.yahoo.language.Language
-
Language tag "as".
- AYMARA - com.yahoo.language.Language
-
Language tag "ay".
- AZERBAIJANI - com.yahoo.language.Language
-
Language tag "az".
B
- BASHKIR - com.yahoo.language.Language
-
Language tag "ba".
- BASQUE - com.yahoo.language.Language
-
Language tag "eu".
- BENGALI - com.yahoo.language.Language
-
Language tag "bn".
- BENGALI - com.yahoo.language.process.TokenScript
- BEST - com.yahoo.language.process.StemMode
- BHUTANI - com.yahoo.language.Language
-
Language tag "dz".
- BIHARI - com.yahoo.language.Language
-
Language tag "bh".
- BISLAMA - com.yahoo.language.Language
-
Language tag "bi".
- BRAILLE - com.yahoo.language.process.TokenScript
- BRETON - com.yahoo.language.Language
-
Language tag "br".
- BUGINESE - com.yahoo.language.Language
-
Language tag "bug".
- BUGINESE - com.yahoo.language.process.TokenScript
- BUHID - com.yahoo.language.process.TokenScript
- BULGARIAN - com.yahoo.language.Language
-
Language tag "bg".
- BURMESE - com.yahoo.language.Language
-
Language tag "my".
- BYELORUSSIAN - com.yahoo.language.Language
-
Language tag "be".
C
- CAMBODIAN - com.yahoo.language.Language
-
Language tag "km".
- CANADIAN - com.yahoo.language.process.TokenScript
- CATALAN - com.yahoo.language.Language
-
Language tag "ca".
- CHARACTER_CLASSES - com.yahoo.language.Linguistics.Component
- CharacterClasses - Class in com.yahoo.language.process
-
Determines the class of a given character.
- CharacterClasses() - Constructor for class com.yahoo.language.process.CharacterClasses
- CHEROKEE - com.yahoo.language.Language
-
Language tag "chr".
- CHEROKEE - com.yahoo.language.process.TokenScript
- CHINESE - com.yahoo.language.process.TokenScript
- CHINESE_SIMPLIFIED - com.yahoo.language.Language
-
Language tag "zh-hans".
- CHINESE_TRADITIONAL - com.yahoo.language.Language
-
Language tag "zh-hant".
- com.yahoo.language - package com.yahoo.language
- com.yahoo.language.detect - package com.yahoo.language.detect
- com.yahoo.language.process - package com.yahoo.language.process
- COMMON - com.yahoo.language.process.TokenScript
- COPTIC - com.yahoo.language.Language
-
Language tag "cop".
- COPTIC - com.yahoo.language.process.TokenScript
- CORSICAN - com.yahoo.language.Language
-
Language tag "co".
- CROATIAN - com.yahoo.language.Language
-
Language tag "hr".
- CYPRIOT - com.yahoo.language.process.TokenScript
- CYRILLIC - com.yahoo.language.process.TokenScript
- CZECH - com.yahoo.language.Language
-
Language tag "cs".
D
- DANISH - com.yahoo.language.Language
-
Language tag "da".
- DEFAULT - com.yahoo.language.process.StemMode
- DESERET - com.yahoo.language.process.TokenScript
- detect(byte[], int, int, Hint) - Method in interface com.yahoo.language.detect.Detector
-
Detects language and encoding of the supplied byte array, possibly using a language/encoding hint.
- detect(String, Hint) - Method in class com.yahoo.language.detect.AbstractDetector
- detect(String, Hint) - Method in interface com.yahoo.language.detect.Detector
-
Detects language of the supplied String, possibly using a language hint.
- detect(ByteBuffer, Hint) - Method in class com.yahoo.language.detect.AbstractDetector
- detect(ByteBuffer, Hint) - Method in interface com.yahoo.language.detect.Detector
-
Detects language and encoding of the supplied ByteBuffer, possibly using a language/encoding hint.
- Detection - Class in com.yahoo.language.detect
- Detection(Language, String, boolean) - Constructor for class com.yahoo.language.detect.Detection
- DetectionException - Exception in com.yahoo.language.detect
-
Exception that is thrown when detection fails.
- DetectionException(String) - Constructor for exception com.yahoo.language.detect.DetectionException
- Detector - Interface in com.yahoo.language.detect
-
Abstract superclass of all Detectors used for language and encoding detection.
- DETECTOR - com.yahoo.language.Linguistics.Component
- DEVANAGARI - com.yahoo.language.process.TokenScript
- DIVEHI - com.yahoo.language.Language
-
Language tag "div".
- DUTCH - com.yahoo.language.Language
-
Language tag "nl".
E
- ENGLISH - com.yahoo.language.Language
-
Language tag "en".
- equals(Object) - Method in class com.yahoo.language.process.GramSplitter.Gram
- ESPERANTO - com.yahoo.language.Language
-
Language tag "eo".
- ESTONIAN - com.yahoo.language.Language
-
Language tag "et".
- ETHIOPIC - com.yahoo.language.process.TokenScript
- extractFrom(String) - Method in class com.yahoo.language.process.GramSplitter.Gram
-
Returns this gram as a string from the input string
F
- FAROESE - com.yahoo.language.Language
-
Language tag "fo".
- FIJI - com.yahoo.language.Language
-
Language tag "fj".
- FINNISH - com.yahoo.language.Language
-
Language tag "fi".
- FRENCH - com.yahoo.language.Language
-
Language tag "fr".
- FRISIAN - com.yahoo.language.Language
-
Language tag "fy".
- fromEncoding(String) - Static method in enum com.yahoo.language.Language
-
Returns the language from an encoding, or
Language.UNKNOWN
if it cannot be determined. - fromLanguageTag(String) - Static method in enum com.yahoo.language.Language
-
Convenience method for calling
fromLocale(LocaleFactory.fromLanguageTag(languageTag))
. - fromLanguageTag(String) - Static method in class com.yahoo.language.LocaleFactory
-
Implements a simple parser for RFC5646 language tags.
- fromLocale(Locale) - Static method in enum com.yahoo.language.Language
-
Returns the
Language
whoseLanguage.languageCode()
is equal tolocale.getLanguage()
, with the following additions:
G
- GALICIAN - com.yahoo.language.Language
-
Language tag "gl".
- GEORGIAN - com.yahoo.language.Language
-
Language tag "ka".
- GEORGIAN - com.yahoo.language.process.TokenScript
- GERMAN - com.yahoo.language.Language
-
Language tag "de".
- get(int) - Method in class com.yahoo.language.process.StemList
- getCharacterClasses() - Method in interface com.yahoo.language.Linguistics
-
Returns a thread-unsafe character classes instance.
- getComponent(int) - Method in interface com.yahoo.language.process.Token
-
Returns a component token of this
- getCountry() - Method in class com.yahoo.language.detect.Hint
- getDetector() - Method in interface com.yahoo.language.Linguistics
-
Returns a thread-unsafe detector.
- getEncoding() - Method in class com.yahoo.language.detect.Detection
- getEncodingName() - Method in class com.yahoo.language.detect.Detection
- getGramSplitter() - Method in interface com.yahoo.language.Linguistics
-
Returns a thread-unsafe gram splitter.
- getLanguage() - Method in class com.yahoo.language.detect.Detection
- getLength() - Method in class com.yahoo.language.process.GramSplitter.Gram
- getMarket() - Method in class com.yahoo.language.detect.Hint
- getNormalizer() - Method in interface com.yahoo.language.Linguistics
-
Returns a thread-unsafe normalizer.
- getNumComponents() - Method in interface com.yahoo.language.process.Token
-
Returns the number of components, if this token is a compound word (e.g.
- getNumStems() - Method in interface com.yahoo.language.process.Token
-
Returns the number of stem forms available for this token.
- getOffset() - Method in interface com.yahoo.language.process.Token
-
Returns the offset position of this token
- getOrig() - Method in interface com.yahoo.language.process.Token
-
Returns the original form of this token
- getReplacementTerm(String) - Method in interface com.yahoo.language.process.Tokenizer
-
Return a replacement for an input token string.
- getScript() - Method in interface com.yahoo.language.process.Token
-
Returns the script of this token
- getSegmenter() - Method in interface com.yahoo.language.Linguistics
-
Returns a thread-unsafe segmenter.
- getStart() - Method in class com.yahoo.language.process.GramSplitter.Gram
- getStem(int) - Method in interface com.yahoo.language.process.Token
-
Returns the stem at position i
- getStemmer() - Method in interface com.yahoo.language.Linguistics
-
Returns a thread-unsafe stemmer or lemmatizer.
- getTokenizer() - Method in interface com.yahoo.language.Linguistics
-
Returns a thread-unsafe tokenizer.
- getTokenString() - Method in interface com.yahoo.language.process.Token
-
Returns token string in a form suitable for indexing: The most lowercased variant of the most processed token form available.
- getTransformer() - Method in interface com.yahoo.language.Linguistics
-
Returns a thread-unsafe transformer.
- getType() - Method in interface com.yahoo.language.process.Token
-
Returns the type of this token - word, space or punctuation etc.
- getValue() - Method in enum com.yahoo.language.process.TokenType
-
Returns an int code for this type
- GLAGOLITIC - com.yahoo.language.process.TokenScript
- GOTHIC - com.yahoo.language.Language
-
Language tag "got".
- GOTHIC - com.yahoo.language.process.TokenScript
- Gram(int, int) - Constructor for class com.yahoo.language.process.GramSplitter.Gram
- GRAM_SPLITTER - com.yahoo.language.Linguistics.Component
- GramSplitter - Class in com.yahoo.language.process
-
A class which splits consecutive word character sequences into overlapping character n-grams.
- GramSplitter(CharacterClasses) - Constructor for class com.yahoo.language.process.GramSplitter
- GramSplitter.Gram - Class in com.yahoo.language.process
-
An immutable start index and length pair
- GramSplitter.GramSplitterIterator - Class in com.yahoo.language.process
- GramSplitterIterator(String, int, CharacterClasses) - Constructor for class com.yahoo.language.process.GramSplitter.GramSplitterIterator
- GREEK - com.yahoo.language.Language
-
Language tag "el".
- GREEK - com.yahoo.language.process.TokenScript
- GREENLANDIC - com.yahoo.language.Language
-
Language tag "kl".
- GUARANI - com.yahoo.language.Language
-
Language tag "gn".
- GUJARATI - com.yahoo.language.Language
-
Language tag "gu".
- GUJARATI - com.yahoo.language.process.TokenScript
- GURMUKHI - com.yahoo.language.process.TokenScript
H
- HAN - com.yahoo.language.process.TokenScript
- HANGUL - com.yahoo.language.process.TokenScript
- HANUNOO - com.yahoo.language.process.TokenScript
- hashCode() - Method in class com.yahoo.language.process.GramSplitter.Gram
- hasNext() - Method in class com.yahoo.language.process.GramSplitter.GramSplitterIterator
- HAUSA - com.yahoo.language.Language
-
Language tag "ha".
- HEBREW - com.yahoo.language.Language
-
Language tag "he".
- HEBREW - com.yahoo.language.process.TokenScript
- HINDI - com.yahoo.language.Language
-
Language tag "hi".
- Hint - Class in com.yahoo.language.detect
-
A hint that can be given to a
Detector
. - HIRAGANA - com.yahoo.language.process.TokenScript
- HUNGARIAN - com.yahoo.language.Language
-
Language tag "hu".
I
- ICELANDIC - com.yahoo.language.Language
-
Language tag "is".
- INDONESIAN - com.yahoo.language.Language
-
Language tag "id".
- INHERITED - com.yahoo.language.process.TokenScript
- INTERLINGUA - com.yahoo.language.Language
-
Language tag "ia".
- INTERLINGUE - com.yahoo.language.Language
-
Language tag "ie".
- INUKTITUT - com.yahoo.language.Language
-
Language tag "iu".
- INUPIAK - com.yahoo.language.Language
-
Language tag "ik".
- IRISH - com.yahoo.language.Language
-
Language tag "ga".
- isCjk() - Method in enum com.yahoo.language.Language
-
Returns whether this is a "cjk" language.
- isDigit(int) - Method in class com.yahoo.language.process.CharacterClasses
-
Returns true for code points which should be considered digits - same as java.lang.Character.isDigit
- isIndexable() - Method in interface com.yahoo.language.process.Token
-
Whether this token should be indexed
- isIndexable() - Method in enum com.yahoo.language.process.TokenType
-
Marker for whether this type of token can be indexed for search.
- isLatin(int) - Method in class com.yahoo.language.process.CharacterClasses
-
Returns true if this is a latin character
- isLatinDigit(int) - Method in class com.yahoo.language.process.CharacterClasses
-
Returns true if this is a latin digit (other digits are not consistently parsed into numbers by Java)
- isLetter(int) - Method in class com.yahoo.language.process.CharacterClasses
-
Returns true for code points which are letters in unicode 3 or 4, plus some additional characters which are useful to view as letters even though not defined as such in unicode.
- isLetterOrDigit(int) - Method in class com.yahoo.language.process.CharacterClasses
-
Convenience, returns isLetter(c) || isDigit(c)
- isLocal() - Method in class com.yahoo.language.detect.Detection
- isSpecialToken() - Method in interface com.yahoo.language.process.Token
-
Returns whether this is an instance of a declared special token (e.g.
- ITALIAN - com.yahoo.language.Language
-
Language tag "it".
J
- JAPANESE - com.yahoo.language.Language
-
Language tag "ja".
- JAVANESE - com.yahoo.language.Language
-
Language tag "jw".
K
- KANNADA - com.yahoo.language.Language
-
Language tag "kn".
- KANNADA - com.yahoo.language.process.TokenScript
- KASHMIRI - com.yahoo.language.Language
-
Language tag "ks".
- KATAKANA - com.yahoo.language.process.TokenScript
- KAZAKH - com.yahoo.language.Language
-
Language tag "kk".
- KHAROSHTHI - com.yahoo.language.process.TokenScript
- KHMER - com.yahoo.language.process.TokenScript
- KINYARWANDA - com.yahoo.language.Language
-
Language tag "rw".
- KIRGHIZ - com.yahoo.language.Language
-
Language tag "ky".
- KIRUNDI - com.yahoo.language.Language
-
Language tag "rn".
- KOREAN - com.yahoo.language.Language
-
Language tag "ko".
- KURDISH - com.yahoo.language.Language
-
Language tag "ku".
L
- Language - Enum in com.yahoo.language
- languageCode() - Method in enum com.yahoo.language.Language
- LAO - com.yahoo.language.process.TokenScript
- LAOTHIAN - com.yahoo.language.Language
-
Language tag "lo".
- LATIN - com.yahoo.language.Language
-
Language tag "la".
- LATIN - com.yahoo.language.process.TokenScript
- LATVIAN - com.yahoo.language.Language
-
Language tag "lv".
- LIMBU - com.yahoo.language.process.TokenScript
- LINEARB - com.yahoo.language.process.TokenScript
- LINGALA - com.yahoo.language.Language
-
Language tag "ln".
- Linguistics - Interface in com.yahoo.language
-
Factory of linguistic processors.
- Linguistics.Component - Enum in com.yahoo.language
- LinguisticsCase - Class in com.yahoo.language
-
This class provides a case normalization operation to be used e.g.
- LinguisticsCase() - Constructor for class com.yahoo.language.LinguisticsCase
- LITHUANIAN - com.yahoo.language.Language
-
Language tag "lt".
- LocaleFactory - Class in com.yahoo.language
M
- MACEDONIAN - com.yahoo.language.Language
-
Language tag "mk".
- MALAGASY - com.yahoo.language.Language
-
Language tag "mg".
- MALAY - com.yahoo.language.Language
-
Language tag "ms".
- MALAYALAM - com.yahoo.language.Language
-
Language tag "ml".
- MALAYALAM - com.yahoo.language.process.TokenScript
- MALTESE - com.yahoo.language.Language
-
Language tag "mt".
- MANIPURI - com.yahoo.language.Language
-
Language tag "mni".
- MAORI - com.yahoo.language.Language
-
Language tag "mi".
- MARATHI - com.yahoo.language.Language
-
Language tag "mr".
- MARKER - com.yahoo.language.process.TokenType
- MOLDAVIAN - com.yahoo.language.Language
-
Language tag "mo".
- MONGOLIAN - com.yahoo.language.Language
-
Language tag "mn".
- MONGOLIAN - com.yahoo.language.process.TokenScript
- MUNDA - com.yahoo.language.Language
-
Language tag "mun".
- MYANMAR - com.yahoo.language.process.TokenScript
N
- NAURU - com.yahoo.language.Language
-
Language tag "na".
- NEPALI - com.yahoo.language.Language
-
Language tag "ne".
- newCountryHint(String) - Static method in class com.yahoo.language.detect.Hint
- newInstance(String, String) - Static method in class com.yahoo.language.detect.Hint
- newMarketHint(String) - Static method in class com.yahoo.language.detect.Hint
- next() - Method in class com.yahoo.language.process.GramSplitter.GramSplitterIterator
- NONE - com.yahoo.language.process.StemMode
- normalize(String) - Method in interface com.yahoo.language.process.Normalizer
-
NFKC normalizes a String.
- Normalizer - Interface in com.yahoo.language.process
-
This interface provides NFKC normalization of Strings through the underlying linguistics library.
- NORMALIZER - com.yahoo.language.Linguistics.Component
- NORWEGIAN_BOKMAL - com.yahoo.language.Language
-
Language tag "nb".
- NORWEGIAN_NYNORSK - com.yahoo.language.Language
-
Language tag "nn".
- NUMERIC - com.yahoo.language.process.TokenType
O
- OCCITAN - com.yahoo.language.Language
-
Language tag "oc".
- OGHAM - com.yahoo.language.process.TokenScript
- OLDITALIC - com.yahoo.language.process.TokenScript
- OLDPERSIAN - com.yahoo.language.process.TokenScript
- ORIYA - com.yahoo.language.Language
-
Language tag "or".
- ORIYA - com.yahoo.language.process.TokenScript
- OROMO - com.yahoo.language.Language
-
Language tag "om".
- OSMANYA - com.yahoo.language.process.TokenScript
P
- PASHTO - com.yahoo.language.Language
-
Language tag "ps".
- PERSIAN - com.yahoo.language.Language
-
Language tag "fa".
- POLISH - com.yahoo.language.Language
-
Language tag "pl".
- PORTUGUESE - com.yahoo.language.Language
-
Language tag "pt".
- ProcessingException - Exception in com.yahoo.language.process
-
Exception class indicating that a fatal error occured during linguistic processing.
- ProcessingException(String) - Constructor for exception com.yahoo.language.process.ProcessingException
- ProcessingException(String, Throwable) - Constructor for exception com.yahoo.language.process.ProcessingException
- PUNCTUATION - com.yahoo.language.process.TokenType
- PUNJABI - com.yahoo.language.Language
-
Language tag "pa".
Q
R
- remove() - Method in class com.yahoo.language.process.GramSplitter.GramSplitterIterator
- remove(int) - Method in class com.yahoo.language.process.StemList
- RHAETO_ROMANCE - com.yahoo.language.Language
-
Language tag "rm".
- ROMANIAN - com.yahoo.language.Language
-
Language tag "ro".
- RUNIC - com.yahoo.language.process.TokenScript
- RUSSIAN - com.yahoo.language.Language
-
Language tag "ru".
S
- SAMOAN - com.yahoo.language.Language
-
Language tag "sm".
- SANGHO - com.yahoo.language.Language
-
Language tag "sg".
- SANSKRIT - com.yahoo.language.Language
-
Language tag "sa".
- SCOTS_GAELIC - com.yahoo.language.Language
-
Language tag "gd".
- segment(String, Language) - Method in interface com.yahoo.language.process.Segmenter
-
Split input-string into tokens, and returned a list of tokens in unprocessed form (i.e.
- segment(String, Language) - Method in class com.yahoo.language.process.SegmenterImpl
- Segmenter - Interface in com.yahoo.language.process
-
Interface providing segmentation, i.e.
- SEGMENTER - com.yahoo.language.Linguistics.Component
- SegmenterImpl - Class in com.yahoo.language.process
- SegmenterImpl(Tokenizer) - Constructor for class com.yahoo.language.process.SegmenterImpl
- SERBIAN - com.yahoo.language.Language
-
Language tag "sr".
- SERBO_CROATIAN - com.yahoo.language.Language
-
Language tag "s".
- SESOTHO - com.yahoo.language.Language
-
Language tag "st".
- set(int, String) - Method in class com.yahoo.language.process.StemList
- SETSWANA - com.yahoo.language.Language
-
Language tag "tn".
- SHAVIAN - com.yahoo.language.process.TokenScript
- SHONA - com.yahoo.language.Language
-
Language tag "sn".
- SHORTEST - com.yahoo.language.process.StemMode
- SICHUAN_YI - com.yahoo.language.Language
-
Language tag "ii".
- SINDHI - com.yahoo.language.Language
-
Language tag "sd".
- SINHALA - com.yahoo.language.process.TokenScript
- SINHALESE - com.yahoo.language.Language
-
Language tag "si".
- SISWATI - com.yahoo.language.Language
-
Language tag "ss".
- size() - Method in class com.yahoo.language.process.StemList
- SLOVAK - com.yahoo.language.Language
-
Language tag "sk".
- SLOVENIAN - com.yahoo.language.Language
-
Language tag "sl".
- SOMALI - com.yahoo.language.Language
-
Language tag "so".
- SPACE - com.yahoo.language.process.TokenType
- SPANISH - com.yahoo.language.Language
-
Language tag "es".
- split(String, int) - Method in class com.yahoo.language.process.GramSplitter
-
Splits the input into grams of size n and returns an iterator over grams represented as [start index,length] pairs into the input string.
- stem(String, StemMode, Language) - Method in interface com.yahoo.language.process.Stemmer
-
Stem input according to specified stemming mode.
- stem(String, StemMode, Language) - Method in class com.yahoo.language.process.StemmerImpl
- StemList - Class in com.yahoo.language.process
-
A list of strings which does not allow for duplicate elements.
- StemList() - Constructor for class com.yahoo.language.process.StemList
- StemList(String...) - Constructor for class com.yahoo.language.process.StemList
- Stemmer - Interface in com.yahoo.language.process
-
Interface providing stemming of single words.
- STEMMER - com.yahoo.language.Linguistics.Component
- StemmerImpl - Class in com.yahoo.language.process
- StemmerImpl(Tokenizer) - Constructor for class com.yahoo.language.process.StemmerImpl
- StemMode - Enum in com.yahoo.language.process
-
An enum of the stemming modes which can be requested.
- SUNDANESE - com.yahoo.language.Language
-
Language tag "su".
- SWAHILI - com.yahoo.language.Language
-
Language tag "sw".
- SWEDISH - com.yahoo.language.Language
-
Language tag "sv".
- SYLOTINAGRI - com.yahoo.language.process.TokenScript
- SYMBOL - com.yahoo.language.process.TokenType
- SYRIAC - com.yahoo.language.Language
-
Language tag "syr".
- SYRIAC - com.yahoo.language.process.TokenScript
T
- TAGALOG - com.yahoo.language.Language
-
Language tag "fil".
- TAGALOG - com.yahoo.language.process.TokenScript
- TAGBANWA - com.yahoo.language.process.TokenScript
- TAILE - com.yahoo.language.process.TokenScript
- TAILUE - com.yahoo.language.process.TokenScript
- TAJIK - com.yahoo.language.Language
-
Language tag "tg".
- TAMIL - com.yahoo.language.Language
-
Language tag "ta".
- TAMIL - com.yahoo.language.process.TokenScript
- TATAR - com.yahoo.language.Language
-
Language tag "tt".
- TELUGU - com.yahoo.language.Language
-
Language tag "te".
- TELUGU - com.yahoo.language.process.TokenScript
- THAANA - com.yahoo.language.process.TokenScript
- THAI - com.yahoo.language.Language
-
Language tag "th".
- THAI - com.yahoo.language.process.TokenScript
- TIBETAN - com.yahoo.language.Language
-
Language tag "bo".
- TIBETAN - com.yahoo.language.process.TokenScript
- TIFINAGH - com.yahoo.language.process.TokenScript
- TIGRINYA - com.yahoo.language.Language
-
Language tag "ti".
- toExtractedList() - Method in class com.yahoo.language.process.GramSplitter.GramSplitterIterator
-
Convenience list which splits the remaining items in this iterator into a list of gram strings
- Token - Interface in com.yahoo.language.process
-
A single token produced by the tokenizer.
- tokenize(String, Language, StemMode, boolean) - Method in interface com.yahoo.language.process.Tokenizer
-
Returns the tokens produced from an input string under the rules of the given Language and additional options
- Tokenizer - Interface in com.yahoo.language.process
-
Language-sensitive tokenization of a text string.
- TOKENIZER - com.yahoo.language.Linguistics.Component
- TokenScript - Enum in com.yahoo.language.process
-
List of token scripts (e.g.
- TokenType - Enum in com.yahoo.language.process
-
An enumeration of token types.
- toLowerCase(String) - Static method in class com.yahoo.language.LinguisticsCase
-
The lower casing method to use in Vespa when doing language independent processing of natural language data.
- TONGA - com.yahoo.language.Language
-
Language tag "to".
- Transformer - Interface in com.yahoo.language.process
-
Interface for providers of text transformations such as accent removal.
- TRANSFORMER - com.yahoo.language.Linguistics.Component
- TSONGA - com.yahoo.language.Language
-
Language tag "ts".
- TURKISH - com.yahoo.language.Language
-
Language tag "tr".
- TURKMEN - com.yahoo.language.Language
-
Language tag "tk".
- TWI - com.yahoo.language.Language
-
Language tag "tw".
U
- UGARITIC - com.yahoo.language.Language
-
Language tag "uga".
- UGARITIC - com.yahoo.language.process.TokenScript
- UIGHUR - com.yahoo.language.Language
-
Language tag "ug".
- UKRAINIAN - com.yahoo.language.Language
-
Language tag "uk".
- UNKNOWN - com.yahoo.language.Language
-
Language tag "un".
- UNKNOWN - com.yahoo.language.process.TokenScript
- UNKNOWN - com.yahoo.language.process.TokenType
- URDU - com.yahoo.language.Language
-
Language tag "ur".
- UZBEK - com.yahoo.language.Language
-
Language tag "uz".
V
- valueOf(int) - Static method in enum com.yahoo.language.process.TokenType
-
Translates this from the int code representation returned from
TokenType.getValue()
- valueOf(String) - Static method in enum com.yahoo.language.Language
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.yahoo.language.Linguistics.Component
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.yahoo.language.process.StemMode
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.yahoo.language.process.TokenScript
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum com.yahoo.language.process.TokenType
-
Returns the enum constant of this type with the specified name.
- values() - Static method in enum com.yahoo.language.Language
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.yahoo.language.Linguistics.Component
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.yahoo.language.process.StemMode
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.yahoo.language.process.TokenScript
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum com.yahoo.language.process.TokenType
-
Returns an array containing the constants of this enum type, in the order they are declared.
- VIETNAMESE - com.yahoo.language.Language
-
Language tag "vi".
- VIETNAMESE - com.yahoo.language.process.TokenScript
- VOLAPUK - com.yahoo.language.Language
-
Language tag "vo".
W
- WELSH - com.yahoo.language.Language
-
Language tag "cy".
- WOLOF - com.yahoo.language.Language
-
Language tag "wo".
X
Y
- YI - com.yahoo.language.process.TokenScript
- YIDDISH - com.yahoo.language.Language
-
Language tag "yi".
- YORUBA - com.yahoo.language.Language
-
Language tag "yo".
Z
- ZHUANG - com.yahoo.language.Language
-
Language tag "za".
- ZULU - com.yahoo.language.Language
-
Language tag "zu".
All Classes All Packages