- Static method in class smile.nlp.embedding.
- score - Variable in class smile.nlp.collocation.Bigram
-
The chi-square statistical score of the collocation.
- score(int, int, double, int, int, double, int, int, double, long, long) - Method in class smile.nlp.relevance.BM25
-
Returns a relevance score between a term and a document based on a corpus.
- score(double, long, long) - Method in class smile.nlp.relevance.BM25
-
Returns a relevance score between a term and a document based on a corpus.
- score(double, int, double, long, long) - Method in class smile.nlp.relevance.BM25
-
Returns a relevance score between a term and a document based on a corpus.
- score - Variable in class smile.nlp.relevance.Relevance
-
The relevance score.
- search(String) - Method in interface smile.nlp.Corpus
-
Returns an iterator over the set of documents containing the given term.
- search(RelevanceRanker, String) - Method in interface smile.nlp.Corpus
-
Returns an iterator over the set of documents containing the given term
in descending order of relevance.
- search(RelevanceRanker, String[]) - Method in interface smile.nlp.Corpus
-
Returns an iterator over the set of documents containing (at least one
of) the given terms in descending order of relevance.
- search(String) - Method in class smile.nlp.SimpleCorpus
-
- search(RelevanceRanker, String) - Method in class smile.nlp.SimpleCorpus
-
- search(RelevanceRanker, String[]) - Method in class smile.nlp.SimpleCorpus
-
- SentenceSplitter - Interface in smile.nlp.tokenizer
-
A sentence splitter segments text into sentences (a string of words
satisfying the grammatical rules of a language).
- setAnchor(String) - Method in interface smile.nlp.AnchorText
-
Sets the anchor text.
- setAnchor(String) - Method in class smile.nlp.SimpleText
-
Sets the anchor text.
- SimpleCorpus - Class in smile.nlp
-
An in-memory text corpus.
- SimpleCorpus() - Constructor for class smile.nlp.SimpleCorpus
-
Constructor.
- SimpleCorpus(SentenceSplitter, Tokenizer, StopWords, Punctuations) - Constructor for class smile.nlp.SimpleCorpus
-
Constructor.
- SimpleDictionary - Class in smile.nlp.dictionary
-
A simple implementation of dictionary interface.
- SimpleDictionary(String) - Constructor for class smile.nlp.dictionary.SimpleDictionary
-
Constructor.
- SimpleNormalizer - Class in smile.nlp.normalizer
-
A baseline normalizer for processing Unicode text.
- SimpleParagraphSplitter - Class in smile.nlp.tokenizer
-
This is a simple paragraph splitter.
- SimpleSentenceSplitter - Class in smile.nlp.tokenizer
-
This is a simple sentence splitter for English.
- SimpleText - Class in smile.nlp
-
A list-of-words representation of documents.
- SimpleText(String, String, String, String[]) - Constructor for class smile.nlp.SimpleText
-
Constructor.
- SimpleTokenizer - Class in smile.nlp.tokenizer
-
A word tokenizer that tokenizes English sentences with some differences from
TreebankWordTokenizer, notably on handling not-contractions.
- SimpleTokenizer() - Constructor for class smile.nlp.tokenizer.SimpleTokenizer
-
Constructor.
- SimpleTokenizer(boolean) - Constructor for class smile.nlp.tokenizer.SimpleTokenizer
-
Constructor.
- size() - Method in interface smile.nlp.Corpus
-
Returns the number of words in the corpus.
- size() - Method in interface smile.nlp.dictionary.Dictionary
-
Returns the number of elements in this dictionary.
- size() - Method in enum smile.nlp.dictionary.EnglishDictionary
-
- size() - Method in class smile.nlp.dictionary.EnglishPunctuations
-
- size() - Method in enum smile.nlp.dictionary.EnglishStopWords
-
- size() - Method in class smile.nlp.dictionary.SimpleDictionary
-
- size() - Method in class smile.nlp.SimpleCorpus
-
- size() - Method in class smile.nlp.SimpleText
-
- size() - Method in interface smile.nlp.TextTerms
-
Returns the number of words.
- size() - Method in class smile.nlp.Trie
-
Returns the number of entries.
- smile.nlp - package smile.nlp
-
Natural language processing.
- smile.nlp.collocation - package smile.nlp.collocation
-
Collocation finding algorithms.
- smile.nlp.dictionary - package smile.nlp.dictionary
-
Common dictionaries such as stop words, punctuation, common English words, etc.
- smile.nlp.embedding - package smile.nlp.embedding
-
Word embedding.
- smile.nlp.keyword - package smile.nlp.keyword
-
Keyword extraction.
- smile.nlp.normalizer - package smile.nlp.normalizer
-
Text normalization.
- smile.nlp.pos - package smile.nlp.pos
-
Part-of-speech taggers.
- smile.nlp.relevance - package smile.nlp.relevance
-
Term-document relevance ranking algorithms.
- smile.nlp.stemmer - package smile.nlp.stemmer
-
English word stemmer algorithms.
- smile.nlp.tokenizer - package smile.nlp.tokenizer
-
Sentence splitter and word tokenizer.
- split(String) - Method in class smile.nlp.tokenizer.BreakIteratorSentenceSplitter
-
- split(String) - Method in class smile.nlp.tokenizer.BreakIteratorTokenizer
-
- split(String) - Method in interface smile.nlp.tokenizer.ParagraphSplitter
-
Splits the text into paragraphs.
- split(String) - Method in class smile.nlp.tokenizer.PennTreebankTokenizer
-
- split(String) - Method in interface smile.nlp.tokenizer.SentenceSplitter
-
Splits the text into sentences.
- split(String) - Method in class smile.nlp.tokenizer.SimpleParagraphSplitter
-
- split(String) - Method in class smile.nlp.tokenizer.SimpleSentenceSplitter
-
- split(String) - Method in class smile.nlp.tokenizer.SimpleTokenizer
-
- split(String) - Method in interface smile.nlp.tokenizer.Tokenizer
-
Splits the string into a list of tokens.
- stem(String) - Method in class smile.nlp.stemmer.LancasterStemmer
-
- stem(String) - Method in class smile.nlp.stemmer.PorterStemmer
-
- stem(String) - Method in interface smile.nlp.stemmer.Stemmer
-
Transforms a word into its root form.
- Stemmer - Interface in smile.nlp.stemmer
-
A Stemmer transforms a word into its root form.
- StopWords - Interface in smile.nlp.dictionary
-
A set of stop words in some language.
- stripPluralParticiple(String) - Method in class smile.nlp.stemmer.PorterStemmer
-
Remove plurals and participles.