A B C D E F G H I K L M N O P Q R S T U V W _
All Classes All Packages
All Classes All Packages
All Classes All Packages
A
- ABBREVIATIONS - Static variable in class org.predict4all.nlp.language.french.FrenchLanguageUtils
- AbstractLanguageModel - Class in org.predict4all.nlp.language
- AbstractLanguageModel() - Constructor for class org.predict4all.nlp.language.AbstractLanguageModel
- AbstractNGramDictionary<T extends AbstractNGramTrieNode<T>> - Class in org.predict4all.nlp.ngram.dictionary
-
Represent an ngram dictionary in an abstract way : dictionary can be static or dynamic.
Each type of dictionary can or can't support operation, such as dictionary saving, or updating probabilities.
The dictionary has aAbstractNGramDictionary.maxOrder
that represents the max order gram that can be found in the dictionary. - AbstractNGramDictionary(T, int) - Constructor for class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
-
Construct a dictionary with a given root node and a max possible order.
- AbstractNGramTrieNode<T extends AbstractNGramTrieNode<?>> - Class in org.predict4all.nlp.ngram.trie
-
Represent a node in a trie structure to represent ngrams.
- AbstractNGramTrieNode() - Constructor for class org.predict4all.nlp.ngram.trie.AbstractNGramTrieNode
- AbstractPredictionToCompute - Class in org.predict4all.nlp.prediction.model
- AbstractPredictionToCompute() - Constructor for class org.predict4all.nlp.prediction.model.AbstractPredictionToCompute
- AbstractRecursiveMatcher - Class in org.predict4all.nlp.parser.matcher
- AbstractRecursiveMatcher(boolean, Separator, String) - Constructor for class org.predict4all.nlp.parser.matcher.AbstractRecursiveMatcher
- AbstractRecursiveMatcher(boolean, Separator, String, EquivalenceClass) - Constructor for class org.predict4all.nlp.parser.matcher.AbstractRecursiveMatcher
- AbstractTokenTrainingDocument - Class in org.predict4all.nlp.trainer.corpus
- AbstractTokenTrainingDocument(TrainingStep, File, File) - Constructor for class org.predict4all.nlp.trainer.corpus.AbstractTokenTrainingDocument
- AbstractTrainingDocument - Class in org.predict4all.nlp.trainer.corpus
- AbstractTrainingDocument(TrainingStep, String, File, File) - Constructor for class org.predict4all.nlp.trainer.corpus.AbstractTrainingDocument
- AbstractWord - Class in org.predict4all.nlp.words.model
- AbstractWord(int) - Constructor for class org.predict4all.nlp.words.model.AbstractWord
- ACCENTS - org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
- ACRONYM - org.predict4all.nlp.EquivalenceClass
- AcronymMatcher - Class in org.predict4all.nlp.language.french.matcher
- AcronymMatcher() - Constructor for class org.predict4all.nlp.language.french.matcher.AcronymMatcher
- add(T) - Method in class org.predict4all.nlp.utils.FifoSet
- ADD_LETTER - org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
- addAndReturnAdded(double) - Method in class org.predict4all.nlp.utils.SingleThreadDoubleAdder
- addChild(CorrectionRuleNode) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleNode
- addCorrectionsFor(String, Map<BiIntegerKey, NextWord>, boolean, TIntHashSet) - Method in class org.predict4all.nlp.words.correction.WordCorrectionGenerator
- addGeneralRules(List<CorrectionRuleBuilder>, PredictionParameter) - Static method in class org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator
- addPart(StringBuilder) - Method in class org.predict4all.nlp.words.correction.model.GeneratingCorrection
- addSeparator(Separator) - Method in class org.predict4all.nlp.words.correction.model.GeneratingCorrection
- addTo(Collection<CorrectionRuleBuilder>) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- addWordsToContext(Collection<String>) - Method in class org.predict4all.nlp.prediction.WordPredictor
- APOSTROPHE - org.predict4all.nlp.Separator
- ApostropheMatcher - Class in org.predict4all.nlp.language.french.matcher
- ApostropheMatcher() - Constructor for class org.predict4all.nlp.language.french.matcher.ApostropheMatcher
- append(Token) - Method in class org.predict4all.nlp.io.TokenFileOutputStream
- append(Token) - Method in interface org.predict4all.nlp.parser.TokenAppender
- append(Token) - Method in class org.predict4all.nlp.parser.TokenListAppender
- appendCorrectionToDebugInfo(String, Pair<StringBuilder, StringBuilder>, CorrectionRule) - Method in class org.predict4all.nlp.words.correction.model.GeneratingCorrection
- appendDebugInformationForCurrentPart(StringBuilder, Pair<StringBuilder, StringBuilder>, CorrectionRule) - Method in interface org.predict4all.nlp.words.correction.model.GeneratingCorrectionI
- appendDebugInformationForCurrentPart(StringBuilder, Pair<StringBuilder, StringBuilder>, CorrectionRule) - Method in class org.predict4all.nlp.words.correction.model.SimpleGeneratingCorrection
- appendToCurrentPart(CharSequence) - Method in interface org.predict4all.nlp.words.correction.model.GeneratingCorrectionI
- appendToCurrentPart(CharSequence) - Method in class org.predict4all.nlp.words.correction.model.SimpleGeneratingCorrection
- AZERTY_KEYBOARD - org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
B
- BACKSLASH - org.predict4all.nlp.Separator
- BaseWordDictionary - Interface in org.predict4all.nlp.language
-
A language specific dictionary : contains lower case words and their unigram frequencies.
- BiIntegerKey - Class in org.predict4all.nlp.utils
- build() - Method in class org.predict4all.nlp.parser.matcher.TokenRegexMatcher.TokenRegexMatcherBuilder
- build() - Method in class org.predict4all.nlp.trainer.DataTrainerResult.Builder
- BUILD_DATE - Static variable in class org.predict4all.nlp.Predict4AllInfo
- buildAndAddTo(PredictionParameter, Map<CorrectionRule, CorrectionRule>) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- buildAndAddTo(PredictionParameter, Map<CorrectionRule, CorrectionRule>, BiConsumer<CorrectionRule, CorrectionRule>) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- builder() - Static method in class org.predict4all.nlp.trainer.DataTrainerResult
-
Creates builder to build
DataTrainerResult
. - buildList(PredictionParameter) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
C
- calculateGrownCapacity() - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- call() - Method in class org.predict4all.nlp.trainer.TrainerTask
- capacity() - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
Returns the capacity of the hash table.
- capitalize(String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- capture(String) - Method in class org.predict4all.nlp.parser.matcher.TokenRegexMatcher.TokenRegexMatcherBuilder
- changeCurrentPartTo(StringBuilder) - Method in interface org.predict4all.nlp.words.correction.model.GeneratingCorrectionI
- changeCurrentPartTo(StringBuilder) - Method in class org.predict4all.nlp.words.correction.model.SimpleGeneratingCorrection
- checkChildrenLoading(FileChannel) - Method in class org.predict4all.nlp.ngram.trie.StaticNGramTrieNode
-
Check that children for this node are loaded.
If not, try to load children from the given fileChannel. - checkChildrenLoading(DynamicNGramTrieNode) - Method in class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
- checkChildrenLoading(StaticNGramTrieNode) - Method in class org.predict4all.nlp.ngram.dictionary.StaticNGramTrieDictionary
- checkChildrenLoading(T) - Method in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
-
To check that the children of a given node are loaded into memory (and can be used)
- checkNull(T, String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
-
Throws a
IllegalArgumentException
if a given object is null - children - Variable in class org.predict4all.nlp.ngram.trie.AbstractNGramTrieNode
-
Represent the children node for this node.
Each child is stored by its value (= word id) and represent the possible next value.
To save memory, the map is created on demand, so even if this node has children, the map can be null if children are not loaded yet. - childrenBackoffWeight - Variable in class org.predict4all.nlp.ngram.trie.AbstractNGramTrieNode
-
Backoff weight for this node children frequencies
- childrenPosition - Variable in class org.predict4all.nlp.ngram.trie.AbstractNGramTrieNode
-
Contains the children nodes position in file.
Position in aFileChannel
is a long type, but to save memory the value is stored as an int (trie file never contains more thanInteger.MAX_VALUE
byte) - clear() - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
Empties the collection.
- clear() - Method in class org.predict4all.nlp.utils.FifoSet
- clearContext() - Method in class org.predict4all.nlp.prediction.WordPredictor
- clearNextCache() - Method in interface org.predict4all.nlp.parser.token.Token
- clone() - Method in interface org.predict4all.nlp.words.correction.model.GeneratingCorrectionI
- clone() - Method in class org.predict4all.nlp.words.correction.model.SimpleGeneratingCorrection
- clone(int) - Method in class org.predict4all.nlp.words.model.EquivalenceClassWord
- clone(int) - Method in class org.predict4all.nlp.words.model.SimpleWord
- clone(int) - Method in class org.predict4all.nlp.words.model.TagWord
- clone(int) - Method in class org.predict4all.nlp.words.model.UserWord
- clone(int) - Method in interface org.predict4all.nlp.words.model.Word
- close() - Method in class org.predict4all.nlp.ngram.dictionary.StaticNGramTrieDictionary
- close() - Method in class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
- close() - Method in class org.predict4all.nlp.parser.TokenListAppender
- close() - Method in class org.predict4all.nlp.parser.TokenListProvider
- CLOSE_HOOK - org.predict4all.nlp.Separator
- COMMA - org.predict4all.nlp.Separator
- compact() - Method in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
-
Compact the nodes in this dictionary (this will call
AbstractNGramTrieNode.compact()
on root) - compact() - Method in class org.predict4all.nlp.ngram.trie.AbstractNGramTrieNode
-
compact the children of this node (if this node has children)
- compact() - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
Compresses the hashtable to the minimum prime size (as defined by PrimeFinder) that will hold all of the elements currently in the table.
- compact() - Method in class org.predict4all.nlp.words.WordDictionary
- compareTo(AbstractPredictionToCompute) - Method in class org.predict4all.nlp.prediction.model.AbstractPredictionToCompute
- computeD(TrainingConfiguration) - Method in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
-
Compute the optimal value for d (absolute discounting parameter).
Usually d is computed with formula :
D = C1 / (C1 + 2 * C2)
Where C1 = number of ngram with count == 1, and C2 = number of ngram with count == 2. - computeD(TrainingConfiguration) - Method in class org.predict4all.nlp.ngram.dictionary.DynamicNGramDictionary
- computeD(TrainingConfiguration) - Method in class org.predict4all.nlp.ngram.dictionary.StaticNGramTrieDictionary
- computeD(TrainingConfiguration) - Method in class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
- computeMaxSize(int) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
Computes the values of maxSize.
- computePrediction(WordDictionary) - Method in class org.predict4all.nlp.prediction.model.AbstractPredictionToCompute
- computePrediction(WordDictionary) - Method in class org.predict4all.nlp.prediction.model.DoublePredictionToCompute
- computePrediction(WordDictionary) - Method in class org.predict4all.nlp.prediction.model.UniquePredictionToCompute
- computeProbabilityForChildren(int, double[], boolean) - Method in class org.predict4all.nlp.ngram.trie.DynamicNGramTrieNode
-
Compute frequency for this node using absolute discounting.
Compute this node frequency, backoff weight, and then compute the frequency for node children. - consumeFreeSlot - Variable in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- contains(int) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
Searches the set for
val
- contains(T) - Method in class org.predict4all.nlp.utils.FifoSet
- containsKey(int) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- containsUpperCase(String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- containsValue(Object) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- containsWord(int) - Method in class org.predict4all.nlp.language.french.FrenchStopWordDictionary
- containsWord(int) - Method in interface org.predict4all.nlp.language.StopWordDictionary
- containsWord(String) - Method in interface org.predict4all.nlp.language.BaseWordDictionary
- containsWord(String) - Method in class org.predict4all.nlp.language.french.FrenchBaseWordDictionary
- convertWrittenYearToExactYear(int) - Static method in class org.predict4all.nlp.language.french.FrenchLanguageUtils
- CoOccurrenceKey - Class in org.predict4all.nlp.semantic
- CoOccurrenceKey(int, int) - Constructor for class org.predict4all.nlp.semantic.CoOccurrenceKey
- correction - Variable in class org.predict4all.nlp.prediction.model.AbstractPredictionToCompute
- CorrectionRule - Class in org.predict4all.nlp.words.correction.model
- CorrectionRuleBuilder - Class in org.predict4all.nlp.words.correction.model
- CorrectionRuleNode - Class in org.predict4all.nlp.words.correction.model
- CorrectionRuleNode(CorrectionRuleNode.CorrectionRuleNodeType) - Constructor for class org.predict4all.nlp.words.correction.model.CorrectionRuleNode
- CorrectionRuleNode.CorrectionRuleNodeType - Enum in org.predict4all.nlp.words.correction.model
- count - Variable in class org.predict4all.nlp.trainer.corpus.AbstractTrainingDocument
- COUNT_FORMAT - Static variable in class org.predict4all.nlp.words.WordDictionaryGenerator
- countEndUntilNextSeparator(String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- countNGram(int, int, AtomicInteger, AtomicInteger) - Method in class org.predict4all.nlp.ngram.trie.DynamicNGramTrieNode
-
Count the number of ngram on a order : count the total count (occurence count) and the unique count (difference ngram count)
- countNGrams() - Method in class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
- countOneAndTwoOccurenceNGrams(int, AtomicInteger[], AtomicInteger[]) - Method in class org.predict4all.nlp.ngram.trie.DynamicNGramTrieNode
-
Count the number of ngram with a count == 1 or == 2.
This ignore ngram containingTag.START
- countStartUntilNextSeparator(String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- create(int) - Static method in class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
-
Create an empty training ngram trie dictionary
- create(int, String) - Static method in class org.predict4all.nlp.words.model.SimpleWord
- create(int, String) - Static method in class org.predict4all.nlp.words.model.UserWord
- create(int, String, double, boolean, boolean, long, int) - Static method in class org.predict4all.nlp.words.model.UserWord
- create(String) - Static method in class org.predict4all.nlp.parser.token.WordToken
- create(String, EquivalenceClass) - Static method in class org.predict4all.nlp.parser.token.EquivalenceClassToken
- create(Separator) - Static method in class org.predict4all.nlp.parser.token.SeparatorToken
- create(Tag) - Static method in class org.predict4all.nlp.parser.token.TagToken
- createDouble(int, int, Separator, double, boolean, StringBuilder) - Static method in class org.predict4all.nlp.words.NextWord
- createMap(T...) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- createMatchedString(List<String>) - Method in class org.predict4all.nlp.language.french.matcher.AcronymMatcher
- createMatchedString(List<String>) - Method in class org.predict4all.nlp.language.french.matcher.HyphenMatcher
- createMatchedString(List<String>) - Method in class org.predict4all.nlp.parser.matcher.AbstractRecursiveMatcher
- createModified(int, String, boolean, boolean, double, boolean, boolean) - Static method in class org.predict4all.nlp.words.model.SimpleWord
- createPrefixFor(List<Token>, WordPrefixDetected, int, boolean) - Method in class org.predict4all.nlp.ngram.NGramWordPredictorUtils
-
Create the prefix for a given raw context (token list) : the context is meant to be used for ngram trie exploring.
The context takes care of using only the last sentence, to detect the current written word, and to retrieve a context of the wanted order. - createUnique(int, double, boolean, StringBuilder) - Static method in class org.predict4all.nlp.words.NextWord
- createWordDictionary(TrainingCorpus, Consumer<List<TrainerTask>>, File) - Method in class org.predict4all.nlp.words.WordDictionaryGenerator
- CURRENCY_EURO_SYMBOL - org.predict4all.nlp.Separator
- currentPartFinishedAndNewPartStarted(Separator, StringBuilder) - Method in interface org.predict4all.nlp.words.correction.model.GeneratingCorrectionI
- currentPartFinishedAndNewPartStarted(Separator, StringBuilder) - Method in class org.predict4all.nlp.words.correction.model.SimpleGeneratingCorrection
- CUSTOM - org.predict4all.nlp.EquivalenceClass
D
- DaemonThreadFactory - Class in org.predict4all.nlp.utils
- DaemonThreadFactory() - Constructor for class org.predict4all.nlp.utils.DaemonThreadFactory
- DataTrainer - Class in org.predict4all.nlp.trainer
-
Class to create prediction data to be used with a word predictor.
- DataTrainer(File, File, File, File, LanguageModel, TrainingConfiguration) - Constructor for class org.predict4all.nlp.trainer.DataTrainer
- DataTrainerResult - Class in org.predict4all.nlp.trainer
- DataTrainerResult.Builder - Class in org.predict4all.nlp.trainer
-
Builder to build
DataTrainerResult
. - DATE_DAY_MONTH - org.predict4all.nlp.EquivalenceClass
- DATE_FULL_DIGIT - org.predict4all.nlp.EquivalenceClass
- DATE_FULL_TEXT - org.predict4all.nlp.EquivalenceClass
- DATE_HOUR - org.predict4all.nlp.EquivalenceClass
- DATE_MONTH - org.predict4all.nlp.EquivalenceClass
- DATE_MONTH_YEAR - org.predict4all.nlp.EquivalenceClass
- DATE_WEEK_DAY - org.predict4all.nlp.EquivalenceClass
- DateDayMonthMatcher - Class in org.predict4all.nlp.language.french.matcher
- DateDayMonthMatcher() - Constructor for class org.predict4all.nlp.language.french.matcher.DateDayMonthMatcher
- DateFullDigitMatcher - Class in org.predict4all.nlp.language.french.matcher
- DateFullDigitMatcher() - Constructor for class org.predict4all.nlp.language.french.matcher.DateFullDigitMatcher
- DateFullTextMatcher - Class in org.predict4all.nlp.language.french.matcher
- DateFullTextMatcher() - Constructor for class org.predict4all.nlp.language.french.matcher.DateFullTextMatcher
- DateMonthYearMatcher - Class in org.predict4all.nlp.language.french.matcher
- DateMonthYearMatcher() - Constructor for class org.predict4all.nlp.language.french.matcher.DateMonthYearMatcher
- DateWeekDayMatcher - Class in org.predict4all.nlp.language.french.matcher
- DateWeekDayMatcher() - Constructor for class org.predict4all.nlp.language.french.matcher.DateWeekDayMatcher
- debug(WordDictionary, AbstractNGramTrieNode<?>) - Method in interface org.predict4all.nlp.ngram.debug.NGramDebugger
- debugInformation - Variable in class org.predict4all.nlp.prediction.model.AbstractPredictionToCompute
- DECIMAL - org.predict4all.nlp.EquivalenceClass
- defaultConfiguration() - Static method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- defaultConfiguration(File) - Static method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- DefaultCorrectionRuleGenerator - Class in org.predict4all.nlp.words.correction
- DefaultCorrectionRuleGenerator() - Constructor for class org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator
- DefaultCorrectionRuleGenerator.CorrectionRuleType - Enum in org.predict4all.nlp.words.correction
- DefaultCorrectionRuleGenerator.TranslationProvider - Interface in org.predict4all.nlp.words.correction
- DICTIONARY_INFORMATION_BYTE_COUNT - Static variable in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
-
Byte count needed to save general information about this dictionary.
- dispose() - Method in class org.predict4all.nlp.prediction.WordPredictor
- dispose() - Method in class org.predict4all.nlp.words.correction.WordCorrectionGenerator
- document - Variable in class org.predict4all.nlp.trainer.TrainerTask
- DOUBLE_LETTER - org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
- DoublePredictionToCompute - Class in org.predict4all.nlp.prediction.model
-
Represent the prediction for two word in a row.
Could have been generic (more than two, but for computing performance, limit combination to two word only) - DoublePredictionToCompute(int, int, boolean, int[], int[], double, boolean, StringBuilder) - Constructor for class org.predict4all.nlp.prediction.model.DoublePredictionToCompute
- duplicate() - Method in class org.predict4all.nlp.words.correction.model.GeneratingCorrection
- DYNAMIC_TRIE_NODE_SIZE_BYTE - Static variable in class org.predict4all.nlp.ngram.trie.AbstractNGramTrieNode
-
Dynamic node byte size (4 integer) Integer : word id, children size, children position, count
- DynamicNGramDictionary - Class in org.predict4all.nlp.ngram.dictionary
-
Represent a
TrainingNGramDictionary
that can also be opened to be trained again.
This type of dictionary is useful when using a dynamic user model : the dynamic user dictionary is loaded and trained during each session, and then saved to be used in the next sessions. - DynamicNGramDictionary(int) - Constructor for class org.predict4all.nlp.ngram.dictionary.DynamicNGramDictionary
- DynamicNGramTrieNode - Class in org.predict4all.nlp.ngram.trie
-
Represent a dynamic trie node structure : this trie node is useful when the ngram count has to be retrieved.
Dynamic trie node children are always fully loaded (they are not loaded on demand) and their frequencies can change.
Because dynamic trie node are used to be saved and loaded asStaticNGramTrieNode
orDynamicNGramTrieNode
they contains two write method :DynamicNGramTrieNode.writeStaticNode(FileChannel, int)
if they are saved to be loaded asStaticNGramTrieNode
andDynamicNGramTrieNode.writeDynamicNode(FileChannel, int)
if they are saved to be loaded asDynamicNGramTrieNode
: one save static information about the node (frequency, bow), the other only save dynamic information (count) because frequencies are dynamically computed. - DynamicNGramTrieNode() - Constructor for class org.predict4all.nlp.ngram.trie.DynamicNGramTrieNode
E
- encoding - Variable in class org.predict4all.nlp.trainer.corpus.AbstractTrainingDocument
- endCorrection(double) - Method in interface org.predict4all.nlp.words.correction.model.GeneratingCorrectionI
- endCorrection(double) - Method in class org.predict4all.nlp.words.correction.model.SimpleGeneratingCorrection
- endsWith(String, String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- EQUAL - org.predict4all.nlp.Separator
- equals(Object) - Method in class org.predict4all.nlp.ngram.NGramKey
- equals(Object) - Method in class org.predict4all.nlp.parser.token.SeparatorToken
- equals(Object) - Method in class org.predict4all.nlp.parser.token.WordToken
- equals(Object) - Method in class org.predict4all.nlp.semantic.CoOccurrenceKey
- equals(Object) - Method in class org.predict4all.nlp.utils.BiIntegerKey
- equals(Object) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRule
- EquivalenceClass - Enum in org.predict4all.nlp
-
Represent a equivalence class type that can be used when training a language model.
Useful to group same kind of element in a corpus under a same concept instead of textual data.
3 These are especially used in semantic data. - EquivalenceClassToken - Class in org.predict4all.nlp.parser.token
- EquivalenceClassWord - Class in org.predict4all.nlp.words.model
- EquivalenceClassWord(EquivalenceClass) - Constructor for class org.predict4all.nlp.words.model.EquivalenceClassWord
- EXCLAMATION - org.predict4all.nlp.Separator
- executeLSATrainingForR(TrainingCorpus, File, Consumer<List<? extends TrainerTask>>) - Method in class org.predict4all.nlp.semantic.SemanticDictionaryGenerator
- executeNGramTraining(TrainingCorpus, File, Consumer<List<TrainerTask>>) - Method in class org.predict4all.nlp.ngram.NGramDictionaryGenerator
- executeTermDetection(List<Token>) - Method in class org.predict4all.nlp.parser.matcher.TokenConverter
- executeTokenPatternMatching(TrainingCorpus) - Method in class org.predict4all.nlp.parser.matcher.TokenConverter
- executeWriteLevelOnRoot(FileChannel, int) - Method in class org.predict4all.nlp.ngram.dictionary.DynamicNGramDictionary
- executeWriteLevelOnRoot(FileChannel, int) - Method in class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
-
Call the correct node method to save a trie level to file.
- exploreChildren(int, int, BiConsumer<Integer, DynamicNGramTrieNode>) - Method in class org.predict4all.nlp.ngram.trie.DynamicNGramTrieNode
F
- factor - Variable in class org.predict4all.nlp.prediction.model.AbstractPredictionToCompute
- FifoSet<T> - Class in org.predict4all.nlp.utils
-
A set maintaining exactly
FifoSet.maxSize
or less but keeping there insertion order to always delete the first inserted element when set is full. - FifoSet(int) - Constructor for class org.predict4all.nlp.utils.FifoSet
- FILENAME_LSA_DICTIONARY - Static variable in class org.predict4all.nlp.trainer.DataTrainer
- FILENAME_NGRAM_DICTIONARY - Static variable in class org.predict4all.nlp.trainer.DataTrainer
- FILENAME_WORD_DICTIONARY - Static variable in class org.predict4all.nlp.trainer.DataTrainer
- forceInvalid - Variable in class org.predict4all.nlp.words.model.SimpleWord
- forceValid - Variable in class org.predict4all.nlp.words.model.SimpleWord
- forEach(TIntProcedure) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
Executes
procedure
for each element in the set. - forEachEntry(TIntObjectProcedure<? super V>) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- forEachKey(TIntProcedure) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- forEachValue(TObjectProcedure<? super V>) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- forEachValue(Consumer<? super V>) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- FOUR_DIGIT_FORMAT_ALWAYS - Static variable in class org.predict4all.nlp.language.french.FrenchLanguageUtils
- FREE - Static variable in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
flag indicating that a slot in the hashtable is available
- FrenchBaseWordDictionary - Class in org.predict4all.nlp.language.french
-
French dictionary based on Lexique.org
- FrenchBaseWordDictionary(String) - Constructor for class org.predict4all.nlp.language.french.FrenchBaseWordDictionary
- FrenchLanguageModel - Class in org.predict4all.nlp.language.french
- FrenchLanguageModel() - Constructor for class org.predict4all.nlp.language.french.FrenchLanguageModel
- FrenchLanguageUtils - Class in org.predict4all.nlp.language.french
-
Utils methods for french language.
- FrenchStopWordDictionary - Class in org.predict4all.nlp.language.french
- FrenchStopWordDictionary(String) - Constructor for class org.predict4all.nlp.language.french.FrenchStopWordDictionary
- frequency - Variable in class org.predict4all.nlp.ngram.trie.AbstractNGramTrieNode
-
Computed frequency for this node
- from(TrainingConfiguration) - Static method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- FULL - Static variable in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
flag indicating that a slot in the hashtable is occupied
G
- GE_GU_SOUND - org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
- generateNodeFor(PredictionParameter) - Method in enum org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
- GeneratingCorrection - Class in org.predict4all.nlp.words.correction.model
- GeneratingCorrection(String, boolean) - Constructor for class org.predict4all.nlp.words.correction.model.GeneratingCorrection
- GeneratingCorrectionI - Interface in org.predict4all.nlp.words.correction.model
- get(int) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- getAbbreviationOrRegex() - Static method in class org.predict4all.nlp.language.french.FrenchLanguageUtils
- getAllWords() - Method in class org.predict4all.nlp.words.WordDictionary
- getAppliedCorrections() - Method in class org.predict4all.nlp.words.correction.model.GeneratingCorrection
- getAverageVocabularySize() - Method in class org.predict4all.nlp.language.french.FrenchLanguageModel
- getAverageVocabularySize() - Method in interface org.predict4all.nlp.language.LanguageModel
-
Average total vocabulary size (different existing words)
- getAverageWordLength() - Method in class org.predict4all.nlp.language.french.FrenchLanguageModel
- getAverageWordLength() - Method in interface org.predict4all.nlp.language.LanguageModel
- getBaseWordDictionary(TrainingConfiguration) - Method in class org.predict4all.nlp.language.french.FrenchLanguageModel
- getBaseWordDictionary(TrainingConfiguration) - Method in interface org.predict4all.nlp.language.LanguageModel
- getBaseWordDictionaryPath() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getById(byte) - Static method in enum org.predict4all.nlp.Tag
- getChildren() - Method in class org.predict4all.nlp.ngram.trie.AbstractNGramTrieNode
- getChildren() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleNode
- getChildrenBackoffWeight() - Method in class org.predict4all.nlp.ngram.trie.AbstractNGramTrieNode
- getChildrenCountSum() - Method in class org.predict4all.nlp.ngram.trie.DynamicNGramTrieNode
- getChildrenSize() - Method in class org.predict4all.nlp.ngram.trie.AbstractNGramTrieNode
- getChildrenSize() - Method in class org.predict4all.nlp.ngram.trie.DynamicNGramTrieNode
- getChildrenSize() - Method in class org.predict4all.nlp.ngram.trie.StaticNGramTrieNode
- getConcurrencyLevel() - Method in class org.predict4all.nlp.trainer.corpus.TrainingCorpus
- getConfiguration() - Method in class org.predict4all.runtime.LanguageDataModelTrainerArgs
- getContextText() - Method in class org.predict4all.nlp.prediction.WordPredictor
- getConvertCaseFromDictionaryModelThreshold() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getCorpus() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getCorrectionDefaultCost() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- getCorrectionDefaultFactor() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- getCorrectionDoubleWordLeftMinFrequency() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- getCorrectionMaxCost() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- getCorrectionRule() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleNode
- getCorrectionRulesRoot() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- getCost() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRule
- getCost() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- getCount() - Method in class org.predict4all.nlp.ngram.trie.DynamicNGramTrieNode
- getCount() - Method in class org.predict4all.nlp.trainer.corpus.AbstractTrainingDocument
- getCount() - Method in class org.predict4all.nlp.utils.progressindicator.LoggingProgressIndicator
- getCount() - Method in class org.predict4all.nlp.utils.progressindicator.NoOpProgressIndicator
- getCount() - Method in interface org.predict4all.nlp.utils.progressindicator.ProgressIndicator
- getCount() - Method in class org.predict4all.nlp.words.model.AbstractWord
- getCount() - Method in class org.predict4all.nlp.words.model.UserWord
- getCount() - Method in interface org.predict4all.nlp.words.model.Word
- getCountMap() - Method in class org.predict4all.nlp.utils.FifoSet
- getCurrentPart() - Method in class org.predict4all.nlp.words.correction.model.GeneratingCorrection
- getCurrentPart() - Method in interface org.predict4all.nlp.words.correction.model.GeneratingCorrectionI
- getCurrentPart() - Method in class org.predict4all.nlp.words.correction.model.SimpleGeneratingCorrection
- getCurrentPartLength() - Method in interface org.predict4all.nlp.words.correction.model.GeneratingCorrectionI
- getCurrentPartLength() - Method in class org.predict4all.nlp.words.correction.model.SimpleGeneratingCorrection
- getCustomParameters() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- getDebugInfo() - Method in class org.predict4all.nlp.prediction.WordPrediction
- getDebugInfo() - Method in class org.predict4all.nlp.prediction.WordPredictionResult
- getDebugInformation() - Method in class org.predict4all.nlp.prediction.model.AbstractPredictionToCompute
- getDebugInformation() - Method in class org.predict4all.nlp.words.correction.model.GeneratingCorrection
- getDebugInformation() - Method in interface org.predict4all.nlp.words.correction.model.GeneratingCorrectionI
- getDebugInformation() - Method in class org.predict4all.nlp.words.correction.model.SimpleGeneratingCorrection
- getDebugInformation() - Method in class org.predict4all.nlp.words.NextWord
- getDebugPrefix() - Method in class org.predict4all.nlp.ngram.NGramDictionaryGenerator
- getDebugPrefix() - Method in class org.predict4all.nlp.trainer.DataTrainer
- getDensitiesMap() - Method in class org.predict4all.nlp.semantic.SemanticDictionary
- getDescriptionId() - Method in enum org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
- getDirectlyValidWordCountThreshold() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getDocuments(TrainingStep) - Method in class org.predict4all.nlp.trainer.corpus.TrainingCorpus
- getDynamicModelMinimumWeight() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- getDynamicNGramDictionary() - Method in class org.predict4all.nlp.prediction.WordPredictor
- getDynamicTrainingTime() - Method in class org.predict4all.nlp.prediction.WordPredictor
- getECById(byte) - Static method in enum org.predict4all.nlp.EquivalenceClass
- getEncoding() - Method in class org.predict4all.nlp.trainer.corpus.TrainingCorpus
- getEndFactor() - Method in class org.predict4all.nlp.words.correction.model.GeneratingCorrection
- getEndFactor() - Method in interface org.predict4all.nlp.words.correction.model.GeneratingCorrectionI
- getEndFactor() - Method in class org.predict4all.nlp.words.correction.model.SimpleGeneratingCorrection
- getEndPart(int) - Method in interface org.predict4all.nlp.words.correction.model.GeneratingCorrectionI
- getEndPart(int) - Method in class org.predict4all.nlp.words.correction.model.SimpleGeneratingCorrection
- getEndSeparator(int) - Method in interface org.predict4all.nlp.words.correction.model.GeneratingCorrectionI
- getEndSeparator(int) - Method in class org.predict4all.nlp.words.correction.model.SimpleGeneratingCorrection
- getEndUntilNextSeparator(String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- getEquivalenceClass() - Method in class org.predict4all.nlp.parser.token.EquivalenceClassToken
- getEquivalenceClass() - Method in interface org.predict4all.nlp.parser.token.Token
- getEquivalenceClass() - Method in class org.predict4all.nlp.words.model.AbstractWord
- getEquivalenceClass() - Method in interface org.predict4all.nlp.words.model.Word
- getEquivalenceClassId() - Method in class org.predict4all.nlp.words.model.AbstractWord
- getEquivalenceClassId() - Method in class org.predict4all.nlp.words.model.EquivalenceClassWord
- getEquivalenceClassId() - Method in interface org.predict4all.nlp.words.model.Word
- getError() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRule
- getErrors() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- getExactWordsWithPrefixExist(String) - Method in class org.predict4all.nlp.words.WordDictionary
- getExampleId() - Method in enum org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
- getExtractedTokenValues() - Method in class org.predict4all.nlp.parser.matcher.TokenRegexResult
- getExtractedValue(int) - Method in class org.predict4all.nlp.parser.matcher.TokenRegexResult
- getFactor() - Method in class org.predict4all.nlp.prediction.model.AbstractPredictionToCompute
- getFactor() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRule
- getFactor() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- getFactor() - Method in class org.predict4all.nlp.words.NextWord
- getFirstPrefix() - Method in class org.predict4all.nlp.prediction.model.DoublePredictionToCompute
- getFirstWordId() - Method in class org.predict4all.nlp.prediction.model.DoublePredictionToCompute
- getFormattedText() - Method in class org.predict4all.nlp.parser.matcher.PatternMatched
- getFrequency() - Method in class org.predict4all.nlp.ngram.trie.AbstractNGramTrieNode
- getFrequency(String) - Method in interface org.predict4all.nlp.language.BaseWordDictionary
- getFrequency(String) - Method in class org.predict4all.nlp.language.french.FrenchBaseWordDictionary
- getId() - Method in enum org.predict4all.nlp.EquivalenceClass
- getId() - Method in class org.predict4all.nlp.language.french.FrenchLanguageModel
- getId() - Method in interface org.predict4all.nlp.language.LanguageModel
- getId() - Method in enum org.predict4all.nlp.Separator
- getId() - Method in enum org.predict4all.nlp.Tag
- getID() - Method in class org.predict4all.nlp.words.model.AbstractWord
- getID() - Method in interface org.predict4all.nlp.words.model.Word
- getIdByte() - Method in enum org.predict4all.nlp.EquivalenceClass
- getIdByte() - Method in enum org.predict4all.nlp.Separator
- getIdByte() - Method in enum org.predict4all.nlp.Tag
- getIDGeneratorState() - Method in class org.predict4all.nlp.words.WordDictionary
- getInput() - Method in class org.predict4all.runtime.LanguageDataModelTrainerArgs
- getInputFile() - Method in class org.predict4all.nlp.trainer.corpus.AbstractTrainingDocument
- getKey() - Method in class org.predict4all.nlp.words.correction.model.GeneratingCorrection
- getKey() - Method in interface org.predict4all.nlp.words.correction.model.GeneratingCorrectionI
- getKey() - Method in class org.predict4all.nlp.words.correction.model.SimpleGeneratingCorrection
- getLanguage() - Method in class org.predict4all.runtime.LanguageDataModelTrainerArgs
- getLanguageModel() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- getLastMatchedToken() - Method in class org.predict4all.nlp.parser.matcher.TokenRegexResult
- getLeft() - Method in class org.predict4all.nlp.utils.Pair
- getLeft() - Method in class org.predict4all.nlp.utils.Triple
- getLongestMatchingWords(List<Token>, int, TIntHashSet) - Method in class org.predict4all.nlp.words.WordPrefixDetector
-
Try to detect if the end of the given sentence finish with a word already started.
This is much more precise than just checking if the last token is a separator, because a word could have separator inside (e.g. : "New York" has a space, "là-bas" has a hyphen). - getLongestWordPrefix() - Method in class org.predict4all.nlp.words.WordPrefixDetected
- getLsaDensitySize() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getLsaFrequentWordSize() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getLsaTargetSvdSize() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getLsaVocabularySize() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getLsaWindowSize() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getMax() - Method in class org.predict4all.nlp.utils.progressindicator.LoggingProgressIndicator
- getMax() - Method in class org.predict4all.nlp.utils.progressindicator.NoOpProgressIndicator
- getMax() - Method in interface org.predict4all.nlp.utils.progressindicator.ProgressIndicator
- getMaxIdValue() - Static method in enum org.predict4all.nlp.EquivalenceClass
- getMaxIndexFromEnd() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRule
- getMaxIndexFromEnd() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- getMaxIndexFromStart() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRule
- getMaxIndexFromStart() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- getMaxOrder() - Method in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
- getMiddle() - Method in class org.predict4all.nlp.utils.Triple
- getMinCountToProvideCorrection() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- getMinCountToProvidePrediction() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- getMinIndexFromEnd() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRule
- getMinIndexFromEnd() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- getMinIndexFromStart() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRule
- getMinIndexFromStart() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- getMinUseCountToValidateNewWord() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- getMonthOrRegex() - Static method in class org.predict4all.nlp.language.french.FrenchLanguageUtils
- getName() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleNode
- getNameId() - Method in enum org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
- getNext() - Method in class org.predict4all.nlp.io.TokenFileInputStream
- getNext() - Method in class org.predict4all.nlp.parser.matcher.TokenRegexMatcher
- getNext() - Method in class org.predict4all.nlp.parser.TokenListProvider
- getNext() - Method in interface org.predict4all.nlp.parser.TokenProvider
- getNext(TokenProvider) - Method in interface org.predict4all.nlp.parser.token.Token
- getNextCharCountToRemove() - Method in class org.predict4all.nlp.prediction.WordPredictionResult
- getNextWord(int[]) - Method in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
-
Return the immediate next words for a given prefix (without any filter)
- getNgramCounts() - Method in class org.predict4all.nlp.trainer.DataTrainerResult
- getNgramDebugAfterPruning() - Method in class org.predict4all.nlp.ngram.NGramDictionaryGenerator
- getNgramDebugBeforePruning() - Method in class org.predict4all.nlp.ngram.NGramDictionaryGenerator
- getNgramDictionary() - Method in class org.predict4all.runtime.LanguageDataModelTrainerArgs
- getNgramMaxLastTextLength() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- getNgramMaxLastTokenCountFactor() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- getNgramOrder() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getNgramPruningCountThreshold() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getNgramPruningOrderCountThresholds() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getNgramPruningWeightedDifferenceThreshold() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getNGramTag() - Method in class org.predict4all.nlp.words.model.AbstractWord
- getNGramTag() - Method in interface org.predict4all.nlp.words.model.Word
- getNGramTagId() - Method in class org.predict4all.nlp.words.model.AbstractWord
- getNGramTagId() - Method in class org.predict4all.nlp.words.model.TagWord
- getNGramTagId() - Method in interface org.predict4all.nlp.words.model.Word
- getNodeFor(int[], int, int) - Method in class org.predict4all.nlp.ngram.trie.DynamicNGramTrieNode
-
Retrieve a node for a given prefix (if exists)
- getNodeFor(FileChannel, int[], int) - Method in class org.predict4all.nlp.ngram.trie.StaticNGramTrieNode
-
Will try to retrieve a node for a given prefix.
Load needed node on demand while browsing the trie.
Children of the returned node are not loaded yet. - getNodeForPrefix(int[], int) - Method in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
-
Use to retrieve a node for a given prefix.
For example, for prefix = [1,2] will return the trie node corresponding to {2}.
The children of the given node may have not been loaded. - getNodeForPrefix(int[], int) - Method in class org.predict4all.nlp.ngram.dictionary.StaticNGramTrieDictionary
- getNodeForPrefix(int[], int) - Method in class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
- getOfficialChar() - Method in enum org.predict4all.nlp.Separator
- getOfficialCharString() - Method in enum org.predict4all.nlp.Separator
- getOrDefault(T, T) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- getOutputDirectory() - Method in class org.predict4all.nlp.trainer.corpus.TrainingCorpus
- getOutputDirectoryName() - Method in enum org.predict4all.nlp.trainer.step.TrainingStep
- getOutputFile() - Method in class org.predict4all.nlp.trainer.corpus.AbstractTrainingDocument
- getPart(int) - Method in class org.predict4all.nlp.words.correction.model.GeneratingCorrection
- getPartCount() - Method in class org.predict4all.nlp.words.correction.model.GeneratingCorrection
- getPartCount() - Method in interface org.predict4all.nlp.words.correction.model.GeneratingCorrectionI
- getPartCount() - Method in class org.predict4all.nlp.words.correction.model.SimpleGeneratingCorrection
- getPattern(String, int) - Static method in class org.predict4all.nlp.parser.matcher.TermMatcherUtils
- getPrediction() - Method in class org.predict4all.nlp.prediction.model.AbstractPredictionToCompute
- getPredictionParameter() - Method in class org.predict4all.nlp.prediction.WordPredictor
- getPredictions() - Method in class org.predict4all.nlp.prediction.WordPredictionResult
- getPredictionToDisplay() - Method in class org.predict4all.nlp.prediction.WordPrediction
- getPredictionToInsert() - Method in class org.predict4all.nlp.prediction.WordPrediction
- getPrefixSearchMaxTokenCount() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- getPreviousCharCountToRemove() - Method in class org.predict4all.nlp.prediction.WordPrediction
- getPreviousEndToken() - Method in class org.predict4all.nlp.parser.matcher.PatternMatched
- getPreviousStep() - Method in enum org.predict4all.nlp.trainer.step.TrainingStep
- getProbability(int[], int, int, int) - Method in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
-
Return the probability of a word for a given prefix.
Given index = 0 and length = prefix.length will return the maximum order probability (e.g. prefix.length = 3, will return probability for order 3) - getProbFactor() - Method in class org.predict4all.nlp.words.model.AbstractWord
- getProbFactor() - Method in class org.predict4all.nlp.words.model.SimpleWord
- getProbFactor() - Method in interface org.predict4all.nlp.words.model.Word
- getPruningMethod() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getRawProbability(int[], int, int, int) - Method in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
- getRegex() - Method in class org.predict4all.nlp.parser.matcher.TokenRegexMatcher
- getReplacement() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRule
- getReplacementLeftPart() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRule
- getReplacementRightPart() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRule
- getReplacements() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- getReplacementSeparator() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRule
- getReplacementSeparatorIndex() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRule
- getRight() - Method in class org.predict4all.nlp.utils.Pair
- getRight() - Method in class org.predict4all.nlp.utils.Triple
- getRoot() - Method in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
- getRootBlockSize() - Method in class org.predict4all.nlp.ngram.dictionary.DynamicNGramDictionary
- getRootBlockSize() - Method in class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
- getScore() - Method in class org.predict4all.nlp.prediction.model.AbstractPredictionToCompute
- getScore() - Method in class org.predict4all.nlp.prediction.WordPrediction
- getSecondPrefix() - Method in class org.predict4all.nlp.prediction.model.DoublePredictionToCompute
- getSecondWordId() - Method in class org.predict4all.nlp.prediction.model.DoublePredictionToCompute
- getSemanticContrastFactor() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- getSemanticContrastFactor() - Method in interface org.predict4all.nlp.semantic.SemanticDictionaryConfiguration
- getSemanticDensityMaxBound() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- getSemanticDensityMaxBound() - Method in interface org.predict4all.nlp.semantic.SemanticDictionaryConfiguration
- getSemanticDensityMinBound() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- getSemanticDensityMinBound() - Method in interface org.predict4all.nlp.semantic.SemanticDictionaryConfiguration
- getSemanticInputContextSize() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- getSeparator() - Method in class org.predict4all.nlp.parser.token.SeparatorToken
- getSeparator() - Method in interface org.predict4all.nlp.parser.token.Token
- getSeparator() - Method in class org.predict4all.nlp.words.NextWord
- getSeparator(int) - Method in class org.predict4all.nlp.words.correction.model.GeneratingCorrection
- getSeparatorById(byte) - Static method in enum org.predict4all.nlp.Separator
- getSeparatorFor(char) - Static method in enum org.predict4all.nlp.Separator
- getSet() - Method in class org.predict4all.nlp.utils.FifoSet
- getSimilarityCosineFor(Collection<Integer>, List<AbstractPredictionToCompute>, double) - Method in class org.predict4all.nlp.semantic.SemanticDictionary
- getSmoothingDiscountValue() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getSmoothingDiscountValueLowerBound() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getSmoothingDiscountValueUpperBound() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getSrcBuilder() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRule
- getStartUntilNextSeparator(String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- getStaticNgramDictionary() - Method in class org.predict4all.nlp.prediction.WordPredictor
- getStep() - Method in class org.predict4all.nlp.trainer.corpus.AbstractTrainingDocument
- getStep(File, TrainingCorpus) - Method in enum org.predict4all.nlp.trainer.step.TrainingStep
- getStopWordDictionary(TrainingConfiguration) - Method in class org.predict4all.nlp.language.french.FrenchLanguageModel
- getStopWordDictionary(TrainingConfiguration) - Method in interface org.predict4all.nlp.language.LanguageModel
- getStopWordDictionaryPath() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getTag() - Method in class org.predict4all.nlp.parser.token.TagToken
- getTag() - Method in interface org.predict4all.nlp.parser.token.Token
- getText() - Method in class org.predict4all.nlp.parser.token.SeparatorToken
- getText() - Method in interface org.predict4all.nlp.parser.token.Token
- getText() - Method in class org.predict4all.nlp.parser.token.WordToken
- getTextForType() - Method in interface org.predict4all.nlp.parser.token.Token
- getTokenCount() - Method in class org.predict4all.nlp.words.WordPrefixDetected
- getTokenMatchersForNGram() - Method in class org.predict4all.nlp.language.AbstractLanguageModel
- getTokenMatchersForNGram() - Method in class org.predict4all.nlp.language.french.FrenchLanguageModel
- getTokenMatchersForNGram() - Method in interface org.predict4all.nlp.language.LanguageModel
- getTokenMatchersForSemanticAnalysis() - Method in class org.predict4all.nlp.language.AbstractLanguageModel
- getTokenMatchersForSemanticAnalysis() - Method in class org.predict4all.nlp.language.french.FrenchLanguageModel
- getTokenMatchersForSemanticAnalysis() - Method in interface org.predict4all.nlp.language.LanguageModel
- getTotalCountFor(TrainingStep) - Method in class org.predict4all.nlp.trainer.corpus.TrainingCorpus
- getType() - Method in class org.predict4all.nlp.parser.matcher.PatternMatched
- getType() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleNode
- getType() - Method in class org.predict4all.nlp.words.model.EquivalenceClassWord
- getType() - Method in class org.predict4all.nlp.words.model.SimpleWord
- getType() - Method in class org.predict4all.nlp.words.model.TagWord
- getType() - Method in class org.predict4all.nlp.words.model.UserWord
- getType() - Method in interface org.predict4all.nlp.words.model.Word
- getUnknownWordCountThreshold() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getUpdateDate() - Method in class org.predict4all.nlp.words.model.AbstractWord
- getUpdateDate() - Method in class org.predict4all.nlp.words.model.UserWord
- getUpdateDate() - Method in interface org.predict4all.nlp.words.model.Word
- getUpperCaseReplacementThreshold() - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- getValidOneCharWords() - Method in class org.predict4all.nlp.language.AbstractLanguageModel
- getValidOneCharWords() - Method in class org.predict4all.nlp.language.french.FrenchLanguageModel
- getValidOneCharWords() - Method in interface org.predict4all.nlp.language.LanguageModel
- getValidWordForPredictionByPrefix(String, PredictionParameter, int, TIntHashSet) - Method in class org.predict4all.nlp.words.WordDictionary
-
Returns all the words that starts with a given prefix.
The returned list is not sorted. - getWeekDaysOrRegex() - Static method in class org.predict4all.nlp.language.french.FrenchLanguageUtils
- getWord() - Method in class org.predict4all.nlp.words.model.EquivalenceClassWord
- getWord() - Method in class org.predict4all.nlp.words.model.SimpleWord
- getWord() - Method in class org.predict4all.nlp.words.model.TagWord
- getWord() - Method in interface org.predict4all.nlp.words.model.Word
- getWord(int) - Method in class org.predict4all.nlp.words.WordDictionary
- getWord(String) - Method in class org.predict4all.nlp.words.WordDictionary
- getWordCorrectionGenerator() - Method in class org.predict4all.nlp.prediction.WordPredictor
- getWordDictionary() - Method in class org.predict4all.nlp.prediction.WordPredictor
- getWordDictionary() - Method in class org.predict4all.runtime.LanguageDataModelTrainerArgs
- getWordId() - Method in class org.predict4all.nlp.prediction.model.AbstractPredictionToCompute
- getWordId() - Method in class org.predict4all.nlp.prediction.model.DoublePredictionToCompute
- getWordId() - Method in class org.predict4all.nlp.prediction.model.UniquePredictionToCompute
- getWordId() - Method in class org.predict4all.nlp.prediction.WordPrediction
- getWordId(String) - Method in class org.predict4all.nlp.words.WordDictionary
- getWordId(WordDictionary) - Method in interface org.predict4all.nlp.parser.token.Token
- getWordId1() - Method in class org.predict4all.nlp.words.NextWord
- getWordId2() - Method in class org.predict4all.nlp.words.NextWord
- getWords() - Method in class org.predict4all.nlp.words.WordPrefixDetected
- getWordUsed() - Method in class org.predict4all.nlp.ngram.dictionary.DynamicNGramDictionary
- GSON - Static variable in class org.predict4all.nlp.prediction.PredictionParameter
H
- HASH - org.predict4all.nlp.Separator
- hashCode() - Method in class org.predict4all.nlp.ngram.NGramKey
- hashCode() - Method in class org.predict4all.nlp.parser.token.SeparatorToken
- hashCode() - Method in class org.predict4all.nlp.parser.token.WordToken
- hashCode() - Method in class org.predict4all.nlp.semantic.CoOccurrenceKey
- hashCode() - Method in class org.predict4all.nlp.utils.BiIntegerKey
- hashCode() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRule
- hasNext() - Method in interface org.predict4all.nlp.parser.StringProducer
- HEARING_CONFUSION - org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
- HOMOPHONE - org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
- HYPHEN - org.predict4all.nlp.Separator
- HyphenMatcher - Class in org.predict4all.nlp.language.french.matcher
-
Term matcher to match word sequence with hyphen between each word.
The sequence should start and end with hyphen, examples : a-t : valid a-t-elle : valid a-t-elle- : not valid -test- : not valid - HyphenMatcher() - Constructor for class org.predict4all.nlp.language.french.matcher.HyphenMatcher
I
- increment() - Method in class org.predict4all.nlp.utils.progressindicator.LoggingProgressIndicator
- increment() - Method in class org.predict4all.nlp.utils.progressindicator.NoOpProgressIndicator
- increment() - Method in interface org.predict4all.nlp.utils.progressindicator.ProgressIndicator
- increment() - Method in class org.predict4all.nlp.words.model.AbstractWord
- increment() - Method in class org.predict4all.nlp.words.model.UserWord
- increment() - Method in interface org.predict4all.nlp.words.model.Word
- incrementUserWord(int) - Method in class org.predict4all.nlp.words.WordDictionary
- index(int) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
Locates the index of
val
. - indexOfInCurrentPart(String, int) - Method in interface org.predict4all.nlp.words.correction.model.GeneratingCorrectionI
- indexOfInCurrentPart(String, int) - Method in class org.predict4all.nlp.words.correction.model.SimpleGeneratingCorrection
- INF_SUP - org.predict4all.nlp.Separator
- INFO_EXTENSION - Static variable in class org.predict4all.nlp.trainer.corpus.AbstractTrainingDocument
- initialize() - Method in interface org.predict4all.nlp.language.BaseWordDictionary
- initialize() - Method in class org.predict4all.nlp.language.french.FrenchBaseWordDictionary
- initialize(WordDictionary) - Method in class org.predict4all.nlp.language.french.FrenchStopWordDictionary
- initialize(WordDictionary) - Method in interface org.predict4all.nlp.language.StopWordDictionary
- initializeInformation() - Method in class org.predict4all.nlp.trainer.corpus.AbstractTokenTrainingDocument
- initializeInformation() - Method in class org.predict4all.nlp.trainer.corpus.AbstractTrainingDocument
- initializeInformation() - Method in class org.predict4all.nlp.trainer.step.ParserTrainingDocument
- initStep(TrainingStep) - Method in class org.predict4all.nlp.trainer.corpus.TrainingCorpus
- inputFile - Variable in class org.predict4all.nlp.trainer.corpus.AbstractTrainingDocument
- insertKey(int) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
Locates the index at which
val
can be inserted. if there is already a value equal()ingval
in the set, returns that value as a negative integer. - INSTANCE - Static variable in class org.predict4all.nlp.utils.progressindicator.NoOpProgressIndicator
- INTEGER - org.predict4all.nlp.EquivalenceClass
- isAddNewWordsEnabled() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- isBidirectionnal() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- isBlank(String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- isCapitalized(String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- isCapitalizedWord() - Method in class org.predict4all.nlp.words.WordPrefixDetected
- isCaptureValue() - Method in class org.predict4all.nlp.parser.matcher.TokenRegexMatcher
- isCorrection() - Method in class org.predict4all.nlp.prediction.model.AbstractPredictionToCompute
- isCorrection() - Method in class org.predict4all.nlp.prediction.WordPrediction
- isCorrection() - Method in class org.predict4all.nlp.words.NextWord
- isDouble() - Method in class org.predict4all.nlp.prediction.model.AbstractPredictionToCompute
- isDouble() - Method in class org.predict4all.nlp.prediction.model.DoublePredictionToCompute
- isDouble() - Method in class org.predict4all.nlp.prediction.model.UniquePredictionToCompute
- isDouble() - Method in class org.predict4all.nlp.words.NextWord
- isDynamicModelEnabled() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- isEmpty() - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
Tells whether this set is currently holding any elements.
- isEmpty(String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- isEnabled() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleNode
- isEnableDebugInformation() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- isEnableDistanceWordCorrection() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- isEnableWordCorrection() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- isEnableWordPredictorDebug() - Method in class org.predict4all.nlp.prediction.WordPredictor
- isEquivalenceClass() - Method in class org.predict4all.nlp.parser.token.EquivalenceClassToken
- isEquivalenceClass() - Method in interface org.predict4all.nlp.parser.token.Token
- isEquivalenceClass() - Method in class org.predict4all.nlp.words.model.AbstractWord
- isEquivalenceClass() - Method in interface org.predict4all.nlp.words.model.Word
- isExactWordWithPrefixExist(String) - Method in class org.predict4all.nlp.words.WordDictionary
- isForceInvalid() - Method in class org.predict4all.nlp.words.model.AbstractWord
- isForceInvalid() - Method in class org.predict4all.nlp.words.model.SimpleWord
- isForceInvalid() - Method in interface org.predict4all.nlp.words.model.Word
- isForceValid() - Method in class org.predict4all.nlp.words.model.AbstractWord
- isForceValid() - Method in class org.predict4all.nlp.words.model.SimpleWord
- isForceValid() - Method in interface org.predict4all.nlp.words.model.Word
- isFullUpperCase(String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- isInitialized() - Method in interface org.predict4all.nlp.language.BaseWordDictionary
- isInitialized() - Method in class org.predict4all.nlp.language.french.FrenchBaseWordDictionary
- isInitialized() - Method in class org.predict4all.nlp.language.french.FrenchStopWordDictionary
- isInitialized() - Method in interface org.predict4all.nlp.language.StopWordDictionary
- isInsertSpacePossible() - Method in class org.predict4all.nlp.prediction.WordPrediction
- isModifiedBySystem() - Method in class org.predict4all.nlp.words.model.AbstractWord
- isModifiedBySystem() - Method in class org.predict4all.nlp.words.model.SimpleWord
- isModifiedBySystem() - Method in interface org.predict4all.nlp.words.model.Word
- isModifiedByUser() - Method in class org.predict4all.nlp.words.model.AbstractWord
- isModifiedByUser() - Method in class org.predict4all.nlp.words.model.SimpleWord
- isModifiedByUser() - Method in interface org.predict4all.nlp.words.model.Word
- isModifiedByUserOrSystem() - Method in class org.predict4all.nlp.words.model.AbstractWord
- isModifiedByUserOrSystem() - Method in class org.predict4all.nlp.words.model.SimpleWord
- isModifiedByUserOrSystem() - Method in interface org.predict4all.nlp.words.model.Word
- isNextWordsCapitalized(List<Token>, String, int) - Method in class org.predict4all.nlp.words.WordPrefixDetector
- isNGramTag() - Method in class org.predict4all.nlp.words.model.AbstractWord
- isNGramTag() - Method in class org.predict4all.nlp.words.model.TagWord
- isNGramTag() - Method in interface org.predict4all.nlp.words.model.Word
- isNotBlank(String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- isNotEmpty(Collection<?>) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- isOptional() - Method in class org.predict4all.nlp.parser.matcher.TokenRegexMatcher
- isOverwrite() - Method in class org.predict4all.runtime.LanguageDataModelTrainerArgs
- isPredictionInitialized() - Method in class org.predict4all.nlp.prediction.model.AbstractPredictionToCompute
- isSemanticModelEnabled() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- isSentenceSeparator() - Method in enum org.predict4all.nlp.Separator
- isSeparator() - Method in class org.predict4all.nlp.parser.token.SeparatorToken
- isSeparator() - Method in interface org.predict4all.nlp.parser.token.Token
- isSpaceBetween() - Method in class org.predict4all.nlp.prediction.model.DoublePredictionToCompute
- isTag() - Method in class org.predict4all.nlp.parser.token.TagToken
- isTag() - Method in interface org.predict4all.nlp.parser.token.Token
- isTokenValidToCreateUserWord(Token) - Method in class org.predict4all.nlp.words.WordDictionary
- isTwoWordPredictionEnabled() - Method in class org.predict4all.nlp.prediction.PredictionParameter
- isUnique() - Method in class org.predict4all.nlp.words.NextWord
- isUserWord() - Method in class org.predict4all.nlp.words.model.AbstractWord
- isUserWord() - Method in class org.predict4all.nlp.words.model.UserWord
- isUserWord() - Method in interface org.predict4all.nlp.words.model.Word
- isValidForSaving() - Method in class org.predict4all.nlp.words.model.AbstractWord
- isValidForSaving() - Method in class org.predict4all.nlp.words.model.EquivalenceClassWord
- isValidForSaving() - Method in class org.predict4all.nlp.words.model.TagWord
- isValidForSaving() - Method in interface org.predict4all.nlp.words.model.Word
- isValidToBePredicted(PredictionParameter) - Method in class org.predict4all.nlp.words.model.AbstractWord
- isValidToBePredicted(PredictionParameter) - Method in class org.predict4all.nlp.words.model.EquivalenceClassWord
- isValidToBePredicted(PredictionParameter) - Method in class org.predict4all.nlp.words.model.SimpleWord
- isValidToBePredicted(PredictionParameter) - Method in class org.predict4all.nlp.words.model.TagWord
- isValidToBePredicted(PredictionParameter) - Method in class org.predict4all.nlp.words.model.UserWord
- isValidToBePredicted(PredictionParameter) - Method in interface org.predict4all.nlp.words.model.Word
- isValidToPredictionContext(PredictionParameter) - Method in class org.predict4all.nlp.words.model.AbstractWord
- isValidToPredictionContext(PredictionParameter) - Method in class org.predict4all.nlp.words.model.TagWord
- isValidToPredictionContext(PredictionParameter) - Method in class org.predict4all.nlp.words.model.UserWord
- isValidToPredictionContext(PredictionParameter) - Method in interface org.predict4all.nlp.words.model.Word
- isWord() - Method in class org.predict4all.nlp.parser.token.EquivalenceClassToken
- isWord() - Method in interface org.predict4all.nlp.parser.token.Token
- isWord() - Method in class org.predict4all.nlp.parser.token.WordToken
- isWordExists(String) - Method in class org.predict4all.nlp.words.WordDictionary
K
- keys() - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- keys(int[]) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
L
- LanguageDataModelTrainer - Class in org.predict4all.runtime
- LanguageDataModelTrainer() - Constructor for class org.predict4all.runtime.LanguageDataModelTrainer
- LanguageDataModelTrainerArgs - Class in org.predict4all.runtime
- LanguageDataModelTrainerArgs() - Constructor for class org.predict4all.runtime.LanguageDataModelTrainerArgs
- LanguageModel - Interface in org.predict4all.nlp.language
-
Represent a model specific to the input language.
This model is useful to better perform on NLP task by using specific parameters from a language.
E.G. - launchLSATraining(TrainingStep) - Method in class org.predict4all.nlp.trainer.DataTrainer
- launchNGramTraining(TrainingStep) - Method in class org.predict4all.nlp.trainer.DataTrainer
- LBRACKET - org.predict4all.nlp.Separator
- LEAF - org.predict4all.nlp.words.correction.model.CorrectionRuleNode.CorrectionRuleNodeType
- length(String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- listNextWords(int[], WordDictionary, PredictionParameter, TIntHashSet, Map<BiIntegerKey, NextWord>, int, boolean) - Method in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
-
Will go through each ngram dictionary order to find the next possible words for a given prefix
Will first go through the highest order for the given prefix (e.g. prefix length == 3 = order is 4), and if the wantedCount is not reached, will go to the lower order to find new next possible. - listTrieLeaves(int[], int, int, int, BiConsumer<int[], Integer>) - Method in class org.predict4all.nlp.ngram.trie.DynamicNGramTrieNode
-
Detect each unique trie leaves for a wanted order, and then call the found callback with the found prefix and word id
- load(File) - Static method in class org.predict4all.nlp.ngram.dictionary.DynamicNGramDictionary
-
Create and open a existing dynamic ngram dictionary.
- loadDictionary(File, SemanticDictionaryConfiguration) - Static method in class org.predict4all.nlp.semantic.SemanticDictionary
- loadDictionary(LanguageModel, File) - Static method in class org.predict4all.nlp.words.WordDictionary
- loadFrom(File, File) - Static method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- loadFrom(LanguageModel, File) - Static method in class org.predict4all.nlp.prediction.PredictionParameter
- loadUserDictionary(File) - Method in class org.predict4all.nlp.words.WordDictionary
- LoggingProgressIndicator - Class in org.predict4all.nlp.utils.progressindicator
- LoggingProgressIndicator(String, long) - Constructor for class org.predict4all.nlp.utils.progressindicator.LoggingProgressIndicator
- lowerCase(String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
M
- M_FRONT_MBP - org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
- main(String[]) - Static method in class org.predict4all.runtime.LanguageDataModelTrainer
- match(Token, TokenProvider) - Method in class org.predict4all.nlp.language.french.matcher.ApostropheMatcher
- match(Token, TokenProvider) - Method in class org.predict4all.nlp.language.french.matcher.DateDayMonthMatcher
- match(Token, TokenProvider) - Method in class org.predict4all.nlp.language.french.matcher.DateFullDigitMatcher
- match(Token, TokenProvider) - Method in class org.predict4all.nlp.language.french.matcher.DateFullTextMatcher
- match(Token, TokenProvider) - Method in class org.predict4all.nlp.language.french.matcher.DateMonthYearMatcher
- match(Token, TokenProvider) - Method in class org.predict4all.nlp.language.french.matcher.DateWeekDayMatcher
- match(Token, TokenProvider) - Method in class org.predict4all.nlp.language.french.matcher.NumberDecimalMatcher
- match(Token, TokenProvider) - Method in class org.predict4all.nlp.language.french.matcher.NumberIntMatcher
- match(Token, TokenProvider) - Method in class org.predict4all.nlp.language.french.matcher.PercentMatcher
- match(Token, TokenProvider) - Method in class org.predict4all.nlp.language.french.matcher.ProperNameMatcher
- match(Token, TokenProvider) - Method in class org.predict4all.nlp.language.french.matcher.SpecialWordMatcher
- match(Token, TokenProvider) - Method in class org.predict4all.nlp.parser.matcher.AbstractRecursiveMatcher
- match(Token, TokenProvider) - Method in interface org.predict4all.nlp.parser.matcher.TokenMatcher
- MATCHERS_NGRAM_FR - Static variable in class org.predict4all.nlp.language.french.FrenchLanguageModel
- MATCHERS_SEMANTIC_ANALYSIS_FR - Static variable in class org.predict4all.nlp.language.french.FrenchLanguageModel
- matchRegexPattern(Token, TokenRegexMatcher, TokenProvider, int) - Static method in class org.predict4all.nlp.parser.matcher.TermMatcherUtils
- maxOrder - Variable in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
-
Max order possible to store in this dictionary.
Could be retrieved by opening the dictionary, or set by user as a limit. - MISC - org.predict4all.nlp.EquivalenceClass
- modifiedBySystem - Variable in class org.predict4all.nlp.words.model.SimpleWord
- modifiedByUser - Variable in class org.predict4all.nlp.words.model.SimpleWord
- MONEY_AMOUNT - org.predict4all.nlp.EquivalenceClass
- MONTHS - Static variable in class org.predict4all.nlp.language.french.FrenchLanguageUtils
N
- NEWLINE - org.predict4all.nlp.Separator
- newThread(Runnable) - Method in class org.predict4all.nlp.utils.DaemonThreadFactory
- next() - Method in interface org.predict4all.nlp.parser.StringProducer
- NextWord - Class in org.predict4all.nlp.words
- NGRAM_COUNT_FORMAT - Static variable in class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
- NGRAM_DICTIONARY - org.predict4all.nlp.trainer.step.TrainingStep
- NGramDebugger - Interface in org.predict4all.nlp.ngram.debug
-
This interface can be used to check an ngram dictionary while training models.
- NGramDictionaryGenerator - Class in org.predict4all.nlp.ngram
-
Use this generator to train an ngram model.
It will load texts from aTrainingCorpus
and generate a ngram file that could be later opened with aStaticNGramTrieDictionary
- NGramDictionaryGenerator(LanguageModel, TrainingConfiguration, WordDictionary) - Constructor for class org.predict4all.nlp.ngram.NGramDictionaryGenerator
- NGramKey - Class in org.predict4all.nlp.ngram
- NGramPruningMethod - Enum in org.predict4all.nlp.trainer.configuration
- NGramTrainingDocument - Class in org.predict4all.nlp.trainer.step
- NGramTrainingDocument(File, File) - Constructor for class org.predict4all.nlp.trainer.step.NGramTrainingDocument
- NGramWordPredictorUtils - Class in org.predict4all.nlp.ngram
-
Utils class useful when predicting words with an ngram dictionaries.
- NGramWordPredictorUtils(WordDictionary, PredictionParameter) - Constructor for class org.predict4all.nlp.ngram.NGramWordPredictorUtils
- NODE - org.predict4all.nlp.words.correction.model.CorrectionRuleNode.CorrectionRuleNodeType
- NONE - org.predict4all.nlp.trainer.configuration.NGramPruningMethod
- NoOpProgressIndicator - Class in org.predict4all.nlp.utils.progressindicator
- NoOpProgressIndicator() - Constructor for class org.predict4all.nlp.utils.progressindicator.NoOpProgressIndicator
- normalizeRow(double[]) - Static method in class org.predict4all.nlp.semantic.SemanticDictionary
- NumberDecimalMatcher - Class in org.predict4all.nlp.language.french.matcher
- NumberDecimalMatcher() - Constructor for class org.predict4all.nlp.language.french.matcher.NumberDecimalMatcher
- NumberIntMatcher - Class in org.predict4all.nlp.language.french.matcher
- NumberIntMatcher() - Constructor for class org.predict4all.nlp.language.french.matcher.NumberIntMatcher
O
- of(int) - Static method in class org.predict4all.nlp.utils.BiIntegerKey
- of(int, int) - Static method in class org.predict4all.nlp.utils.BiIntegerKey
- of(K, T) - Static method in class org.predict4all.nlp.utils.Pair
- of(K, T, V) - Static method in class org.predict4all.nlp.utils.Triple
- oneInstanceCount - Static variable in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- open(File) - Static method in class org.predict4all.nlp.ngram.dictionary.StaticNGramTrieDictionary
-
Create a static ngram dictionary from a given file.
- OPEN_HOOK - org.predict4all.nlp.Separator
- openDictionary(File) - Method in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
-
Open a dictionary from a file.
To use the dictionary, the sameWordDictionary
used to save it should be used. - openDictionary(File) - Method in class org.predict4all.nlp.ngram.dictionary.DynamicNGramDictionary
- openDictionary(File) - Method in class org.predict4all.nlp.ngram.dictionary.StaticNGramTrieDictionary
- openDictionary(File) - Method in class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
- opposite() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRule
- optional(String) - Method in class org.predict4all.nlp.parser.matcher.TokenRegexMatcher.TokenRegexMatcherBuilder
- optional(Separator) - Method in class org.predict4all.nlp.parser.matcher.TokenRegexMatcher.TokenRegexMatcherBuilder
- or(String...) - Method in class org.predict4all.nlp.parser.matcher.TokenRegexMatcher.TokenRegexMatcherBuilder
- or(Separator...) - Method in class org.predict4all.nlp.parser.matcher.TokenRegexMatcher.TokenRegexMatcherBuilder
- or(Separator, String) - Method in class org.predict4all.nlp.parser.matcher.TokenRegexMatcher.TokenRegexMatcherBuilder
- ORDER_COUNT - org.predict4all.nlp.trainer.configuration.NGramPruningMethod
- org.predict4all.nlp - package org.predict4all.nlp
- org.predict4all.nlp.exception - package org.predict4all.nlp.exception
- org.predict4all.nlp.io - package org.predict4all.nlp.io
- org.predict4all.nlp.language - package org.predict4all.nlp.language
- org.predict4all.nlp.language.french - package org.predict4all.nlp.language.french
- org.predict4all.nlp.language.french.matcher - package org.predict4all.nlp.language.french.matcher
- org.predict4all.nlp.ngram - package org.predict4all.nlp.ngram
- org.predict4all.nlp.ngram.debug - package org.predict4all.nlp.ngram.debug
- org.predict4all.nlp.ngram.dictionary - package org.predict4all.nlp.ngram.dictionary
- org.predict4all.nlp.ngram.trie - package org.predict4all.nlp.ngram.trie
- org.predict4all.nlp.ngram.trie.map - package org.predict4all.nlp.ngram.trie.map
- org.predict4all.nlp.parser - package org.predict4all.nlp.parser
- org.predict4all.nlp.parser.matcher - package org.predict4all.nlp.parser.matcher
- org.predict4all.nlp.parser.token - package org.predict4all.nlp.parser.token
- org.predict4all.nlp.prediction - package org.predict4all.nlp.prediction
- org.predict4all.nlp.prediction.model - package org.predict4all.nlp.prediction.model
- org.predict4all.nlp.semantic - package org.predict4all.nlp.semantic
-
SEMANTIC IS STILL NOT STABLE AND WIDELY TESTED.
- org.predict4all.nlp.trainer - package org.predict4all.nlp.trainer
- org.predict4all.nlp.trainer.configuration - package org.predict4all.nlp.trainer.configuration
- org.predict4all.nlp.trainer.corpus - package org.predict4all.nlp.trainer.corpus
- org.predict4all.nlp.trainer.step - package org.predict4all.nlp.trainer.step
- org.predict4all.nlp.utils - package org.predict4all.nlp.utils
- org.predict4all.nlp.utils.progressindicator - package org.predict4all.nlp.utils.progressindicator
- org.predict4all.nlp.words - package org.predict4all.nlp.words
- org.predict4all.nlp.words.correction - package org.predict4all.nlp.words.correction
- org.predict4all.nlp.words.correction.model - package org.predict4all.nlp.words.correction.model
- org.predict4all.nlp.words.model - package org.predict4all.nlp.words.model
- org.predict4all.runtime - package org.predict4all.runtime
- OTHER - org.predict4all.nlp.Separator
- OUTPUT_EXTENSION - Static variable in class org.predict4all.nlp.trainer.corpus.AbstractTrainingDocument
- outputFile - Variable in class org.predict4all.nlp.trainer.corpus.AbstractTrainingDocument
P
- Pair<K,T> - Class in org.predict4all.nlp.utils
- PARSER - org.predict4all.nlp.trainer.step.TrainingStep
- ParserTrainingDocument - Class in org.predict4all.nlp.trainer.step
- ParserTrainingDocument(String, File, File) - Constructor for class org.predict4all.nlp.trainer.step.ParserTrainingDocument
- PatternMatched - Class in org.predict4all.nlp.parser.matcher
- PatternMatched(String, Token) - Constructor for class org.predict4all.nlp.parser.matcher.PatternMatched
- PatternMatched(EquivalenceClass, String, Token) - Constructor for class org.predict4all.nlp.parser.matcher.PatternMatched
- PERCENT - org.predict4all.nlp.EquivalenceClass
- PERCENT - org.predict4all.nlp.Separator
- PERCENT_FORMAT - Static variable in class org.predict4all.nlp.trainer.DataTrainer
- PERCENT_FORMAT - Static variable in class org.predict4all.nlp.words.WordDictionaryGenerator
- PercentMatcher - Class in org.predict4all.nlp.language.french.matcher
- PercentMatcher() - Constructor for class org.predict4all.nlp.language.french.matcher.PercentMatcher
- PHONEM_CONFUSION_SET - org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
- PIPE - org.predict4all.nlp.Separator
- POINT - org.predict4all.nlp.Separator
- postInsertHook(boolean) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
After an insert, this hook is called to adjust the size/free values of the set and to perform rehashing if necessary.
- predict(String, String, int) - Method in class org.predict4all.nlp.prediction.WordPredictor
- predict(String, String, int, TIntHashSet) - Method in class org.predict4all.nlp.prediction.WordPredictor
- Predict4AllInfo - Class in org.predict4all.nlp
-
This retrieves information about the library (version and build date).
This should mostly be used to ensure consistency on saved data (i.e. save and load data from same versions) - Predict4AllInfo() - Constructor for class org.predict4all.nlp.Predict4AllInfo
- Predict4AllUtils - Class in org.predict4all.nlp.utils
-
Contains different utils methods that are used in NLP taks.
- prediction - Variable in class org.predict4all.nlp.prediction.model.AbstractPredictionToCompute
- PredictionParameter - Class in org.predict4all.nlp.prediction
- PredictionParameter(LanguageModel) - Constructor for class org.predict4all.nlp.prediction.PredictionParameter
- probFactor - Variable in class org.predict4all.nlp.words.model.SimpleWord
- progressIndicator - Variable in class org.predict4all.nlp.trainer.TrainerTask
- ProgressIndicator - Interface in org.predict4all.nlp.utils.progressindicator
- PROPER_NAME - org.predict4all.nlp.EquivalenceClass
- ProperNameMatcher - Class in org.predict4all.nlp.language.french.matcher
- ProperNameMatcher() - Constructor for class org.predict4all.nlp.language.french.matcher.ProperNameMatcher
- pruneNGramsCount(int, TrainingConfiguration) - Method in class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
- pruneNGramsOrderCount(int[], TrainingConfiguration) - Method in class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
- pruneNGramsWeightedDifference(double, TrainingConfiguration, NGramPruningMethod) - Method in class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
-
Execute a pruning on the dictionary.
Pruning is implemented with a "weighted difference" algorithm : difference is computed between high order model and a lower order model (e.g. difference between 4-gram - 3gram, then 3-gram - 2-gram) and if the difference is bellow a certain level (threshold), the high order model is deleted.
Difference pruning is executed for max order to bigram level, probabilities are computed again after the pruning. - pruningCountingNGram(int, int, int) - Method in class org.predict4all.nlp.ngram.trie.DynamicNGramTrieNode
- put(int, V) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- putAll(Map<? extends Integer, ? extends V>) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- putAndIncrementBy(int[], int) - Method in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
-
Add a given ngram to the dictionary and to increment its count.
If the ngram is already in the dictionary, will just increment its count.
This will callAbstractNGramDictionary.putAndIncrementBy(int[], int, int)
with a index = 0 - putAndIncrementBy(int[], int) - Method in class org.predict4all.nlp.ngram.dictionary.StaticNGramTrieDictionary
- putAndIncrementBy(int[], int) - Method in class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
- putAndIncrementBy(int[], int, int) - Method in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
-
Add a given ngram to the dictionary and to increment its count.
If the ngram is already in the dictionary, will just increment its count. - putAndIncrementBy(int[], int, int) - Method in class org.predict4all.nlp.ngram.dictionary.StaticNGramTrieDictionary
- putAndIncrementBy(int[], int, int) - Method in class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
- putAndIncrementBy(int[], int, int) - Method in class org.predict4all.nlp.ngram.trie.DynamicNGramTrieNode
-
Put a ngram into this trie structure, create node if needed, and increment the existing one.
- putIfAbsent(int, V) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- putUserWord(String) - Method in class org.predict4all.nlp.words.WordDictionary
- putUserWord(Token) - Method in class org.predict4all.nlp.words.WordDictionary
- putWordTraining(String) - Method in class org.predict4all.nlp.words.WordDictionary
Q
R
- RAW_COUNT - org.predict4all.nlp.trainer.configuration.NGramPruningMethod
- RBRACKET - org.predict4all.nlp.Separator
- readAllChildren(FileChannel, int) - Method in class org.predict4all.nlp.ngram.trie.DynamicNGramTrieNode
-
Read all children from a given file channel, then load recursively all the children.
- readDictionaryInformation(ByteBuffer) - Method in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
-
Read the general information for this dictionary from a given buffer (doesn't do any check)
- readFrom(String) - Static method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- readNodeInformation(ByteBuffer) - Method in class org.predict4all.nlp.ngram.trie.DynamicNGramTrieNode
-
Read this node information from a given buffer.
- readNodeInformation(ByteBuffer) - Method in class org.predict4all.nlp.ngram.trie.StaticNGramTrieNode
-
Read the node information contained into the given buffer to this node (without any check).
- readToken() - Method in class org.predict4all.nlp.io.TokenFileInputStream
- readWord() - Method in class org.predict4all.nlp.io.WordFileInputStream
- rehash(int) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- remove(int) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- REMOVE_LETTER - org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
- removeAt(int) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
Delete the record at
index
. - REMOVED - Static variable in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
flag indicating that the value of a slot in the hashtable was deleted
- retainEntries(TIntObjectProcedure<? super V>) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- rootNode - Variable in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
-
Root node of this dictionary (this node contains as children the whole vocabulary)
- ruleBuilder() - Static method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- run() - Method in class org.predict4all.nlp.trainer.TrainerTask
S
- saveDictionary(File) - Method in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
-
Save this dictionary to a file.
Will save the dictionary relative with id only, this means that the same word dictionary should be loaded if this dictionary is opened later. - saveDictionary(File) - Method in class org.predict4all.nlp.ngram.dictionary.StaticNGramTrieDictionary
- saveDictionary(File) - Method in class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
- saveTo(File) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- saveTo(File) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- saveUserDictionary(File) - Method in class org.predict4all.nlp.words.WordDictionary
- SEMANTIC_DICTIONARY - org.predict4all.nlp.trainer.step.TrainingStep
- SemanticDictionary - Class in org.predict4all.nlp.semantic
-
Represents a semantic dictionary to be used to predict next words.
WARNING : THIS IS A WIP - SemanticDictionaryConfiguration - Interface in org.predict4all.nlp.semantic
- SemanticDictionaryGenerator - Class in org.predict4all.nlp.semantic
-
To generate a
SemanticDictionary
from an input corpus.
This creates a term x term matrix and then reduces it with SVD (via an optimized R script, "Rscript" should be available in path). - SemanticDictionaryGenerator(LanguageModel, WordDictionary, TrainingConfiguration) - Constructor for class org.predict4all.nlp.semantic.SemanticDictionaryGenerator
- SemanticTrainingDocument - Class in org.predict4all.nlp.trainer.step
- SemanticTrainingDocument(File, File) - Constructor for class org.predict4all.nlp.trainer.step.SemanticTrainingDocument
- SEMICOLON - org.predict4all.nlp.Separator
- Separator - Enum in org.predict4all.nlp
-
Represent chars between words.
This is preferred to regex pattern because separator are fully controlled.
If you add any new separator, watch the last used id - SeparatorToken - Class in org.predict4all.nlp.parser.token
- SeparatorToken(Separator) - Constructor for class org.predict4all.nlp.parser.token.SeparatorToken
- SEQUENCES - org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
- setAddNewWordsEnabled(boolean) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setBaseWordDictionaryPath(String) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- setConvertCaseFromDictionaryModelThreshold(double) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- setCorrectionDefaultCost(double) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setCorrectionDefaultFactor(double) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setCorrectionDoubleWordLeftMinFrequency(double) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setCorrectionMaxCost(double) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setCorrectionRule(CorrectionRuleBuilder) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleNode
- setCorrectionRulesRoot(CorrectionRuleNode) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setCurrentPart(StringBuilder) - Method in class org.predict4all.nlp.words.correction.model.GeneratingCorrection
- setDebugPrefix(String) - Method in class org.predict4all.nlp.ngram.NGramDictionaryGenerator
- setDebugPrefix(String) - Method in class org.predict4all.nlp.trainer.DataTrainer
- setDirectlyValidWordCountThreshold(int) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- setDynamicModelEnabled(boolean) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setDynamicModelMinimumWeight(double) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setEnabled(boolean) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleNode
- setEnableDebugInformation(boolean) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setEnableDistanceWordCorrection(boolean) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setEnableWordCorrection(boolean) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setEnableWordPredictorDebug(boolean) - Method in class org.predict4all.nlp.prediction.WordPredictor
- setEndFactor(double) - Method in class org.predict4all.nlp.words.correction.model.GeneratingCorrection
- setForceInvalid(boolean, boolean) - Method in class org.predict4all.nlp.words.model.AbstractWord
- setForceInvalid(boolean, boolean) - Method in class org.predict4all.nlp.words.model.SimpleWord
- setForceInvalid(boolean, boolean) - Method in interface org.predict4all.nlp.words.model.Word
- setForceValid(boolean, boolean) - Method in class org.predict4all.nlp.words.model.AbstractWord
- setForceValid(boolean, boolean) - Method in class org.predict4all.nlp.words.model.SimpleWord
- setForceValid(boolean, boolean) - Method in interface org.predict4all.nlp.words.model.Word
- setLanguageModel(LanguageModel) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setLsaDensitySize(int) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- setLsaFrequentWordSize(int) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- setLsaTargetSvdSize(int) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- setLsaVocabularySize(int) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- setLsaWindowSize(int) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- setMinCountToProvideCorrection(int) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setMinCountToProvidePrediction(int) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setMinUseCountToValidateNewWord(int) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setModifiedBySystem(boolean) - Method in class org.predict4all.nlp.words.model.AbstractWord
- setModifiedBySystem(boolean) - Method in interface org.predict4all.nlp.words.model.Word
- setModifiedByUser(boolean) - Method in class org.predict4all.nlp.words.model.AbstractWord
- setModifiedByUser(boolean) - Method in class org.predict4all.nlp.words.model.SimpleWord
- setModifiedByUser(boolean) - Method in interface org.predict4all.nlp.words.model.Word
- setName(String) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleNode
- setNgramDebugAfterPruning(NGramDebugger) - Method in class org.predict4all.nlp.ngram.NGramDictionaryGenerator
- setNgramDebugAfterPruning(NGramDebugger) - Method in class org.predict4all.nlp.trainer.DataTrainer
- setNgramDebugBeforePruning(NGramDebugger) - Method in class org.predict4all.nlp.ngram.NGramDictionaryGenerator
- setNgramDebugBeforePruning(NGramDebugger) - Method in class org.predict4all.nlp.trainer.DataTrainer
- setNgramMaxLastTextLength(int) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setNgramMaxLastTokenCountFactor(int) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setNgramOrder(int) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- setNgramPruningCountThreshold(int) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- setNgramPruningOrderCountThresholds(int[]) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- setNgramPruningWeightedDifferenceThreshold(double) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- setPrefixSearchMaxTokenCount(int) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setProbFactor(double, boolean) - Method in class org.predict4all.nlp.words.model.AbstractWord
- setProbFactor(double, boolean) - Method in class org.predict4all.nlp.words.model.SimpleWord
- setProbFactor(double, boolean) - Method in interface org.predict4all.nlp.words.model.Word
- setPruningMethod(NGramPruningMethod) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- setScore(double) - Method in class org.predict4all.nlp.prediction.model.AbstractPredictionToCompute
- setSemanticContrastFactor(double) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setSemanticDensityMaxBound(double) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setSemanticDensityMinBound(double) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setSemanticInputContextSize(int) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setSemanticModelEnabled(boolean) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setSmoothingDiscountValue(double) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- setSmoothingDiscountValueLowerBound(double) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- setSmoothingDiscountValueUpperBound(double) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- setStopWordDictionaryPath(String) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- setTranslationProvider(DefaultCorrectionRuleGenerator.TranslationProvider) - Static method in class org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator
- setTwoWordPredictionEnabled(boolean) - Method in class org.predict4all.nlp.prediction.PredictionParameter
- setUnknownWordCountThreshold(int) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- setUp(int) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
initializes the hashtable to a prime capacity which is at least
initialCapacity + 1
. - setUpperCaseReplacementThreshold(double) - Method in class org.predict4all.nlp.trainer.configuration.TrainingConfiguration
- SimpleGeneratingCorrection - Class in org.predict4all.nlp.words.correction.model
- SimpleGeneratingCorrection(String, boolean) - Constructor for class org.predict4all.nlp.words.correction.model.SimpleGeneratingCorrection
- SimpleWord - Class in org.predict4all.nlp.words.model
- SimpleWord(int, String) - Constructor for class org.predict4all.nlp.words.model.SimpleWord
- SimpleWord(int, String, boolean, boolean, double, boolean, boolean) - Constructor for class org.predict4all.nlp.words.model.SimpleWord
- SingleThreadDoubleAdder - Class in org.predict4all.nlp.utils
-
Similar to
DoubleAdder
but for a single threaded usage.
Just a simple double reference without any overhead. - SingleThreadDoubleAdder(double) - Constructor for class org.predict4all.nlp.utils.SingleThreadDoubleAdder
- size() - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
Returns the number of distinct elements in this collection.
- size() - Method in interface org.predict4all.nlp.parser.StringProducer
- size() - Method in class org.predict4all.nlp.utils.FifoSet
- size() - Method in class org.predict4all.nlp.words.WordDictionary
- SLASH - org.predict4all.nlp.Separator
- SPACE - org.predict4all.nlp.Separator
- SpecialWordMatcher - Class in org.predict4all.nlp.language.french.matcher
- SpecialWordMatcher() - Constructor for class org.predict4all.nlp.language.french.matcher.SpecialWordMatcher
- start() - Static method in class org.predict4all.nlp.parser.matcher.TokenRegexMatcher.TokenRegexMatcherBuilder
- START - org.predict4all.nlp.Tag
- startsWith(String) - Method in class org.predict4all.nlp.words.model.AbstractWord
- startsWith(String) - Method in interface org.predict4all.nlp.words.model.Word
- STATIC_TRIE_NODE_SIZE_BYTE - Static variable in class org.predict4all.nlp.ngram.trie.AbstractNGramTrieNode
-
Static node byte size (3 integer, 2 double).
Integer : word id, children size, children position.
Double : frequency, backoff weight. - StaticNGramTrieDictionary - Class in org.predict4all.nlp.ngram.dictionary
-
Represent a static ngram dictionary where trie node are loaded "on demand" while browsing through the nodes.
This dictionary is read only and cannot be updated or saved : methods likeStaticNGramTrieDictionary.updateProbabilities(double[])
,StaticNGramTrieDictionary.putAndIncrementBy(int[], int)
are not supported by this dictionary. - StaticNGramTrieDictionary() - Constructor for class org.predict4all.nlp.ngram.dictionary.StaticNGramTrieDictionary
- StaticNGramTrieNode - Class in org.predict4all.nlp.ngram.trie
-
Represent a static ngram trie node : when node are used only to retrieve information and compute probabilities, but children are never updated.
This node is particular because children node are loaded on demand from aFileChannel
.
This node is produced in a read only version : to create this node,DynamicNGramTrieNode
andTrainingNGramDictionary
should be used. - StaticNGramTrieNode() - Constructor for class org.predict4all.nlp.ngram.trie.StaticNGramTrieNode
- StopWordDictionary - Interface in org.predict4all.nlp.language
-
A language specific dictionary : contains every stop words for a language
- strEquals(String, String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- strEqualsIgnoreCase(String, String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- StringProducer - Interface in org.predict4all.nlp.parser
- strSplit(String, String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- substringInCurrentPart(int, int) - Method in interface org.predict4all.nlp.words.correction.model.GeneratingCorrectionI
- substringInCurrentPart(int, int) - Method in class org.predict4all.nlp.words.correction.model.SimpleGeneratingCorrection
- sum() - Method in class org.predict4all.nlp.utils.SingleThreadDoubleAdder
T
- TAB - org.predict4all.nlp.Separator
- Tag - Enum in org.predict4all.nlp
-
Represent a specific value in a corpus.
Useful to tag specific part of the corpus without any semantic information.
START : represent a sentence start UNKNOWN : represent a word/expression out of vocabulary - TagToken - Class in org.predict4all.nlp.parser.token
- TagWord - Class in org.predict4all.nlp.words.model
- TagWord(Tag) - Constructor for class org.predict4all.nlp.words.model.TagWord
- TermMatcherUtils - Class in org.predict4all.nlp.parser.matcher
- testAll() - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- then(String) - Method in class org.predict4all.nlp.parser.matcher.TokenRegexMatcher.TokenRegexMatcherBuilder
- then(Separator) - Method in class org.predict4all.nlp.parser.matcher.TokenRegexMatcher.TokenRegexMatcherBuilder
- Token - Interface in org.predict4all.nlp.parser.token
-
Represent the lowest unit when parsing a text.
- TOKEN_CONVERT - org.predict4all.nlp.trainer.step.TrainingStep
- TOKEN_COUNT_FORMAT - Static variable in class org.predict4all.nlp.parser.Tokenizer
- TokenAppender - Interface in org.predict4all.nlp.parser
- TokenConverter - Class in org.predict4all.nlp.parser.matcher
-
This token converter will convert input token list to another token list, with matched
TokenMatcher
pattern. - TokenConverter(TokenMatcher[]) - Constructor for class org.predict4all.nlp.parser.matcher.TokenConverter
- TokenConverterTrainingDocument - Class in org.predict4all.nlp.trainer.step
- TokenConverterTrainingDocument(File, File) - Constructor for class org.predict4all.nlp.trainer.step.TokenConverterTrainingDocument
- TokenFileInputStream - Class in org.predict4all.nlp.io
- TokenFileInputStream(File) - Constructor for class org.predict4all.nlp.io.TokenFileInputStream
- TokenFileOutputStream - Class in org.predict4all.nlp.io
- TokenFileOutputStream(File) - Constructor for class org.predict4all.nlp.io.TokenFileOutputStream
- tokenize(String) - Method in class org.predict4all.nlp.parser.Tokenizer
- tokenize(TrainingCorpus) - Method in class org.predict4all.nlp.parser.Tokenizer
- Tokenizer - Class in org.predict4all.nlp.parser
-
This takes a raw text and to create tokens from it.
- Tokenizer(LanguageModel) - Constructor for class org.predict4all.nlp.parser.Tokenizer
- TokenListAppender - Class in org.predict4all.nlp.parser
- TokenListAppender(List<Token>) - Constructor for class org.predict4all.nlp.parser.TokenListAppender
- TokenListProvider - Class in org.predict4all.nlp.parser
- TokenListProvider(Collection<Token>) - Constructor for class org.predict4all.nlp.parser.TokenListProvider
- TokenMatcher - Interface in org.predict4all.nlp.parser.matcher
-
Represent a matcher that will try to detect if a given token match a specific pattern.
If so, thePatternMatched
contains the the normalized representation of the matched tokens and eventually anEquivalenceClass
. - TokenProvider - Interface in org.predict4all.nlp.parser
- TokenRegexMatcher - Class in org.predict4all.nlp.parser.matcher
- TokenRegexMatcher.TokenRegexMatcherBuilder - Class in org.predict4all.nlp.parser.matcher
- TokenRegexMatcherBuilder() - Constructor for class org.predict4all.nlp.parser.matcher.TokenRegexMatcher.TokenRegexMatcherBuilder
- TokenRegexResult - Class in org.predict4all.nlp.parser.matcher
- TokenRegexResult(Token, List<String>) - Constructor for class org.predict4all.nlp.parser.matcher.TokenRegexResult
- toPrimitive(Integer[]) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- toString() - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- toString() - Method in class org.predict4all.nlp.prediction.WordPrediction
- toString() - Method in class org.predict4all.nlp.utils.Pair
- toString() - Method in class org.predict4all.nlp.utils.Triple
- toString() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRule
- toString() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- toString() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleNode
- toString() - Method in class org.predict4all.nlp.words.correction.model.GeneratingCorrection
- toString() - Method in class org.predict4all.nlp.words.correction.model.SimpleGeneratingCorrection
- toString() - Method in class org.predict4all.nlp.words.model.AbstractWord
- toString() - Method in class org.predict4all.nlp.words.model.UserWord
- toString() - Method in class org.predict4all.nlp.words.NextWord
- toString() - Method in class org.predict4all.nlp.words.WordPrefixDetected
- trainDynamicModel(String, boolean) - Method in class org.predict4all.nlp.prediction.WordPredictor
- TrainerTask - Class in org.predict4all.nlp.trainer
- TrainerTask(ProgressIndicator, AbstractTrainingDocument) - Constructor for class org.predict4all.nlp.trainer.TrainerTask
- TrainingConfiguration - Class in org.predict4all.nlp.trainer.configuration
- TrainingCorpus - Class in org.predict4all.nlp.trainer.corpus
- TrainingCorpus(int, File, File, String) - Constructor for class org.predict4all.nlp.trainer.corpus.TrainingCorpus
- TrainingNGramDictionary - Class in org.predict4all.nlp.ngram.dictionary
-
Represent a training dictionary : a ngram dictionary used while training an ngram model.
This dictionary is useful because it supports dynamic insertion and probabilities computing... - TrainingNGramDictionary(int) - Constructor for class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
- TrainingNGramDictionary(DynamicNGramTrieNode, int) - Constructor for class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
- TrainingStep - Enum in org.predict4all.nlp.trainer.step
-
Represent the possible training steps.
This allow training to be stopped and started again at a specific step : going to converted tokens, and then running WORDS_DICTIONARY multiple times. - transformValues(TObjectFunction<V, V>) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- transitive(CorrectionRule, CorrectionRule) - Static method in class org.predict4all.nlp.words.correction.model.CorrectionRule
- transitivePossible(CorrectionRule) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRule
- translate(String, Object...) - Method in interface org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.TranslationProvider
- TrieNodeMap<V> - Class in org.predict4all.nlp.ngram.trie.map
-
Custom implementation copied from
TIntObjectHashMap
but with less attribute to reduce the heap size in Trie.
Source is copied from class hierarchy (with manually merging methods):THash
TPrimitiveHash
TIntHash
TIntObjectHashMap
The implementation is modified to keep the minimum attribute count on this Map because this TrieNodeMap will be created a lot of time ! - TrieNodeMap() - Constructor for class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
Creates a new
THash
instance with a prime capacity at or near the minimum needed to holdinitialCapacity
elements with load factorloadFactor
without triggering a rehash. - TrieNodeMapConstant - Class in org.predict4all.nlp.ngram.trie.map
- TrieNodeMapConstant() - Constructor for class org.predict4all.nlp.ngram.trie.map.TrieNodeMapConstant
- trimToSize() - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
This simply calls
compact
. - Triple<K,T,V> - Class in org.predict4all.nlp.utils
- TWO_DIGIT_FORMAT_ALWAYS - Static variable in class org.predict4all.nlp.language.french.FrenchLanguageUtils
- TWO_DIGIT_FORMAT_SOMETIMES - Static variable in class org.predict4all.nlp.language.french.FrenchLanguageUtils
- TWOPOINT - org.predict4all.nlp.Separator
- TYPE_EQUIVALENCE_CLASS - Static variable in interface org.predict4all.nlp.parser.token.Token
- TYPE_EQUIVALENCE_CLASS - Static variable in interface org.predict4all.nlp.words.model.Word
- TYPE_NGRAM_TAG - Static variable in interface org.predict4all.nlp.words.model.Word
- TYPE_SEPARATOR - Static variable in interface org.predict4all.nlp.parser.token.Token
- TYPE_SIMPLE - Static variable in interface org.predict4all.nlp.words.model.Word
- TYPE_TAG - Static variable in interface org.predict4all.nlp.parser.token.Token
- TYPE_USER_WORD - Static variable in interface org.predict4all.nlp.words.model.Word
- TYPE_WORD - Static variable in interface org.predict4all.nlp.parser.token.Token
U
- uncapitalize(String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- UniquePredictionToCompute - Class in org.predict4all.nlp.prediction.model
- UniquePredictionToCompute(int, double, boolean, StringBuilder) - Constructor for class org.predict4all.nlp.prediction.model.UniquePredictionToCompute
- UNKNOWN - org.predict4all.nlp.Tag
- updateProbabilities(double[]) - Method in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
-
Update the whole probabilities in this dictionary.
Can take a while if there is a lot of nodes in the dictionary. - updateProbabilities(double[]) - Method in class org.predict4all.nlp.ngram.dictionary.StaticNGramTrieDictionary
- updateProbabilities(double[]) - Method in class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
- updateProbabilities(int[], int, double[]) - Method in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
-
Update probabilities in this dictionary for a specific ngram prefix : this will update the probabilities of the prefix children, and update the backoff weight of the parent node.
This is much more optimized thanAbstractNGramDictionary.updateProbabilities(double[])
- updateProbabilities(int[], int, double[]) - Method in class org.predict4all.nlp.ngram.dictionary.StaticNGramTrieDictionary
- updateProbabilities(int[], int, double[]) - Method in class org.predict4all.nlp.ngram.dictionary.TrainingNGramDictionary
- upperCase(String) - Static method in class org.predict4all.nlp.utils.Predict4AllUtils
- UserWord - Class in org.predict4all.nlp.words.model
- UserWord(int, String) - Constructor for class org.predict4all.nlp.words.model.UserWord
- UserWord(int, String, double, boolean, boolean, long, int) - Constructor for class org.predict4all.nlp.words.model.UserWord
V
- valueOf(String) - Static method in enum org.predict4all.nlp.EquivalenceClass
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.predict4all.nlp.Separator
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.predict4all.nlp.Tag
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.predict4all.nlp.trainer.configuration.NGramPruningMethod
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.predict4all.nlp.trainer.step.TrainingStep
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.predict4all.nlp.words.correction.model.CorrectionRuleNode.CorrectionRuleNodeType
-
Returns the enum constant of this type with the specified name.
- values() - Static method in enum org.predict4all.nlp.EquivalenceClass
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- values() - Static method in enum org.predict4all.nlp.Separator
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.predict4all.nlp.Tag
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.predict4all.nlp.trainer.configuration.NGramPruningMethod
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.predict4all.nlp.trainer.step.TrainingStep
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.predict4all.nlp.words.correction.model.CorrectionRuleNode.CorrectionRuleNodeType
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values(V[]) - Method in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
- VERSION - Static variable in class org.predict4all.nlp.Predict4AllInfo
- VISUAL_CONFUSION - org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
W
- WEEK_DAYS - Static variable in class org.predict4all.nlp.language.french.FrenchLanguageUtils
- WEIGHTED_DIFFERENCE_FULL_PROB - org.predict4all.nlp.trainer.configuration.NGramPruningMethod
- WEIGHTED_DIFFERENCE_RAW_PROB - org.predict4all.nlp.trainer.configuration.NGramPruningMethod
- withBidirectionnal(boolean) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- withConfusionSet(String...) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- withCost(double) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- withError(String...) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- withFactor(double) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- withMaxIndexFromEnd(int) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- withMaxIndexFromStart(int) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- withMinIndexFromEnd(int) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- withMinIndexFromStart(int) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- withNgramCounts(Map<Integer, Pair<Integer, Integer>>) - Method in class org.predict4all.nlp.trainer.DataTrainerResult.Builder
- withReplacement(String...) - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- word - Variable in class org.predict4all.nlp.words.model.SimpleWord
- Word - Interface in org.predict4all.nlp.words.model
-
Represent a word stored in a
WordDictionary
- word are stored with a int ID to optimize memory usage. - WORD_ENDINGS - org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
- WORD_SPACE_APOSTROPHE - org.predict4all.nlp.words.correction.DefaultCorrectionRuleGenerator.CorrectionRuleType
- WordCorrectionGenerator - Class in org.predict4all.nlp.words.correction
-
Idées inversion à distance de 2 = "renuméré" Gestion des inversions
- WordCorrectionGenerator(WordDictionary, AbstractNGramDictionary<? extends AbstractNGramTrieNode<?>>, PredictionParameter) - Constructor for class org.predict4all.nlp.words.correction.WordCorrectionGenerator
- WordDictionary - Class in org.predict4all.nlp.words
-
Represent a word dictionary.
This dictionary identify each sequence of chars as an unique "word" and keep information for this word (frequency, etc...) - WordDictionary(LanguageModel, String) - Constructor for class org.predict4all.nlp.words.WordDictionary
- WordDictionaryGenerator - Class in org.predict4all.nlp.words
-
This will generate a word dictionary from a
TrainingCorpus
: this will detect different word in training corpus and try to filter out words : match lower/upper case words, filter on aBaseWordDictionary
, exclude low count words, etc. - WordDictionaryGenerator(LanguageModel, TrainingConfiguration) - Constructor for class org.predict4all.nlp.words.WordDictionaryGenerator
- WordDictionaryMatchingException - Exception in org.predict4all.nlp.exception
-
This exception is mainly thrown if an user dictionary is loaded but is was saved from a previous dictionary.
- WordDictionaryMatchingException() - Constructor for exception org.predict4all.nlp.exception.WordDictionaryMatchingException
- WordDictionaryTrainingDocument - Class in org.predict4all.nlp.trainer.step
- WordDictionaryTrainingDocument(File, File) - Constructor for class org.predict4all.nlp.trainer.step.WordDictionaryTrainingDocument
- WordFileInputStream - Class in org.predict4all.nlp.io
- WordFileInputStream(File) - Constructor for class org.predict4all.nlp.io.WordFileInputStream
- WordFileOutputStream - Class in org.predict4all.nlp.io
- WordFileOutputStream(File) - Constructor for class org.predict4all.nlp.io.WordFileOutputStream
- WordPrediction - Class in org.predict4all.nlp.prediction
- WordPrediction(String, String, boolean, double, int, boolean, int, String) - Constructor for class org.predict4all.nlp.prediction.WordPrediction
- WordPredictionResult - Class in org.predict4all.nlp.prediction
- WordPredictionResult(String, int, List<WordPrediction>) - Constructor for class org.predict4all.nlp.prediction.WordPredictionResult
- WordPredictor - Class in org.predict4all.nlp.prediction
- WordPredictor(PredictionParameter, WordDictionary, AbstractNGramDictionary<? extends AbstractNGramTrieNode<?>>) - Constructor for class org.predict4all.nlp.prediction.WordPredictor
- WordPredictor(PredictionParameter, WordDictionary, AbstractNGramDictionary<? extends AbstractNGramTrieNode<?>>, AbstractNGramDictionary<? extends AbstractNGramTrieNode<?>>, SemanticDictionary) - Constructor for class org.predict4all.nlp.prediction.WordPredictor
- WordPrefixDetected - Class in org.predict4all.nlp.words
-
Contains information about a started word (found in dictionary)
- WordPrefixDetected(String, int, Map<BiIntegerKey, NextWord>, boolean) - Constructor for class org.predict4all.nlp.words.WordPrefixDetected
- WordPrefixDetector - Class in org.predict4all.nlp.words
-
Useful to detect if a existing word is started in a token list.
It's important to detect if a word is already started when predicting next word, because the prediction result should always takes care of giving prediction result that starts like the already started word.
Because word are allowed to have word separator inside (hyphen, etc...), started word detection is much more complicated that just checking if the token list ends with a token separator. - WordPrefixDetector(WordDictionary, WordCorrectionGenerator, PredictionParameter) - Constructor for class org.predict4all.nlp.words.WordPrefixDetector
- WORDS_DICTIONARY - org.predict4all.nlp.trainer.step.TrainingStep
- WordToken - Class in org.predict4all.nlp.parser.token
- WordToken(String) - Constructor for class org.predict4all.nlp.parser.token.WordToken
- write() - Method in class org.predict4all.nlp.words.correction.model.CorrectionRuleBuilder
- writeDictionaryInfo(ByteBuffer) - Method in class org.predict4all.nlp.ngram.dictionary.AbstractNGramDictionary
-
Write the general information for this dictionary to a given buffer
- writeInformations(int) - Method in class org.predict4all.nlp.trainer.corpus.AbstractTrainingDocument
- writeLevelForDynamicUse(FileChannel, int, int, int) - Method in class org.predict4all.nlp.ngram.trie.DynamicNGramTrieNode
-
Write a trie level with
DynamicNGramTrieNode.writeLevel(FileChannel, int, int, int, BiConsumer)
withDynamicNGramTrieNode.writeDynamicNode(FileChannel, int)
as save method.
Result trie file should be read asDynamicNGramTrieNode
- writeLevelForStaticUse(FileChannel, int, int, int) - Method in class org.predict4all.nlp.ngram.trie.DynamicNGramTrieNode
-
Write a trie level with
DynamicNGramTrieNode.writeLevel(FileChannel, int, int, int, BiConsumer)
withDynamicNGramTrieNode.writeStaticNode(FileChannel, int)
as save method.
Result trie file should be read asStaticNGramTrieNode
- writeToken(Token) - Method in class org.predict4all.nlp.io.TokenFileOutputStream
- writeWord(Word) - Method in class org.predict4all.nlp.io.WordFileOutputStream
_
- _free - Variable in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
the current number of free slots in the hash.
- _maxSize - Variable in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
The maximum number of elements allowed without allocating more space.
- _set - Variable in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
the set of ints
- _size - Variable in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
the current number of occupied slots in the hash.
- _states - Variable in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
flags indicating whether each position in the hash is FREE, FULL, or REMOVED
- _values - Variable in class org.predict4all.nlp.ngram.trie.map.TrieNodeMap
-
the values of the map
All Classes All Packages