Package org.predict4all.nlp

Class Summary

Predict4AllInfo
    Retrieves information about the library (version and build date).
    This should mostly be used to ensure consistency of saved data (i.e. save and load data with the same library version).
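
The consistency check described for Predict4AllInfo can be sketched as follows. This is only an illustration under stated assumptions: the accessor names of Predict4AllInfo are not given on this page, so the version is passed in as a plain string, and the class VersionedData and its methods are hypothetical helpers, not part of the library.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;

// Hypothetical helper, not part of org.predict4all.nlp.
class VersionedData {

    // Writes the library version as a header, followed by the payload.
    // libraryVersion is assumed to come from the library's version information.
    static byte[] save(String libraryVersion, String payload) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(bytes)) {
            out.writeUTF(libraryVersion);
            out.writeUTF(payload);
        }
        return bytes.toByteArray();
    }

    // Reads the header first and refuses data written by another version.
    static String load(String libraryVersion, byte[] data) throws IOException {
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(data))) {
            String savedVersion = in.readUTF();
            if (!savedVersion.equals(libraryVersion)) {
                throw new IOException("Data saved with version " + savedVersion
                        + " but loaded with version " + libraryVersion);
            }
            return in.readUTF();
        }
    }
}
```

Storing the version next to the data makes a mismatch fail early at load time instead of producing silently inconsistent results.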

Enum Summary

EquivalenceClass
    Represents an equivalence class type that can be used when training a language model.
    Useful to group elements of the same kind in a corpus under a single concept instead of their textual form.
    These are especially used in semantic data.

Separator
    Represents the characters between words.
    This is preferred to a regex pattern because separators are fully controlled.
    If you add a new separator, check the last used id.

Tag
    Represents a specific value in a corpus.
    Useful to tag a specific part of the corpus without any semantic information.
    START: represents a sentence start
    UNKNOWN: represents a word or expression that is out of vocabulary
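
To illustrate how the START and UNKNOWN tags might appear in a tokenized corpus, here is a minimal sketch. It does not use the library's Tag enum: SketchTag, TagSketch and tagSentence are hypothetical names introduced for this example, and the angle-bracket rendering of tags is an assumption.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Hypothetical illustration, not part of org.predict4all.nlp.
class TagSketch {

    // Local stand-in for the two documented tag values.
    enum SketchTag { START, UNKNOWN }

    // Prefixes the sentence with a START marker and replaces every word
    // that is absent from the vocabulary with an UNKNOWN marker.
    static List<String> tagSentence(List<String> words, Set<String> vocabulary) {
        List<String> tagged = new ArrayList<>();
        tagged.add("<" + SketchTag.START + ">");
        for (String word : words) {
            tagged.add(vocabulary.contains(word) ? word : "<" + SketchTag.UNKNOWN + ">");
        }
        return tagged;
    }
}
```

With this representation, a model sees a dedicated marker for sentence starts and for out-of-vocabulary words rather than arbitrary text.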