Annotator to match exact phrases (by token) provided in a file against a Document.
Instantiated model of the BigTextMatcher.
Instantiated model of the BigTextMatcher. For usage and examples see the documentation of the main class.
This is the companion object of BigTextMatcher.
This is the companion object of BigTextMatcher. Please refer to that class for the documentation.
This is the companion object of BigTextMatcherModel.
This is the companion object of BigTextMatcherModel. Please refer to that class for the documentation.
Annotator to match exact phrases (by token) provided in a file against a Document.
A text file of predefined phrases must be provided with
setStoragePath
. The text file can als be set directly as an ExternalResource.In contrast to the normal
TextMatcher
, theBigTextMatcher
is designed for large corpora.For extended examples of usage, see the BigTextMatcherTestSpec.
Example
In this example, the entities file is of the form
where each line represents an entity phrase to be extracted.