QueryEngine (mimir-core 6.2 API)

java.lang.Object
- gate.mimir.search.QueryEngine

```
public class QueryEngine
extends Object
```
This class represents the entry point to the Mimir search API.

Nested Class Summary

Nested Classes
Modifier and Type Class and Description

static class QueryEngine.IndexType
Represents the type of index that should be searched.

Nested Classes
Modifier and Type	Class and Description
`static class`	`QueryEngine.IndexType` Represents the type of index that should be searched.

Field Summary

Fields
Modifier and Type	Field and Description
`static int`	`DEFAULT_DOCUMENT_BLOCK_SIZE` The default value for the document block size.
`protected Executor`	`executor` The executor used to run tasks for query execution.
`protected MimirIndex`	`index` The index being searched.
`protected IndexConfig`	`indexConfig` The index configuration this index was built from.
`protected static org.slf4j.Logger`	`logger`
`static long`	`MAX_IN_MEMORY_INDEX` The maximum size of an index that can be loaded in memory (by default 64 MB).
`protected gate.LanguageAnalyser`	`queryTokeniser` The tokeniser (technically any GATE LA) used to split the text segments found in queries into individual tokens.
`protected Callable<MimirScorer>`	`scorerSource` A callable that produces new `MimirScorer` instances on request.
`protected boolean`	`subBindingsEnabled` Should sub-bindings be generated when searching?

Constructor Summary

Constructors
Constructor and Description

QueryEngine(MimirIndex index)
Constructs a new query engine for a MimirIndex.

Constructors
Constructor and Description
`QueryEngine(MimirIndex index)` Constructs a new query engine for a `MimirIndex`.

Method Summary

All Methods Instance Methods Concrete Methods
Modifier and Type	Method and Description
`void`	`close()` Closes this `QueryEngine` and releases all resources.
`SemanticAnnotationHelper`	`getAnnotationHelper(AnnotationQuery query)` Get the `SemanticAnnotationHelper` corresponding to a query's annotation type.
`SemanticAnnotationHelper`	`getAnnotationHelper(String annotationType)`
`AtomicAnnotationIndex`	`getAnnotationIndex(String annotationType)` Returns the index that stores the data for a particular semantic annotation type.
`int`	`getDocumentBlockSize()` Gets the configuration parameter specifying the number of documents that get processed as a block.
`Serializable`	`getDocumentMetadataField(long docID, String fieldName)` Obtains an arbitrary document metadata field from the stored document data.
`String`	`getDocumentTitle(long docID)`
`String`	`getDocumentURI(long docID)`
`Executor`	`getExecutor()` Gets the executor used by this query engine.
`String[][]`	`getHitText(Binding hit)` Gets the text covered by a given binding.
`String[][]`	`getHitText(Binding hit, int leftContext, int rightContext)` Obtains the document text for a given search hit.
`MimirIndex`	`getIndex()` Gets the index this query engine is searching.
`IndexConfig`	`getIndexConfig()`
`String[][]`	`getLeftContext(Binding hit, int numTokens)` Get the text to the left of the given binding.
`QueryRunner`	`getQueryRunner(QueryNode query)` Obtains a query executor for a given `QueryNode`.
`QueryRunner`	`getQueryRunner(String query)` Obtains a query executor for a given query, expressed as a String.
`String[][]`	`getRightContext(Binding hit, int numTokens)` Get the text to the right of the given binding.
`Callable<MimirScorer>`	`getScorerSource()` Gets the current source of scorers.
`int`	`getSubIndexPosition(QueryEngine.IndexType indexType, String indexName)` Finds the location for a given sub-index in the arrays returned by `#getIndexes()` and `#getDirectIndexes()`.
`String[][]`	`getText(long documentID, int termPosition, int length)` Obtains the text for a specified region of a document.
`AtomicTokenIndex`	`getTokenIndex(String featureName)` Returns the index that stores the data for a particular feature of token annotations.
`boolean`	`isSubBindingsEnabled()` Are sub-bindings used in this query engine.
`void`	`releaseQueryRunner(QueryRunner qRunner)` Notifies the QueryEngine that the given QueryRunner has been closed.
`void`	`renderDocument(long docID, List<Binding> hits, Appendable output)` Renders a document and a list of hits.
`void`	`setDocumentBlockSize(int documentBlockSize)` Sets the configuration parameter specifying the number of documents that get processed in one go (e.g.
`void`	`setExecutor(Executor executor)` Sets the `Executor` used for executing tasks required for running queries.
`void`	`setQueryTokeniser(gate.LanguageAnalyser queryTokeniser)` Sets the tokeniser (technically any GATE analyser) used to split the text segments found in queries into individual tokens.
`void`	`setScorerSource(Callable<MimirScorer> scorerSource)` Provides a `Callable` that the Query Engine can use for obtaining new instances of `MimirScorer` to be used for ranking new queries.
`void`	`setSubBindingsEnabled(boolean subBindingsEnabled)`

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

- Field Detail
  - MAX_IN_MEMORY_INDEX
```
public static final long MAX_IN_MEMORY_INDEX
```
    The maximum size of an index that can be loaded in memory (by default 64 MB).
    
    See Also:
    
    Constant Field Values
  - DEFAULT_DOCUMENT_BLOCK_SIZE
```
public static final int DEFAULT_DOCUMENT_BLOCK_SIZE
```
    The default value for the document block size.
    
    See Also:
    
    setDocumentBlockSize(int), Constant Field Values
  - index
```
protected final MimirIndex index
```
    The index being searched.
  - indexConfig
```
protected IndexConfig indexConfig
```
    The index configuration this index was built from.
  - subBindingsEnabled
```
protected boolean subBindingsEnabled
```
    Should sub-bindings be generated when searching?
  - scorerSource
```
protected Callable<MimirScorer> scorerSource
```
    A callable that produces new MimirScorer instances on request.
  - logger
```
protected static final org.slf4j.Logger logger
```
  - queryTokeniser
```
protected gate.LanguageAnalyser queryTokeniser
```
    The tokeniser (technically any GATE LA) used to split the text segments found in queries into individual tokens. The same tokeniser used to create the indexed documents should be used here. If this value is not set, then a default ANNIE tokeniser will be used.
  - executor
```
protected Executor executor
```
    The executor used to run tasks for query execution. If the value is not set, then new threads are created as needed.
- Constructor Detail
  - QueryEngine
```
public QueryEngine(MimirIndex index)
```
    Constructs a new query engine for a MimirIndex.
    
    Parameters:
    
    index - the index to be searched.
- Method Detail
  - isSubBindingsEnabled
```
public boolean isSubBindingsEnabled()
```
    Are sub-bindings used in this query engine. Sub-bindings are used to associate sub-queries with segments of the returned hits. This can be useful for showing high-level details about the returned hits. By default, sub-bindings are not used.
    
    Returns:
    
    the subBindingsEnabled
  - setSubBindingsEnabled
```
public void setSubBindingsEnabled(boolean subBindingsEnabled)
```
    Parameters:
    
    subBindingsEnabled - the subBindingsEnabled to set
  - getDocumentBlockSize
```
public int getDocumentBlockSize()
```
    Gets the configuration parameter specifying the number of documents that get processed as a block. This is used to optimise the search process by limiting the number of results that get calculated by default.
    
    Returns:
  - setDocumentBlockSize
```
public void setDocumentBlockSize(int documentBlockSize)
```
    Sets the configuration parameter specifying the number of documents that get processed in one go (e.g. the number of documents that get ranked when enumerating results). This is used to optimise the search process by limiting the number of results that get calculated by default. Defaults to DEFAULT_DOCUMENT_BLOCK_SIZE.
    
    Parameters:
    
    documentBlockSize -
  - getScorerSource
```
public Callable<MimirScorer> getScorerSource()
```
    Gets the current source of scorers.
    
    Returns:
    
    See Also:
    
    setScorerSource(Callable)
  - setScorerSource
```
public void setScorerSource(Callable<MimirScorer> scorerSource)
```
    Provides a Callable that the Query Engine can use for obtaining new instances of MimirScorer to be used for ranking new queries.
    
    Parameters:
    
    scorerSource -
  - getExecutor
```
public Executor getExecutor()
```
    Gets the executor used by this query engine.
    
    Returns:
    
    an executor that can be used for running tasks pertinent to this QueryEngine.
  - setExecutor
```
public void setExecutor(Executor executor)
```
    Sets the Executor used for executing tasks required for running queries. This allows the use of some type thread pooling, is needed. If this value is not set, then new threads are created as required.
    
    Parameters:
    
    executor -
  - setQueryTokeniser
```
public void setQueryTokeniser(gate.LanguageAnalyser queryTokeniser)
```
    Sets the tokeniser (technically any GATE analyser) used to split the text segments found in queries into individual tokens. The same tokeniser used to create the indexed documents should be used here. If this value is not set, then a default ANNIE tokeniser will be used.
    
    Parameters:
    
    queryTokeniser - the new tokeniser to be used for parsing queries.
  - getSubIndexPosition
```
public int getSubIndexPosition(QueryEngine.IndexType indexType,
                               String indexName)
```
    Finds the location for a given sub-index in the arrays returned by #getIndexes() and #getDirectIndexes().
    
    Parameters:
    
    indexType - the IndexType of the requested sub-index (tokens or annotations).
    
    indexName - the "name" of the requested sub-index (the indexed feature name for QueryEngine.IndexType.TOKENS indexes, or the annotation type in the case of QueryEngine.IndexType.ANNOTATIONS indexes).
    
    Returns:
    
    the position in the indexes array for the requested index, or -1 if the requested index does not exist.
  - getTokenIndex
```
public AtomicTokenIndex getTokenIndex(String featureName)
```
    Returns the index that stores the data for a particular feature of token annotations.
    
    Parameters:
    
    featureName -
    
    Returns:
  - getAnnotationIndex
```
public AtomicAnnotationIndex getAnnotationIndex(String annotationType)
```
    Returns the index that stores the data for a particular semantic annotation type.
    
    Parameters:
    
    annotationType -
    
    Returns:
  - getAnnotationHelper
```
public SemanticAnnotationHelper getAnnotationHelper(String annotationType)
```
  - getIndex
```
public MimirIndex getIndex()
```
    Gets the index this query engine is searching.
    
    Returns:
  - getIndexConfig
```
public IndexConfig getIndexConfig()
```
    Returns:
    
    the index configuration for this index
  - getAnnotationHelper
```
public SemanticAnnotationHelper getAnnotationHelper(AnnotationQuery query)
```
    Get the SemanticAnnotationHelper corresponding to a query's annotation type.
    
    Throws:
    
    IllegalArgumentException - if the annotation helper for this type cannot be found.
  - getQueryRunner
```
public QueryRunner getQueryRunner(QueryNode query)
                           throws IOException
```
    Obtains a query executor for a given QueryNode.
    
    Parameters:
    
    query - the query to be executed.
    
    Returns:
    
    a QueryExecutor for the provided query, running over the indexes in this query engine.
    
    Throws:
    
    IOException - if the index files cannot be accessed.
  - releaseQueryRunner
```
public void releaseQueryRunner(QueryRunner qRunner)
```
    Notifies the QueryEngine that the given QueryRunner has been closed.
    
    Parameters:
    
    qRunner -
  - getQueryRunner
```
public QueryRunner getQueryRunner(String query)
                           throws IOException,
                                  ParseException
```
    Obtains a query executor for a given query, expressed as a String.
    
    Parameters:
    
    query - the query to be executed.
    
    Returns:
    
    a QueryExecutor for the provided query, running over the indexes in this query engine.
    
    Throws:
    
    IOException - if the index files cannot be accessed.
    
    ParseException - if the string provided for the query cannot be parsed.
  - getHitText
```
public String[][] getHitText(Binding hit,
                             int leftContext,
                             int rightContext)
                      throws IndexException
```
    Obtains the document text for a given search hit.
    
    Parameters:
    
    hit - the search hit for which the text is sought.
    
    leftContext - the number of tokens to the left of the hit to be included in the result.
    
    rightContext - the number of tokens to the right of the hit to be included in the result.
    
    Returns:
    
    an array of arrays of Strings, representing the tokens and spaces at the location of the search hit. The first element of the array is an array of tokens, the second element contains the spaces.The first element of each array corresponds to the first token of the left context.
    
    Throws:
    
    IOException
    
    IndexException
  - getHitText
```
public String[][] getHitText(Binding hit)
                      throws IndexException
```
    Gets the text covered by a given binding.
    
    Parameters:
    
    hit - the binding.
    
    Returns:
    
    an array of two string arrays, the first representing the tokens covered by the binding and the second the spaces after each token.
    
    Throws:
    
    IOException
    
    IndexException
  - getLeftContext
```
public String[][] getLeftContext(Binding hit,
                                 int numTokens)
                          throws IndexException
```
    Get the text to the left of the given binding.
    
    Parameters:
    
    hit - the binding.
    
    numTokens - the maximum number of tokens of context to return. The actual number of tokens returned may be smaller than this if the hit starts within numTokens tokens of the start of the document.
    
    Returns:
    
    an array of two string arrays, the first representing the tokens before the binding and the second the spaces after each token.
    
    Throws:
    
    IOException
    
    IndexException
  - getRightContext
```
public String[][] getRightContext(Binding hit,
                                  int numTokens)
                           throws IndexException
```
    Get the text to the right of the given binding.
    
    Parameters:
    
    hit - the binding.
    
    numTokens - the maximum number of tokens of context to return. The actual number of tokens returned may be smaller than this if the hit ends within numTokens tokens of the end of the document.
    
    Returns:
    
    an array of two string arrays, the first representing the tokens after the binding and the second the spaces after each token.
    
    Throws:
    
    IOException
    
    IndexException
  - getText
```
public String[][] getText(long documentID,
                          int termPosition,
                          int length)
                   throws IndexException
```
    Obtains the text for a specified region of a document. The return value is a pair of parallel arrays, one of tokens and the other of the spaces between them. If length >= 0, the two parallel arrays will always be exactly length items long, but any token positions that do not exist in the document (i.e. before the start or beyond the end of the text) will be null. If length < 0 the arrays will be of sufficient length to hold all the tokens from termPosition to the end of the document, with no trailing nulls (there may be leading nulls if termPosition < 0).
    
    Parameters:
    
    documentID - the document ID
    
    termPosition - the position of the first term required
    
    length - the number of terms to return. May be negativem, in which case all terms from termPosition to the end of the document will be returned.
    
    Returns:
    
    an array of two string arrays. The first represents the tokens and the second represents the spaces between them
    
    Throws:
    
    IndexException
  - renderDocument
```
public void renderDocument(long docID,
                           List<Binding> hits,
                           Appendable output)
                    throws IOException,
                           IndexException
```
    Renders a document and a list of hits.
    
    Parameters:
    
    docID - the document to be rendered.
    
    hits - the list of hits to be rendered.
    
    output - the Appendable used to write the output.
    
    Throws:
    
    IOException - if the output cannot be written to.
    
    IndexException - if no document renderer is available.
  - getDocumentTitle
```
public String getDocumentTitle(long docID)
                        throws IndexException
```
    Throws:
    
    IndexException
  - getDocumentURI
```
public String getDocumentURI(long docID)
                      throws IndexException
```
    Throws:
    
    IndexException
  - getDocumentMetadataField
```
public Serializable getDocumentMetadataField(long docID,
                                             String fieldName)
                                      throws IndexException
```
    Obtains an arbitrary document metadata field from the stored document data. DocumentMetadataHelpers used at indexing time can add arbitrary Serializable values as metadata fields for the documents being indexed. This method is used at search time to retrieve those values.
    
    Parameters:
    
    docID - the ID of document for which the metadata is sought.
    
    fieldName - the name of the metadata filed to be obtained
    
    Returns:
    
    the de-serialised value stored at indexing time for the given field name and document.
    
    Throws:
    
    IndexException
  - close
```
public void close()
```
    Closes this QueryEngine and releases all resources.

Class QueryEngine

Nested Class Summary

Field Summary

Constructor Summary

Method Summary

Methods inherited from class java.lang.Object

Field Detail

MAX_IN_MEMORY_INDEX

DEFAULT_DOCUMENT_BLOCK_SIZE

index

indexConfig

subBindingsEnabled

scorerSource

logger

queryTokeniser

executor

Constructor Detail

QueryEngine

Method Detail

isSubBindingsEnabled

setSubBindingsEnabled

getDocumentBlockSize

setDocumentBlockSize

getScorerSource

setScorerSource

getExecutor

setExecutor

setQueryTokeniser

getSubIndexPosition

getTokenIndex

getAnnotationIndex

getAnnotationHelper

getIndex

getIndexConfig

getAnnotationHelper

getQueryRunner

releaseQueryRunner

getQueryRunner

getHitText

getHitText

getLeftContext

getRightContext

getText

renderDocument

getDocumentTitle

getDocumentURI

getDocumentMetadataField

close