co.elastic.clients.elasticsearch._types.aggregations.SignificantTextAggregation

All Implemented Interfaces:: AggregationVariant, JsonpSerializable

@JsonpDeserializable public class SignificantTextAggregation extends BucketAggregationBase implements AggregationVariant

See Also:

API specification

Nested Class Summary

Nested Classes

Modifier and Type

Class

Description

static class

SignificantTextAggregation.Builder

Builder for SignificantTextAggregation.

Nested classes/interfaces inherited from class co.elastic.clients.elasticsearch._types.aggregations.BucketAggregationBase
BucketAggregationBase.AbstractBuilder<BuilderT extends BucketAggregationBase.AbstractBuilder<BuilderT>>
Field Summary

Fields

Modifier and Type

Field

Description

static final JsonpDeserializer<SignificantTextAggregation>

_DESERIALIZER

Json deserializer for SignificantTextAggregation
Method Summary

Modifier and Type

Method

Description

Aggregation.Kind

_aggregationKind()

Aggregation variant kind.

final Query

backgroundFilter()

A background filter that can be used to focus in on significant terms within a narrower context, instead of the entire index.

final ChiSquareHeuristic

chiSquare()

Use Chi square, as described in "Information Retrieval", Manning et al., Chapter 13.5.2, as the significance score.

final TermsExclude

exclude()

Values to exclude.

final TermsAggregationExecutionHint

executionHint()

Determines whether the aggregation will use field values directly or global ordinals.

final String

field()

The field from which to return significant text.

final Boolean

filterDuplicateText()

Whether to out duplicate text to deal with noisy data.

final GoogleNormalizedDistanceHeuristic

gnd()

Use Google normalized distance as described in "The Google Similarity Distance", Cilibrasi and Vitanyi, 2007, as the significance score.

final TermsInclude

include()

Values to include.

final EmptyObject

jlh()

Use JLH score as the significance score.

final Long

minDocCount()

Only return values that are found in more than min_doc_count hits.

final MutualInformationHeuristic

mutualInformation()

Use mutual information as described in "Information Retrieval", Manning et al., Chapter 13.5.1, as the significance score.

static SignificantTextAggregation

of(Function<SignificantTextAggregation.Builder,ObjectBuilder<SignificantTextAggregation>> fn)

final PercentageScoreHeuristic

percentage()

A simple calculation of the number of documents in the foreground sample with a term divided by the number of documents in the background with the term.

final ScriptedHeuristic

scriptHeuristic()

Customized score, implemented via a script.

protected void

serializeInternal(jakarta.json.stream.JsonGenerator generator, JsonpMapper mapper)

protected static void

setupSignificantTextAggregationDeserializer(ObjectDeserializer<SignificantTextAggregation.Builder> op)

final Long

shardMinDocCount()

Regulates the certainty a shard has if the values should actually be added to the candidate list or not with respect to the min_doc_count.

final Integer

shardSize()

The number of candidate terms produced by each shard.

final Integer

size()

The number of buckets returned out of the overall terms list.

final List<String>

sourceFields()

Overrides the JSON _source fields from which text will be analyzed.

Methods inherited from class co.elastic.clients.elasticsearch._types.aggregations.BucketAggregationBase
setupBucketAggregationBaseDeserializer

Methods inherited from class co.elastic.clients.elasticsearch._types.aggregations.AggregationBase
meta, name, serialize, setupAggregationBaseDeserializer, toString

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait

Methods inherited from interface co.elastic.clients.elasticsearch._types.aggregations.AggregationVariant
_toAggregation

Field Details
- _DESERIALIZER
  
  public static final JsonpDeserializer<SignificantTextAggregation> _DESERIALIZER
  
  Json deserializer for SignificantTextAggregation
Method Details
- of
  
  public static SignificantTextAggregation of(Function<SignificantTextAggregation.Builder,ObjectBuilder<SignificantTextAggregation>> fn)
- _aggregationKind
  
  public Aggregation.Kind _aggregationKind()
  
  Aggregation variant kind.
  
  Specified by:
  
  _aggregationKind in interface AggregationVariant
- backgroundFilter
  
  @Nullable public final Query backgroundFilter()
  
  A background filter that can be used to focus in on significant terms within a narrower context, instead of the entire index.
  API name: background_filter
- chiSquare
  
  @Nullable public final ChiSquareHeuristic chiSquare()
  
  Use Chi square, as described in "Information Retrieval", Manning et al., Chapter 13.5.2, as the significance score.
  API name: chi_square
- exclude
  
  @Nullable public final TermsExclude exclude()
  
  Values to exclude.
  API name: exclude
- executionHint
  
  @Nullable public final TermsAggregationExecutionHint executionHint()
  
  Determines whether the aggregation will use field values directly or global ordinals.
  API name: execution_hint
- field
  
  @Nullable public final String field()
  
  The field from which to return significant text.
  API name: field
- filterDuplicateText
  
  @Nullable public final Boolean filterDuplicateText()
  
  Whether to out duplicate text to deal with noisy data.
  API name: filter_duplicate_text
- gnd
  
  @Nullable public final GoogleNormalizedDistanceHeuristic gnd()
  
  Use Google normalized distance as described in "The Google Similarity Distance", Cilibrasi and Vitanyi, 2007, as the significance score.
  API name: gnd
- include
  
  @Nullable public final TermsInclude include()
  
  Values to include.
  API name: include
- jlh
  
  @Nullable public final EmptyObject jlh()
  
  Use JLH score as the significance score.
  API name: jlh
- minDocCount
  
  @Nullable public final Long minDocCount()
  
  Only return values that are found in more than min_doc_count hits.
  API name: min_doc_count
- mutualInformation
  
  @Nullable public final MutualInformationHeuristic mutualInformation()
  
  Use mutual information as described in "Information Retrieval", Manning et al., Chapter 13.5.1, as the significance score.
  API name: mutual_information
- percentage
  
  @Nullable public final PercentageScoreHeuristic percentage()
  
  A simple calculation of the number of documents in the foreground sample with a term divided by the number of documents in the background with the term.
  API name: percentage
- scriptHeuristic
  
  @Nullable public final ScriptedHeuristic scriptHeuristic()
  
  Customized score, implemented via a script.
  API name: script_heuristic
- shardMinDocCount
  
  @Nullable public final Long shardMinDocCount()
  
  Regulates the certainty a shard has if the values should actually be added to the candidate list or not with respect to the min_doc_count. Values will only be considered if their local shard frequency within the set is higher than the shard_min_doc_count.
  API name: shard_min_doc_count
- shardSize
  
  @Nullable public final Integer shardSize()
  
  The number of candidate terms produced by each shard. By default, shard_size will be automatically estimated based on the number of shards and the size parameter.
  API name: shard_size
- size
  
  @Nullable public final Integer size()
  
  The number of buckets returned out of the overall terms list.
  API name: size
- sourceFields
  
  public final List<String> sourceFields()
  
  Overrides the JSON _source fields from which text will be analyzed.
  API name: source_fields
- serializeInternal
  
  protected void serializeInternal(jakarta.json.stream.JsonGenerator generator, JsonpMapper mapper)
  
  Overrides:
  
  serializeInternal in class AggregationBase
- setupSignificantTextAggregationDeserializer
  
  protected static void setupSignificantTextAggregationDeserializer(ObjectDeserializer<SignificantTextAggregation.Builder> op)

Class SignificantTextAggregation

Nested Class Summary

Nested classes/interfaces inherited from class co.elastic.clients.elasticsearch._types.aggregations.BucketAggregationBase

Field Summary

Method Summary

Methods inherited from class co.elastic.clients.elasticsearch._types.aggregations.BucketAggregationBase

Methods inherited from class co.elastic.clients.elasticsearch._types.aggregations.AggregationBase

Methods inherited from class java.lang.Object

Methods inherited from interface co.elastic.clients.elasticsearch._types.aggregations.AggregationVariant

Field Details

_DESERIALIZER

Method Details

of

_aggregationKind

backgroundFilter

chiSquare

exclude

executionHint

field

filterDuplicateText

gnd

include

jlh

minDocCount

mutualInformation

percentage

scriptHeuristic

shardMinDocCount

shardSize

size

sourceFields

serializeInternal

setupSignificantTextAggregationDeserializer