Class SignificantTextAggregation
java.lang.Object
co.elastic.clients.elasticsearch._types.aggregations.AggregationBase
co.elastic.clients.elasticsearch._types.aggregations.BucketAggregationBase
co.elastic.clients.elasticsearch._types.aggregations.SignificantTextAggregation
- All Implemented Interfaces:
AggregationVariant
,JsonpSerializable
@JsonpDeserializable
public class SignificantTextAggregation
extends BucketAggregationBase
implements AggregationVariant
- See Also:
-
Nested Class Summary
Nested classes/interfaces inherited from class co.elastic.clients.elasticsearch._types.aggregations.BucketAggregationBase
BucketAggregationBase.AbstractBuilder<BuilderT extends BucketAggregationBase.AbstractBuilder<BuilderT>>
-
Field Summary
Modifier and TypeFieldDescriptionstatic final JsonpDeserializer<SignificantTextAggregation>
Json deserializer forSignificantTextAggregation
-
Method Summary
Modifier and TypeMethodDescriptionAggregation variant kind.final Query
A background filter that can be used to focus in on significant terms within a narrower context, instead of the entire index.final ChiSquareHeuristic
Use Chi square, as described in "Information Retrieval", Manning et al., Chapter 13.5.2, as the significance score.final TermsExclude
exclude()
Values to exclude.Determines whether the aggregation will use field values directly or global ordinals.final String
field()
The field from which to return significant text.final Boolean
Whether to out duplicate text to deal with noisy data.gnd()
Use Google normalized distance as described in "The Google Similarity Distance", Cilibrasi and Vitanyi, 2007, as the significance score.final TermsInclude
include()
Values to include.final EmptyObject
jlh()
Use JLH score as the significance score.final Long
Only return values that are found in more thanmin_doc_count
hits.Use mutual information as described in "Information Retrieval", Manning et al., Chapter 13.5.1, as the significance score.static SignificantTextAggregation
final PercentageScoreHeuristic
A simple calculation of the number of documents in the foreground sample with a term divided by the number of documents in the background with the term.final ScriptedHeuristic
Customized score, implemented via a script.protected void
serializeInternal
(jakarta.json.stream.JsonGenerator generator, JsonpMapper mapper) protected static void
setupSignificantTextAggregationDeserializer
(ObjectDeserializer<SignificantTextAggregation.Builder> op) final Long
Regulates the certainty a shard has if the values should actually be added to the candidate list or not with respect to the min_doc_count.final Integer
The number of candidate terms produced by each shard.final Integer
size()
The number of buckets returned out of the overall terms list.Overrides the JSON_source
fields from which text will be analyzed.Methods inherited from class co.elastic.clients.elasticsearch._types.aggregations.BucketAggregationBase
setupBucketAggregationBaseDeserializer
Methods inherited from class co.elastic.clients.elasticsearch._types.aggregations.AggregationBase
meta, name, serialize, setupAggregationBaseDeserializer, toString
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
Methods inherited from interface co.elastic.clients.elasticsearch._types.aggregations.AggregationVariant
_toAggregation
-
Field Details
-
_DESERIALIZER
Json deserializer forSignificantTextAggregation
-
-
Method Details
-
of
public static SignificantTextAggregation of(Function<SignificantTextAggregation.Builder, ObjectBuilder<SignificantTextAggregation>> fn) -
_aggregationKind
Aggregation variant kind.- Specified by:
_aggregationKind
in interfaceAggregationVariant
-
backgroundFilter
A background filter that can be used to focus in on significant terms within a narrower context, instead of the entire index.API name:
background_filter
-
chiSquare
Use Chi square, as described in "Information Retrieval", Manning et al., Chapter 13.5.2, as the significance score.API name:
chi_square
-
exclude
Values to exclude.API name:
exclude
-
executionHint
Determines whether the aggregation will use field values directly or global ordinals.API name:
execution_hint
-
field
The field from which to return significant text.API name:
field
-
filterDuplicateText
Whether to out duplicate text to deal with noisy data.API name:
filter_duplicate_text
-
gnd
Use Google normalized distance as described in "The Google Similarity Distance", Cilibrasi and Vitanyi, 2007, as the significance score.API name:
gnd
-
include
Values to include.API name:
include
-
jlh
Use JLH score as the significance score.API name:
jlh
-
minDocCount
Only return values that are found in more thanmin_doc_count
hits.API name:
min_doc_count
-
mutualInformation
Use mutual information as described in "Information Retrieval", Manning et al., Chapter 13.5.1, as the significance score.API name:
mutual_information
-
percentage
A simple calculation of the number of documents in the foreground sample with a term divided by the number of documents in the background with the term.API name:
percentage
-
scriptHeuristic
Customized score, implemented via a script.API name:
script_heuristic
-
shardMinDocCount
Regulates the certainty a shard has if the values should actually be added to the candidate list or not with respect to the min_doc_count. Values will only be considered if their local shard frequency within the set is higher than theshard_min_doc_count
.API name:
shard_min_doc_count
-
shardSize
The number of candidate terms produced by each shard. By default,shard_size
will be automatically estimated based on the number of shards and thesize
parameter.API name:
shard_size
-
size
The number of buckets returned out of the overall terms list.API name:
size
-
sourceFields
Overrides the JSON_source
fields from which text will be analyzed.API name:
source_fields
-
serializeInternal
- Overrides:
serializeInternal
in classAggregationBase
-
setupSignificantTextAggregationDeserializer
protected static void setupSignificantTextAggregationDeserializer(ObjectDeserializer<SignificantTextAggregation.Builder> op)
-