Class FindStructureRequest<TJsonDocument>

java.lang.Object
co.elastic.clients.elasticsearch.text_structure.FindStructureRequest<TJsonDocument>
All Implemented Interfaces:
JsonpSerializable

public final class FindStructureRequest<TJsonDocument>
extends java.lang.Object
implements JsonpSerializable
  • Nested Class Summary

    Nested Classes
    Modifier and Type Class Description
    static class  FindStructureRequest.Builder<TJsonDocument>
  • Field Summary

    Fields
    Modifier and Type Field Description
    static Endpoint<FindStructureRequest<?>,​FindStructureResponse,​ElasticsearchError> ENDPOINT
    Endpoint "text_structure.find_structure".
  • Constructor Summary

  • Method Summary

    Modifier and Type Method Description
    java.lang.String charset()
    The text’s character set.
    java.lang.String columnNames()
    If you have set format to delimited, you can specify the column names in a comma-separated list.
    static <TJsonDocument>
    JsonpDeserializer<FindStructureRequest<TJsonDocument>>
    createFindStructureRequestDeserializer​(JsonpDeserializer<TJsonDocument> tJsonDocumentDeserializer)  
    java.lang.String delimiter()
    If you have set format to delimited, you can specify the character used to delimit the values in each row.
    java.lang.Boolean explain()
    If this parameter is set to true, the response includes a field named explanation, which is an array of strings that indicate how the structure finder produced its result.
    java.lang.String format()
    The high level structure of the text.
    java.lang.String grokPattern()
    If you have set format to semi_structured_text, you can specify a Grok pattern that is used to extract fields from every message in the text.
    java.lang.Boolean hasHeaderRow()
    If you have set format to delimited, you can use this parameter to indicate whether the column names are in the first row of the text.
    java.lang.Number lineMergeSizeLimit()
    The maximum number of characters in a message when lines are merged to form messages while analyzing semi-structured text.
    java.lang.Number linesToSample()
    The number of lines to include in the structural analysis, starting from the beginning of the text.
    java.lang.String quote()
    If you have set format to delimited, you can specify the character used to quote the values in each row if they contain newlines or the delimiter character.
    void serialize​(jakarta.json.stream.JsonGenerator generator, JsonpMapper mapper)
    Serialize this value to JSON.
    java.lang.Boolean shouldTrimFields()
    If you have set format to delimited, you can specify whether values between delimiters should have whitespace trimmed from them.
    java.util.List<TJsonDocument> textFiles()
    Required - Request body.
    java.lang.String timeout()
    Sets the maximum amount of time that the structure analysis make take.
    java.lang.String timestampField()
    Optional parameter to specify the timestamp field in the file
    java.lang.String timestampFormat()
    The Java time format of the timestamp field in the text.

    Methods inherited from class java.lang.Object

    clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
  • Field Details

  • Constructor Details

  • Method Details

    • charset

      @Nullable public java.lang.String charset()
      The text’s character set. It must be a character set that is supported by the JVM that Elasticsearch uses. For example, UTF-8, UTF-16LE, windows-1252, or EUC-JP. If this parameter is not specified, the structure finder chooses an appropriate character set.

      API name: charset

    • columnNames

      @Nullable public java.lang.String columnNames()
      If you have set format to delimited, you can specify the column names in a comma-separated list. If this parameter is not specified, the structure finder uses the column names from the header row of the text. If the text does not have a header role, columns are named "column1", "column2", "column3", etc.

      API name: column_names

    • delimiter

      @Nullable public java.lang.String delimiter()
      If you have set format to delimited, you can specify the character used to delimit the values in each row. Only a single character is supported; the delimiter cannot have multiple characters. By default, the API considers the following possibilities: comma, tab, semi-colon, and pipe (|). In this default scenario, all rows must have the same number of fields for the delimited format to be detected. If you specify a delimiter, up to 10% of the rows can have a different number of columns than the first row.

      API name: delimiter

    • explain

      @Nullable public java.lang.Boolean explain()
      If this parameter is set to true, the response includes a field named explanation, which is an array of strings that indicate how the structure finder produced its result.

      API name: explain

    • format

      @Nullable public java.lang.String format()
      The high level structure of the text. Valid values are ndjson, xml, delimited, and semi_structured_text. By default, the API chooses the format. In this default scenario, all rows must have the same number of fields for a delimited format to be detected. If the format is set to delimited and the delimiter is not set, however, the API tolerates up to 5% of rows that have a different number of columns than the first row.

      API name: format

    • grokPattern

      @Nullable public java.lang.String grokPattern()
      If you have set format to semi_structured_text, you can specify a Grok pattern that is used to extract fields from every message in the text. The name of the timestamp field in the Grok pattern must match what is specified in the timestamp_field parameter. If that parameter is not specified, the name of the timestamp field in the Grok pattern must match "timestamp". If grok_pattern is not specified, the structure finder creates a Grok pattern.

      API name: grok_pattern

    • hasHeaderRow

      @Nullable public java.lang.Boolean hasHeaderRow()
      If you have set format to delimited, you can use this parameter to indicate whether the column names are in the first row of the text. If this parameter is not specified, the structure finder guesses based on the similarity of the first row of the text to other rows.

      API name: has_header_row

    • lineMergeSizeLimit

      @Nullable public java.lang.Number lineMergeSizeLimit()
      The maximum number of characters in a message when lines are merged to form messages while analyzing semi-structured text. If you have extremely long messages you may need to increase this, but be aware that this may lead to very long processing times if the way to group lines into messages is misdetected.

      API name: line_merge_size_limit

    • linesToSample

      @Nullable public java.lang.Number linesToSample()
      The number of lines to include in the structural analysis, starting from the beginning of the text. The minimum is 2; If the value of this parameter is greater than the number of lines in the text, the analysis proceeds (as long as there are at least two lines in the text) for all of the lines.

      API name: lines_to_sample

    • quote

      @Nullable public java.lang.String quote()
      If you have set format to delimited, you can specify the character used to quote the values in each row if they contain newlines or the delimiter character. Only a single character is supported. If this parameter is not specified, the default value is a double quote ("). If your delimited text format does not use quoting, a workaround is to set this argument to a character that does not appear anywhere in the sample.

      API name: quote

    • shouldTrimFields

      @Nullable public java.lang.Boolean shouldTrimFields()
      If you have set format to delimited, you can specify whether values between delimiters should have whitespace trimmed from them. If this parameter is not specified and the delimiter is pipe (|), the default value is true. Otherwise, the default value is false.

      API name: should_trim_fields

    • timeout

      @Nullable public java.lang.String timeout()
      Sets the maximum amount of time that the structure analysis make take. If the analysis is still running when the timeout expires then it will be aborted.

      API name: timeout

    • timestampField

      @Nullable public java.lang.String timestampField()
      Optional parameter to specify the timestamp field in the file

      API name: timestamp_field

    • timestampFormat

      @Nullable public java.lang.String timestampFormat()
      The Java time format of the timestamp field in the text.

      API name: timestamp_format

    • textFiles

      public java.util.List<TJsonDocument> textFiles()
      Required - Request body.

      API name: _value_body

    • serialize

      public void serialize​(jakarta.json.stream.JsonGenerator generator, JsonpMapper mapper)
      Serialize this value to JSON.
      Specified by:
      serialize in interface JsonpSerializable
    • createFindStructureRequestDeserializer

      public static <TJsonDocument> JsonpDeserializer<FindStructureRequest<TJsonDocument>> createFindStructureRequestDeserializer​(JsonpDeserializer<TJsonDocument> tJsonDocumentDeserializer)