Class DataDescription
java.lang.Object
org.elasticsearch.client.ml.job.config.DataDescription
- All Implemented Interfaces:
org.elasticsearch.common.xcontent.ToXContent, org.elasticsearch.common.xcontent.ToXContentObject
public class DataDescription
extends java.lang.Object
implements org.elasticsearch.common.xcontent.ToXContentObject
Describes the format of the data used in the job and how it should
be interpreted by the ML job.
getTimeField() is the name of the field containing the timestamp, and
getTimeFormat() is the format code for the date string, as described by
DateTimeFormatter.
-
Nested Class Summary
Nested Classes
Modifier and Type | Class | Description
static class | DataDescription.Builder |
static class | DataDescription.DataFormat | Enum of the acceptable data formats.
-
Field Summary
Fields
Modifier and Type | Field | Description
static char | DEFAULT_DELIMITER | The default field delimiter expected by the native autodetect program.
static char | DEFAULT_QUOTE_CHAR | The default quote character used to escape text in delimited data formats.
static java.lang.String | DEFAULT_TIME_FIELD | By default autodetect expects the timestamp in a field with this name.
static java.lang.String | EPOCH | Special time format string for epoch times (seconds).
static java.lang.String | EPOCH_MS | Special time format string for epoch times (milliseconds).
static org.elasticsearch.common.xcontent.ObjectParser<DataDescription.Builder,java.lang.Void> | PARSER |
-
Constructor Summary
Constructors
Constructor | Description
DataDescription(DataDescription.DataFormat dataFormat, java.lang.String timeFieldName, java.lang.String timeFormat, java.lang.Character fieldDelimiter, java.lang.Character quoteCharacter) |
-
Method Summary
Modifier and Type | Method | Description
boolean | equals(java.lang.Object other) | Overridden equality test.
java.lang.Character | getFieldDelimiter() | If the data is in a delimited format with a header, e.g. csv or tsv, this is the delimiter character used.
DataDescription.DataFormat | getFormat() | The format of the data to be processed.
java.lang.Character | getQuoteCharacter() | The quote character used in delimited formats.
java.lang.String | getTimeField() | The name of the field containing the timestamp.
java.lang.String | getTimeFormat() | Either "epoch", "epoch_ms" or a date format pattern as described by DateTimeFormatter.
int | hashCode() |
org.elasticsearch.common.xcontent.XContentBuilder | toXContent(org.elasticsearch.common.xcontent.XContentBuilder builder, org.elasticsearch.common.xcontent.ToXContent.Params params) |
-
Field Details
-
EPOCH
public static final java.lang.String EPOCH
Special time format string for epoch times (seconds).
See Also:
- Constant Field Values
-
EPOCH_MS
public static final java.lang.String EPOCH_MS
Special time format string for epoch times (milliseconds).
See Also:
- Constant Field Values
-
DEFAULT_TIME_FIELD
public static final java.lang.String DEFAULT_TIME_FIELD
By default autodetect expects the timestamp in a field with this name.
See Also:
- Constant Field Values
-
DEFAULT_DELIMITER
public static final char DEFAULT_DELIMITER
The default field delimiter expected by the native autodetect program.
See Also:
- Constant Field Values
-
DEFAULT_QUOTE_CHAR
public static final char DEFAULT_QUOTE_CHAR
The default quote character used to escape text in delimited data formats.
See Also:
- Constant Field Values
-
PARSER
public static final org.elasticsearch.common.xcontent.ObjectParser<DataDescription.Builder,java.lang.Void> PARSER
-
-
Constructor Details
-
DataDescription
public DataDescription(DataDescription.DataFormat dataFormat, java.lang.String timeFieldName, java.lang.String timeFormat, java.lang.Character fieldDelimiter, java.lang.Character quoteCharacter)
-
-
Method Details
-
toXContent
public org.elasticsearch.common.xcontent.XContentBuilder toXContent(org.elasticsearch.common.xcontent.XContentBuilder builder, org.elasticsearch.common.xcontent.ToXContent.Params params) throws java.io.IOException
Specified by:
- toXContent in interface org.elasticsearch.common.xcontent.ToXContent
Throws:
- java.io.IOException
-
getFormat
public DataDescription.DataFormat getFormat()
The format of the data to be processed. Defaults to DataDescription.DataFormat.XCONTENT.
Returns:
- The data format
-
getTimeField
public java.lang.String getTimeField()
The name of the field containing the timestamp.
Returns:
- A String if set or null
-
getTimeFormat
public java.lang.String getTimeFormat()
Either "epoch", "epoch_ms" or a date format pattern as described by DateTimeFormatter. If not set (is null or an empty string) or set to "epoch_ms" (the default) then the date is assumed to be in milliseconds from the epoch.
Returns:
- A String if set or null
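The three time format cases described for getTimeFormat() can be illustrated with a small standalone sketch. It does not use the Elasticsearch client, and it is not how the ML job itself parses timestamps: the class TimeFormatSketch and the method toEpochMillis are hypothetical names, and treating pattern-based timestamps without a zone as UTC is an assumption made only for this example.

```java
import java.time.LocalDateTime;
import java.time.ZoneOffset;
import java.time.format.DateTimeFormatter;

// Hypothetical helper, not part of the Elasticsearch client.
final class TimeFormatSketch {
    // The two special format strings named in the documentation above.
    static final String EPOCH = "epoch";
    static final String EPOCH_MS = "epoch_ms";

    // Converts a raw timestamp string to epoch milliseconds according to timeFormat.
    static long toEpochMillis(String rawTimestamp, String timeFormat) {
        // Not set (null or empty) or "epoch_ms": the value is already in milliseconds.
        if (timeFormat == null || timeFormat.isEmpty() || EPOCH_MS.equals(timeFormat)) {
            return Long.parseLong(rawTimestamp);
        }
        // "epoch": the value is in seconds.
        if (EPOCH.equals(timeFormat)) {
            return Long.parseLong(rawTimestamp) * 1000L;
        }
        // Otherwise treat timeFormat as a DateTimeFormatter pattern.
        // Assumption: the pattern carries a date and a time but no zone, so UTC is used.
        DateTimeFormatter formatter = DateTimeFormatter.ofPattern(timeFormat);
        return LocalDateTime.parse(rawTimestamp, formatter)
                .toInstant(ZoneOffset.UTC)
                .toEpochMilli();
    }

    public static void main(String[] args) {
        // All three calls describe the same instant.
        System.out.println(toEpochMillis("1700000000000", null));
        System.out.println(toEpochMillis("1700000000", EPOCH));
        System.out.println(toEpochMillis("2023-11-14 22:13:20", "yyyy-MM-dd HH:mm:ss"));
    }
}
```

The null and empty checks come first so that an unset format falls through to the documented default of milliseconds from the epoch.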
-
getFieldDelimiter
public java.lang.Character getFieldDelimiter()
If the data is in a delimited format with a header, e.g. csv or tsv, this is the delimiter character used. This is only applicable if getFormat() is DataDescription.DataFormat.DELIMITED. The default value for delimited format is 9 (the character code of the tab character).
Returns:
- A char
-
getQuoteCharacter
public java.lang.Character getQuoteCharacter()
The quote character used in delimited formats. The default value for delimited format is 34 (the character code of the double quote character).
Returns:
- The delimited format quote character
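The default values 9 and 34 are character codes. The standalone sketch below shows how a field delimiter and a quote character work together when splitting one line of delimited data. It does not use the Elasticsearch client and does not reproduce the native autodetect parser: the class DelimitedLineSketch and its split method are hypothetical, the two constants are inferred from the codes 9 and 34 rather than copied from the library, and escaped (doubled) quote characters are deliberately not handled.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of the Elasticsearch client.
final class DelimitedLineSketch {
    // Assumed values, inferred from the documented defaults 9 and 34.
    static final char DEFAULT_DELIMITER = '\t';  // character code 9
    static final char DEFAULT_QUOTE_CHAR = '"';  // character code 34

    // Splits a line on the delimiter, ignoring delimiters inside quoted text.
    // Quote characters are dropped from the output; doubled quotes are not supported.
    static List<String> split(String line, char delimiter, char quote) {
        List<String> fields = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        boolean inQuotes = false;
        for (int i = 0; i < line.length(); i++) {
            char c = line.charAt(i);
            if (c == quote) {
                inQuotes = !inQuotes;          // enter or leave a quoted region
            } else if (c == delimiter && !inQuotes) {
                fields.add(current.toString()); // field boundary
                current.setLength(0);
            } else {
                current.append(c);
            }
        }
        fields.add(current.toString());         // last field has no trailing delimiter
        return fields;
    }

    public static void main(String[] args) {
        System.out.println((int) DEFAULT_DELIMITER);
        System.out.println((int) DEFAULT_QUOTE_CHAR);
        // The tab inside the quoted field is kept as data, so three fields result.
        System.out.println(split("a\t\"b\tc\"\td", DEFAULT_DELIMITER, DEFAULT_QUOTE_CHAR).size());
    }
}
```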
-
equals
public boolean equals(java.lang.Object other)
Overridden equality test.
Overrides:
- equals in class java.lang.Object
-
hashCode
public int hashCode()
Overrides:
- hashCode in class java.lang.Object
-