Class

com.salesforce.op.stages.impl.feature

TextLenTransformer

Related Doc: package feature

class TextLenTransformer[T <: TextList] extends SequenceTransformer[T, OPVector] with VectorizerDefaults with TextTokenizerParams with TextParams

Sequence transformer that generates a sequence of text lengths (returned as an OPVector) from a sequence of TextList values (e.g., tokenized raw text).
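
To make the description above concrete, here is a minimal plain-Scala sketch of the idea. It is not the library's implementation: it assumes that the "length" of one TextList is the total number of characters across its tokens, models a TextList as a `Seq[String]` and the output vector as an `Array[Double]`, and uses a hypothetical helper name, `textLengths`.

```scala
object TextLenSketch {
  // Hypothetical stand-in: one TextList is modeled as a Seq[String] of tokens.
  type Tokens = Seq[String]

  // Assumption: the length of a token list is the total number of characters
  // across its tokens, so an empty list contributes 0.0.
  def textLengths(inputs: Seq[Tokens]): Array[Double] =
    inputs.map(tokens => tokens.map(_.length).sum.toDouble).toArray

  def main(args: Array[String]): Unit = {
    val inputs = Seq(Seq("hello", "world"), Seq.empty[String], Seq("op"))
    // One output slot per input feature, in input order.
    println(textLengths(inputs).mkString("[", ", ", "]")) // prints [10.0, 0.0, 2.0]
  }
}
```

Each input feature maps to exactly one slot of the output, which matches the `Seq[T] ⇒ OPVector` shape of `transformFn` listed among the value members.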

Linear Supertypes
TextParams, TextTokenizerParams, TextMatchingParams, LanguageDetectionParams, VectorizerDefaults, SequenceTransformer[T, OPVector], OpTransformerN[T, OPVector], OpTransformer, OpPipelineStageN[T, OPVector], HasOut[OPVector], HasInN, OpPipelineStage[OPVector], OpPipelineStageBase, MLWritable, OpPipelineStageParams, InputParams, Transformer, PipelineStage, Logging, Params, Serializable, Serializable, Identifiable, AnyRef, Any

Instance Constructors

  1. new TextLenTransformer(uid: String = UID[TextLenTransformer[_]])(implicit tti: scala.reflect.api.JavaUniverse.TypeTag[T], ttiv: scala.reflect.api.JavaUniverse.TypeTag[Seq[String]])

Type Members

  1. final type InputFeatures = Array[FeatureLike[T]]

    Definition Classes
    OpPipelineStageN → OpPipelineStage → InputParams
  2. type KeyValue = (String) ⇒ Any

    Definition Classes
    OpTransformer
  3. final type OutputFeatures = FeatureLike[OPVector]

    Definition Classes
    OpPipelineStage → OpPipelineStageBase

Value Members

  1. final def !=(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Definition Classes
    AnyRef → Any
  3. final def $[T](param: Param[T]): T

    Attributes
    protected
    Definition Classes
    Params
  4. final def ==(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  5. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  6. final val autoDetectLanguage: BooleanParam

    Indicates whether to attempt language detection.

    Definition Classes
    LanguageDetectionParams
  7. final val autoDetectThreshold: DoubleParam

    Language detection threshold. If none of the detected languages have confidence greater than the threshold then defaultLanguage is used.

    Definition Classes
    LanguageDetectionParams
  8. implicit def booleanToDouble(v: Boolean): Double

    Definition Classes
    VectorizerDefaults
  9. final def checkInputLength(features: Array[_]): Boolean

    Definition Classes
    OpPipelineStageN → InputParams
  10. final def checkSerializable: Try[Unit]

    Definition Classes
    OpTransformerN → OpPipelineStageBase
  11. final val cleanText: BooleanParam

    Definition Classes
    TextParams
  12. final def clear(param: Param[_]): TextLenTransformer.this.type

    Definition Classes
    Params
  13. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  14. final def copy(extra: ParamMap): TextLenTransformer.this.type

    Definition Classes
    OpPipelineStageBase → Params
  15. def copyValues[T <: Params](to: T, extra: ParamMap): T

    Attributes
    protected
    Definition Classes
    Params
  16. final def defaultCopy[T <: Params](extra: ParamMap): T

    Attributes
    protected
    Definition Classes
    Params
  17. final val defaultLanguage: Param[String]

    Default language to assume when autoDetectLanguage is disabled or fails to make a sufficiently confident prediction.

    Definition Classes
    LanguageDetectionParams
  18. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  19. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  20. def explainParam(param: Param[_]): String

    Definition Classes
    Params
  21. def explainParams(): String

    Definition Classes
    Params
  22. final def extractParamMap(): ParamMap

    Definition Classes
    Params
  23. final def extractParamMap(extra: ParamMap): ParamMap

    Definition Classes
    Params
  24. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  25. final def get[T](param: Param[T]): Option[T]

    Definition Classes
    Params
  26. def getAutoDetectLanguage: Boolean

    Definition Classes
    LanguageDetectionParams
  27. def getAutoDetectThreshold: Double

    Definition Classes
    LanguageDetectionParams
  28. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  29. final def getDefault[T](param: Param[T]): Option[T]

    Definition Classes
    Params
  30. def getDefaultLanguage: Language

    Definition Classes
    LanguageDetectionParams
  31. final def getInputFeature[T <: FeatureType](i: Int): Option[FeatureLike[T]]

    Definition Classes
    InputParams
  32. final def getInputFeatures(): Array[OPFeature]

    Definition Classes
    InputParams
  33. final def getInputSchema(): StructType

    Definition Classes
    OpPipelineStageParams
  34. final def getMetadata(): Metadata

    Definition Classes
    OpPipelineStageParams
  35. def getMinTokenLength: Int

    Definition Classes
    TextTokenizerParams
  36. final def getOrDefault[T](param: Param[T]): T

    Definition Classes
    Params
  37. def getOutput(): FeatureLike[OPVector]

    Definition Classes
    HasOut → OpPipelineStageBase
  38. final def getOutputFeatureName: String

    Definition Classes
    OpPipelineStage
  39. def getParam(paramName: String): Param[Any]

    Definition Classes
    Params
  40. def getStripHtml: Boolean

    Definition Classes
    TextTokenizerParams
  41. def getToLowercase: Boolean

    Definition Classes
    TextMatchingParams
  42. final def getTransientFeature(i: Int): Option[TransientFeature]

    Definition Classes
    InputParams
  43. final def getTransientFeatures(): Array[TransientFeature]

    Definition Classes
    InputParams
  44. final def hasDefault[T](param: Param[T]): Boolean

    Definition Classes
    Params
  45. def hasParam(paramName: String): Boolean

    Definition Classes
    Params
  46. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  47. final def inN: Array[TransientFeature]

    Attributes
    protected
    Definition Classes
    HasInN
  48. def initializeLogIfNecessary(isInterpreter: Boolean, silent: Boolean): Boolean

    Attributes
    protected
    Definition Classes
    Logging
  49. def initializeLogIfNecessary(isInterpreter: Boolean): Unit

    Attributes
    protected
    Definition Classes
    Logging
  50. final def inputAsArray(in: InputFeatures): Array[OPFeature]

    Definition Classes
    OpPipelineStageN → InputParams
  51. final def isDefined(param: Param[_]): Boolean

    Definition Classes
    Params
  52. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  53. final def isSet(param: Param[_]): Boolean

    Definition Classes
    Params
  54. def isTraceEnabled(): Boolean

    Attributes
    protected
    Definition Classes
    Logging
  55. def log: Logger

    Attributes
    protected
    Definition Classes
    Logging
  56. def logDebug(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  57. def logDebug(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  58. def logError(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  59. def logError(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  60. def logInfo(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  61. def logInfo(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  62. def logName: String

    Attributes
    protected
    Definition Classes
    Logging
  63. def logTrace(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  64. def logTrace(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  65. def logWarning(msg: ⇒ String, throwable: Throwable): Unit

    Attributes
    protected
    Definition Classes
    Logging
  66. def logWarning(msg: ⇒ String): Unit

    Attributes
    protected
    Definition Classes
    Logging
  67. final val minTokenLength: IntParam

    Minimum token length, >= 1.

    Definition Classes
    TextTokenizerParams
  68. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  69. final def notify(): Unit

    Definition Classes
    AnyRef
  70. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  71. def onGetMetadata(): Unit

    Definition Classes
    TextLenTransformer → OpPipelineStageParams
  72. def onSetInput(): Unit

    Definition Classes
    VectorizerDefaults → InputParams
  73. val operationName: String

    Definition Classes
    SequenceTransformer → OpPipelineStageBase
  74. final def outputAsArray(out: OutputFeatures): Array[OPFeature]

    Definition Classes
    OpPipelineStage → OpPipelineStageBase
  75. def outputFeatureUid: String

    Attributes
    protected[com.salesforce.op]
    Definition Classes
    OpPipelineStageN → OpPipelineStage
  76. def outputIsResponse: Boolean

    Definition Classes
    OpPipelineStage
  77. def outputVectorMeta: OpVectorMetadata

    Get the metadata describing the output vector

    This does not trigger onGetMetadata()

    returns

    Metadata of output vector

    Attributes
    protected
    Definition Classes
    VectorizerDefaults
  78. lazy val params: Array[Param[_]]

    Definition Classes
    Params
  79. def save(path: String): Unit

    Definition Classes
    MLWritable
    Annotations
    @Since( "1.6.0" ) @throws( ... )
  80. final def set(paramPair: ParamPair[_]): TextLenTransformer.this.type

    Attributes
    protected
    Definition Classes
    Params
  81. final def set(param: String, value: Any): TextLenTransformer.this.type

    Attributes
    protected
    Definition Classes
    Params
  82. final def set[T](param: Param[T], value: T): TextLenTransformer.this.type

    Definition Classes
    Params
  83. def setAutoDetectLanguage(value: Boolean): TextLenTransformer.this.type

    Definition Classes
    LanguageDetectionParams
  84. def setAutoDetectThreshold(value: Double): TextLenTransformer.this.type

    Definition Classes
    LanguageDetectionParams
  85. def setCleanText(clean: Boolean): TextLenTransformer.this.type

    Definition Classes
    TextParams
  86. final def setDefault(paramPairs: ParamPair[_]*): TextLenTransformer.this.type

    Attributes
    protected
    Definition Classes
    Params
  87. final def setDefault[T](param: Param[T], value: T): TextLenTransformer.this.type

    Attributes
    protected
    Definition Classes
    Params
  88. def setDefaultLanguage(value: Language): TextLenTransformer.this.type

    Definition Classes
    LanguageDetectionParams
  89. final def setInput(features: FeatureLike[T]*): TextLenTransformer.this.type

    Definition Classes
    OpPipelineStageN
  90. final def setInput(features: InputFeatures): TextLenTransformer.this.type

    Definition Classes
    OpPipelineStageBase
  91. final def setInputFeatures[S <: OPFeature](features: Array[S]): TextLenTransformer.this.type

    Attributes
    protected
    Definition Classes
    InputParams
  92. final def setMetadata(m: Metadata): TextLenTransformer.this.type

    Definition Classes
    OpPipelineStageParams
  93. def setMinTokenLength(value: Int): TextLenTransformer.this.type

    Definition Classes
    TextTokenizerParams
  94. def setOutputFeatureName(name: String): TextLenTransformer.this.type

    Definition Classes
    OpPipelineStage
  95. def setStripHtml(value: Boolean): TextLenTransformer.this.type

    Definition Classes
    TextTokenizerParams
  96. def setToLowercase(value: Boolean): TextLenTransformer.this.type

    Definition Classes
    TextMatchingParams
  97. final def stageName: String

    Definition Classes
    OpPipelineStageBase
  98. final val stripHtml: BooleanParam

    Definition Classes
    TextTokenizerParams
  99. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  100. final val toLowercase: BooleanParam

    Indicates whether to convert all characters to lowercase before string operations.

    Definition Classes
    TextMatchingParams
  101. def toString(): String

    Definition Classes
    Identifiable → AnyRef → Any
  102. def tokenize(text: Text, languageDetector: LanguageDetector = TextTokenizer.LanguageDetector, analyzer: TextAnalyzer = ...): TextTokenizerResult

    Definition Classes
    TextTokenizerParams
  103. def transform(dataset: Dataset[_]): DataFrame

    Definition Classes
    OpTransformerN → Transformer
  104. def transform(dataset: Dataset[_], paramMap: ParamMap): DataFrame

    Definition Classes
    Transformer
    Annotations
    @Since( "2.0.0" )
  105. def transform(dataset: Dataset[_], firstParamPair: ParamPair[_], otherParamPairs: ParamPair[_]*): DataFrame

    Definition Classes
    Transformer
    Annotations
    @Since( "2.0.0" ) @varargs()
  106. def transformFn: (Seq[T]) ⇒ OPVector

    Definition Classes
    TextLenTransformer → OpTransformerN
  107. lazy val transformKeyValue: (KeyValue) ⇒ Any

    Definition Classes
    OpTransformerN → OpTransformer
  108. def transformMap: (Map[String, Any]) ⇒ Any

    Definition Classes
    OpTransformer
  109. def transformRow: (Row) ⇒ Any

    Definition Classes
    OpTransformer
  110. final def transformSchema(schema: StructType): StructType

    Definition Classes
    OpPipelineStageBase
  111. def transformSchema(schema: StructType, logging: Boolean): StructType

    Attributes
    protected
    Definition Classes
    PipelineStage
    Annotations
    @DeveloperApi()
  112. implicit val tti: scala.reflect.api.JavaUniverse.TypeTag[T]

    Definition Classes
    SequenceTransformer → OpTransformerN
  113. implicit val ttiv: scala.reflect.api.JavaUniverse.TypeTag[Seq[String]]

  114. implicit val tto: scala.reflect.api.JavaUniverse.TypeTag[OPVector]

    Definition Classes
    SequenceTransformer → HasOut
  115. implicit val ttov: scala.reflect.api.JavaUniverse.TypeTag[Value]

    Definition Classes
    SequenceTransformer → HasOut
  116. val uid: String

    Definition Classes
    SequenceTransformer → Identifiable
  117. def vectorMetadataFromInputFeatures: OpVectorMetadata

    Compute the output vector metadata only from the input features. Vectorizers use this to derive the full vector, including pivot columns or indicator features.

    returns

    Vector metadata from input features

    Attributes
    protected
    Definition Classes
    VectorizerDefaults
  118. def vectorMetadataWithNullIndicators: OpVectorMetadata

    Attributes
    protected
    Definition Classes
    VectorizerDefaults
  119. def vectorOutputName: String

    Get the name of the output vector

    returns

    Output vector name as a string

    Attributes
    protected
    Definition Classes
    VectorizerDefaults
  120. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  121. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  122. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  123. final def write: MLWriter

    Definition Classes
    OpPipelineStageBase → MLWritable
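
The interplay of autoDetectLanguage, autoDetectThreshold and defaultLanguage documented in the value members above can be illustrated with a small plain-Scala sketch. This is an illustration only, not the library's code: `chooseLanguage` is a hypothetical helper, languages are modeled as plain strings rather than the library's Language type, and picking the highest-confidence candidate is an assumption.

```scala
object LanguageFallbackSketch {
  // Hypothetical helper, not part of the library.
  // detected maps each candidate language to its confidence score.
  def chooseLanguage(
    detected: Map[String, Double],
    autoDetect: Boolean,
    threshold: Double,
    default: String
  ): String =
    if (!autoDetect) default // detection disabled: always use the default
    else {
      // Keep only candidates whose confidence is strictly greater than the threshold.
      val confident = detected.filter { case (_, confidence) => confidence > threshold }
      // Assumption: among confident candidates, the most confident one wins.
      if (confident.isEmpty) default else confident.maxBy(_._2)._1
    }
}
```

For example, with a threshold of 0.9 and a default of "de", a detection result of `Map("en" -> 0.5)` falls back to "de", while `Map("en" -> 0.95)` yields "en".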
