public interface StringMapFunctions extends Column
Modifier and Type | Method and Description |
---|---|
default StringColumn |
abbreviate(int maxWidth)
Abbreviates a String using ellipses.
|
default StringColumn |
commonPrefix(Column column2) |
default StringColumn |
commonSuffix(Column column2) |
default StringColumn |
concatenate(String append)
Return a copy of this column with the given string appended to each element
|
default NumberColumn |
countTokens(String separator) |
default NumberColumn |
distance(Column column2)
Returns a column containing the levenshtein distance between the two given string columns
|
default StringColumn |
format(String formatString) |
default StringColumn |
join(String separator,
StringColumn... columns)
Return a copy of this column with the given string appended
|
default NumberColumn |
length()
Returns a column containing the character length of each string in this column
The returned column is the same size as the original
|
default StringColumn |
lowerCase() |
default StringColumn |
padEnd(int minLength,
char padChar) |
default StringColumn |
padStart(int minLength,
char padChar) |
default StringColumn |
replaceAll(String[] regexArray,
String replacement)
Creates a new column, replacing each string in this column with a new string formed by
replacing any substring that matches the regex
|
default StringColumn |
replaceAll(String regex,
String replacement) |
default StringColumn |
replaceFirst(String regex,
String replacement) |
default StringColumn |
substring(int start)
Returns a column containing the substrings from start to the end of the input
|
default StringColumn |
substring(int start,
int end) |
default StringColumn |
tokenizeAndRemoveDuplicates(String separator) |
default StringColumn |
tokenizeAndSort()
Splits on Whitespace and returns the lexicographically sorted result.
|
default StringColumn |
tokenizeAndSort(String separator) |
default StringColumn |
tokens(String separator)
Returns a column of arbitrary size containing each token in this column, where a token is defined using the
given separator.
|
default StringColumn |
trim() |
default StringColumn |
uniqueTokens(String separator)
Returns a column of arbitrary size containing each unique token in this column, where a token is defined using the
given separator, and uniqueness is calculated across the entire column
NOTE: Unlike other map functions, this method produces a column whose size may be different from the source,
so they cannot safely be combined in a table.
|
default StringColumn |
upperCase() |
append, appendCell, appendCell, appendMissing, asBytes, asDoubleArray, asObjectArray, byteSize, clear, columnWidth, copy, countMissing, countUnique, create, emptyCopy, emptyCopy, first, getDouble, getObject, getString, getUnformattedString, inRange, isEmpty, isMissing, isMissing, isNotMissing, lag, last, lead, name, print, removeMissing, rolling, rowComparator, rows, sampleN, sampleX, setName, size, sortAscending, sortDescending, subset, summarizeIf, summary, title, type, unique, where
default StringColumn upperCase()
default StringColumn lowerCase()
default StringColumn trim()
default StringColumn replaceAll(String regex, String replacement)
default StringColumn replaceFirst(String regex, String replacement)
default StringColumn substring(int start, int end)
default StringColumn substring(int start)
StringIndexOutOfBoundsException
- if any string in the column is shorter than startdefault StringColumn abbreviate(int maxWidth)
maxWidth
- the maximum width of the resulting strings, including the elipses.default StringColumn format(String formatString)
default StringColumn padEnd(int minLength, char padChar)
default StringColumn padStart(int minLength, char padChar)
default StringColumn commonPrefix(Column column2)
default StringColumn commonSuffix(Column column2)
default NumberColumn distance(Column column2)
default StringColumn join(String separator, StringColumn... columns)
columns
- the column to appenddefault StringColumn concatenate(String append)
append
- the string to appenddefault StringColumn replaceAll(String[] regexArray, String replacement)
regexArray
- the regex array to replacereplacement
- the replacement arraydefault StringColumn tokenizeAndSort(String separator)
default NumberColumn countTokens(String separator)
default StringColumn uniqueTokens(String separator)
separator
- the delimiter used in the tokenizing operationdefault StringColumn tokens(String separator)
separator
- the delimiter used in the tokenizing operationdefault NumberColumn length()
default StringColumn tokenizeAndSort()
StringColumn
default StringColumn tokenizeAndRemoveDuplicates(String separator)
Copyright © 2018. All rights reserved.