public final class NlpUtils
extends java.lang.Object
Modifier and Type | Method and Description |
---|---|
static boolean |
isControl(char c)
Check whether a character is is considered as a control character.
|
static boolean |
isPunctuation(char c)
Check whether a character is considered as a punctuation.
|
static boolean |
isWhiteSpace(char c)
Check whether a character is is considered as a whitespace.
|
public static boolean isWhiteSpace(char c)
tab, newline and unicode space characters are all considered as whitespace.
c
- input character to be checked.public static boolean isControl(char c)
tab, newline and ios control characters are all considered as control character.
c
- input character to be checked.public static boolean isPunctuation(char c)
We treat all non-letter/number ASCII as punctuation. Characters such as "^", "$", and "`" are not in the Unicode Punctuation class but we treat them as punctuation anyways, for consistency.
c
- input character to be checked