epic.preprocess

JavaWordTokenizer

Related Docs: object JavaWordTokenizer | package preprocess

class JavaWordTokenizer extends Tokenizer

A word segmenter backed by Java's BreakIterator. Given an input string, it returns the word tokens as an IndexedSeq[String]. Whitespace is not returned; punctuation is.
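
The behaviour described above can be approximated with the JDK alone. The following is a minimal sketch, not epic's actual implementation: SketchWordTokenizer is a hypothetical stand-in that assumes the class walks the boundaries of BreakIterator.getWordInstance(locale) and discards whitespace-only segments. Its two constructors mirror the no-argument and Locale constructors listed below.

```scala
import java.text.BreakIterator
import java.util.Locale
import scala.collection.mutable.ArrayBuffer

// Hypothetical stand-in for illustration only; this is NOT epic's JavaWordTokenizer.
class SketchWordTokenizer(locale: Locale) extends (String => IndexedSeq[String]) {

  // Assumption: the no-argument form falls back to the JVM default locale.
  def this() = this(Locale.getDefault)

  def apply(text: String): IndexedSeq[String] = {
    val breaker = BreakIterator.getWordInstance(locale)
    breaker.setText(text)

    val tokens = ArrayBuffer.empty[String]
    var start = breaker.first()
    var end = breaker.next()
    while (end != BreakIterator.DONE) {
      val segment = text.substring(start, end)
      // Keep words and punctuation; drop segments that are only whitespace.
      if (!segment.forall(_.isWhitespace)) tokens += segment
      start = end
      end = breaker.next()
    }
    tokens.toVector
  }
}

object SketchWordTokenizerDemo {
  def main(args: Array[String]): Unit = {
    val tokenize = new SketchWordTokenizer(Locale.ENGLISH)
    // Prints one token per line.
    tokenize("Hello, world!").foreach(println)
  }
}
```

Because boundary rules depend on the locale, the same input may segment differently under another Locale, which is presumably why the real class accepts one in its constructor.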

Linear Supertypes
Tokenizer, (String) ⇒ IndexedSeq[String], Serializable, Serializable, AnalysisFunction[String, Span, Sentence, Token], AnyRef, Any

Instance Constructors

  1. new JavaWordTokenizer()

  2. new JavaWordTokenizer(locale: Locale)

Value Members

  1. final def !=(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  4. def andThen[A](g: (IndexedSeq[String]) ⇒ A): (String) ⇒ A

    Definition Classes
    Function1
    Annotations
    @unspecialized()
  5. def andThen[II >: Sentence with Token, OO](other: AnalysisFunction[String, Span, II, OO]): AnalysisFunction[String, Span, Sentence, Token with OO]

    Definition Classes
    AnalysisFunction
  6. def apply[In <: Sentence](slab: StringSlab[In]): StringSlab[In with Token]

    Definition Classes
    JavaWordTokenizer → AnalysisFunction
  7. def apply(a: String): IndexedSeq[String]

    Definition Classes
    Tokenizer → Function1
  8. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  9. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @HotSpotIntrinsicCandidate() @throws( ... )
  10. def compose[A](g: (A) ⇒ String): (A) ⇒ IndexedSeq[String]

    Definition Classes
    Function1
    Annotations
    @unspecialized()
  11. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  12. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  13. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
    Annotations
    @HotSpotIntrinsicCandidate()
  14. def hashCode(): Int

    Definition Classes
    AnyRef → Any
    Annotations
    @HotSpotIntrinsicCandidate()
  15. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  16. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  17. final def notify(): Unit

    Definition Classes
    AnyRef
    Annotations
    @HotSpotIntrinsicCandidate()
  18. final def notifyAll(): Unit

    Definition Classes
    AnyRef
    Annotations
    @HotSpotIntrinsicCandidate()
  19. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  20. def toString(): String

    Definition Classes
    Tokenizer → Function1 → AnyRef → Any
  21. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  22. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  23. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
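
Since the class is a (String) ⇒ IndexedSeq[String], the Function1 members andThen and compose listed above chain it with ordinary functions. The sketch below shows the shape of that composition using a hypothetical whitespace-splitting stand-in, so it runs without epic on the classpath. With the real class, the second andThen overload inherited from AnalysisFunction may require an explicitly typed function argument.

```scala
object TokenizerCompositionSketch {

  // Hypothetical stand-in with the same function type as the tokenizer.
  // It only splits on whitespace; it does not separate punctuation.
  val tokenize: String => IndexedSeq[String] =
    (s: String) => s.split("\\s+").filter(_.nonEmpty).toVector

  // andThen: post-process the token sequence (here, count the tokens).
  val countTokens: String => Int =
    tokenize.andThen((tokens: IndexedSeq[String]) => tokens.length)

  // compose: pre-process the raw string before tokenizing (here, lower-case it).
  val tokenizeLowercase: String => IndexedSeq[String] =
    tokenize.compose((s: String) => s.toLowerCase)

  def main(args: Array[String]): Unit = {
    println(countTokens("one two three"))
    println(tokenizeLowercase("Mixed CASE input").mkString(" "))
  }
}
```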

Deprecated Value Members

  1. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @Deprecated @deprecated @throws( classOf[java.lang.Throwable] )
    Deprecated

    (Since version ) see corresponding Javadoc for more information.
