Package

org.mitre.jcarafe

tokenizer


package tokenizer


Type Members

  1. sealed case class Category(name: String, set: Set[String]) extends PatternElement with Product with Serializable

  2. case class ComplexTok(s: String, atts: Map[String, String]) extends Element with Product with Serializable

  3. sealed abstract class Element extends Positional

  4. case class EndWs(s: String) extends Element with Product with Serializable

  5. case class HardEndTok(s: String) extends Element with Product with Serializable

  6. case class IgnoreBlock(s: String) extends Element with Product with Serializable

  7. case class Matching(s: String, pos: Int) extends Product with Serializable

  8. class MergePatternSequence extends PatternSequence

  9. class MergeTokenizerAugmenterPattern extends TokenizerAugmenterPattern

  10. abstract class PatternElement extends AnyRef

  11. class PatternSequence extends AnyRef

  12. sealed case class R(reg: Regex) extends PatternElement with Product with Serializable

  13. sealed case class Rep(reg: Regex) extends PatternElement with Product with Serializable

  14. sealed case class S(str: String) extends PatternElement with Product with Serializable

  15. case class SoftEndTok(s: String) extends Element with Product with Serializable

  16. class SplitPatternSequence extends PatternSequence

  17. class SplitTokenizerAugmenterPattern extends TokenizerAugmenterPattern

  18. case class Tag(s: String, b: Boolean) extends Element with Product with Serializable


    The Tag case class represents an SGML element.

    s

    The tag element string (including attributes for start tags)

    b

    true if this tag is a start tag; false otherwise

  19. case class Tok(s: String) extends Element with Product with Serializable

  20. class TokenTagUpcaseTransmuter extends TokenTransmuter

  21. abstract class TokenTransmuter extends AnyRef

  22. abstract class Tokenizer extends RegexParsers

  23. abstract class TokenizerAugmenterPattern extends AnyRef

  24. class TokenizerOptions extends CommandLineHandler

  25. class TokenizerPatternParser extends RegexParsers

  26. case class Ws(s: String) extends Element with Product with Serializable

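The Element subclasses listed above are case classes, so a token stream can be taken apart with pattern matching. The following is a minimal sketch, not code from the package: the classes are local stand-ins that copy only the constructor signatures shown in this listing (the real ones also extend Positional), and the `ElementSketch` object with its `render` and `startTags` helpers is hypothetical.

```scala
// Local stand-ins mirroring the signatures listed above. The real classes
// live in org.mitre.jcarafe.tokenizer; only three of them are copied here.
sealed abstract class Element
case class Tok(s: String) extends Element
case class Ws(s: String) extends Element
case class Tag(s: String, b: Boolean) extends Element

// Hypothetical helpers, for illustration only.
object ElementSketch {
  // Rebuild the surface text by concatenating each element's string.
  def render(elems: Seq[Element]): String =
    elems.map {
      case Tok(s)    => s
      case Ws(s)     => s
      case Tag(s, _) => s
    }.mkString

  // Keep only start tags, i.e. Tag values whose Boolean field is true.
  def startTags(elems: Seq[Element]): Seq[String] =
    elems.collect { case Tag(s, true) => s }

  def main(args: Array[String]): Unit = {
    val elems = Seq(
      Tag("<name type=\"person\">", true), // start tag, attributes included in s
      Tok("Ada"),
      Ws(" "),
      Tok("Lovelace"),
      Tag("</name>", false)                // end tag
    )
    println(render(elems))
    println(startTags(elems))
  }
}
```

Because the stand-in Element is sealed, the compiler can check that the match in `render` covers every subclass defined in the sketch.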

Value Members

  1. object CharStr

  2. object FastTokenizer

  3. object MergeTokenizerAugmenterPattern

  4. object MoveAnnotations


    Utility application that takes a tagged file and moves tags outward so that no tag falls inside what a simple tokenizer would regard as a single token.

  5. object RawTokenizer extends Tokenizer

  6. object Recurse extends PatternElement with Product with Serializable

  7. object SplitTokenizerAugmenterPattern

  8. object Tk extends PatternElement with Product with Serializable

  9. object TokenTagUpcaseTransmuter

  10. object WhiteSpaceTokenizer

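The MoveAnnotations entry describes moving tags outward to token boundaries. One possible reading of that idea is sketched below. It is not the package's implementation: it assumes that a "simple tokenizer" splits on whitespace only, and every name in it (`Piece`, `Chars`, `Space`, `Mark`, `MoveOutwardSketch`) is hypothetical.

```scala
// Hypothetical input model: text already split into non-whitespace runs,
// whitespace runs, and tags (start = true for a start tag).
sealed trait Piece
case class Chars(s: String) extends Piece
case class Space(s: String) extends Piece
case class Mark(s: String, start: Boolean) extends Piece

object MoveOutwardSketch {
  // Reorder one whitespace-free run so that start tags come first, the
  // joined text second, and end tags last. Runs with no text are left alone.
  private def fixRun(run: Vector[Piece]): Vector[Piece] = {
    val starts: Vector[Piece] = run.collect { case m @ Mark(_, true) => m }
    val ends: Vector[Piece]   = run.collect { case m @ Mark(_, false) => m }
    val text = run.collect { case Chars(s) => s }.mkString
    if (text.isEmpty) run
    else (starts :+ Chars(text)) ++ ends
  }

  // Walk the pieces, treating whitespace as the only token boundary
  // (an assumption of this sketch), and fix each run in between.
  def moveOutward(pieces: Seq[Piece]): Vector[Piece] = {
    val out = Vector.newBuilder[Piece]
    var run = Vector.empty[Piece]
    for (p <- pieces) p match {
      case sp: Space =>
        out ++= fixRun(run)
        run = Vector.empty
        out += sp
      case other =>
        run = run :+ other
    }
    out ++= fixRun(run)
    out.result()
  }
}
```

For example, the pieces for `un<e>tagged</e> text` come back as the pieces for `<e>untagged</e> text`, so the tag pair now wraps the whole whitespace-delimited token. The sketch makes no attempt to preserve tag nesting when several tags share one run.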
