scala.xml.parsing

MarkupParser

trait MarkupParser extends MarkupParserCommon with TokenTests

An XML parser.

Parses XML 1.0, invokes callback methods of a MarkupHandler and returns whatever the markup handler returns. Use ConstructingParser if you just want to parse XML to construct instances of scala.xml.Node.

XML elements are returned to the caller, whereas DTD declarations, if handled, are collected through side effects.
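As a minimal sketch of the ConstructingParser route mentioned above, the following parses an in-memory string and reads the label of the root element. It assumes the scala-xml library is on the classpath; the object name ParseExample and the helper rootLabel are hypothetical names chosen for illustration.

```scala
import scala.io.Source
import scala.xml.parsing.ConstructingParser

object ParseExample {
  // Hypothetical helper: parse an in-memory document and return the root element's label.
  def rootLabel(xml: String): String = {
    // The second argument is preserveWS; false lets the parser drop surplus whitespace.
    val parser = ConstructingParser.fromSource(Source.fromString(xml), false)
    val doc = parser.document() // a scala.xml.Document
    doc.docElem.label
  }

  def main(args: Array[String]): Unit =
    println(rootLabel("<greeting lang=\"en\">hello</greeting>"))
}
```

ConstructingParser.fromSource is used here instead of calling the constructor directly because this parser expects one initial read of the input before parsing starts (see initialize in the member list).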

Self Type
MarkupParser with MarkupHandler
Version

1.0

Linear Supertypes
MarkupParserCommon, TokenTests, AnyRef, Any
Known Subclasses

Type Members

  1. type AttributesType = (MetaData, NamespaceBinding)

    Definition Classes
    MarkupParser → MarkupParserCommon
  2. type ElementType = NodeSeq

    Definition Classes
    MarkupParser → MarkupParserCommon
  3. type InputType = Source

    Definition Classes
    MarkupParser → MarkupParserCommon
  4. type NamespaceType = NamespaceBinding

    Definition Classes
    MarkupParser → MarkupParserCommon
  5. type PositionType = Int

    Definition Classes
    MarkupParser → MarkupParserCommon

Abstract Value Members

  1. abstract def externalSource(systemLiteral: String): Source

  2. abstract val input: Source

  3. abstract val preserveWS: Boolean

    If true, surplus whitespace is preserved rather than removed.

Concrete Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. def appendText(pos: Int, ts: NodeBuffer, txt: String): Unit

  7. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  8. def attrDecl(): Unit

    <! attlist := ATTLIST
  9. val cbuf: collection.mutable.StringBuilder

    character buffer, for names

    Attributes
    protected
  10. def ch: Char

    The library and compiler parsers had the interesting distinction of different behavior for nextch (a function for which there are a total of two plausible behaviors, so we know the design space was fully explored.) One of them returned the value of nextch before the increment and one of them the new value. So to unify code we have to at least temporarily abstract over the nextchs.

    Definition Classes
    MarkupParser → MarkupParserCommon
  11. def ch_returning_nextch: Char

    Attributes
    protected
    Definition Classes
    MarkupParser → MarkupParserCommon
  12. def checkPubID(s: String): Boolean

    Definition Classes
    TokenTests
  13. def checkSysID(s: String): Boolean

    Definition Classes
    TokenTests
  14. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  15. def content(pscope: NamespaceBinding): NodeSeq

    content1 ::=  '<' content1 | '&' charref ...
  16. def content1(pscope: NamespaceBinding, ts: NodeBuffer): Unit

    '<' content1 ::=  ...
  17. var curInput: Source

    Attributes
    protected
  18. var doc: Document

    Attributes
    protected
  19. def document(): Document

    [22]     prolog      ::= XMLDecl? Misc* (doctypedecl Misc*)?
    [23]     XMLDecl     ::= '<?xml' VersionInfo EncodingDecl? SDDecl? S? '?>'
    [24]     VersionInfo ::= S 'version' Eq ("'" VersionNum "'" | '"' VersionNum '"')
    [25]     Eq          ::= S? '=' S?
    [26]     VersionNum  ::= '1.0'
    [27]     Misc        ::= Comment | PI | S
  20. var dtd: DTD

  21. def element(pscope: NamespaceBinding): NodeSeq

  22. def element1(pscope: NamespaceBinding): NodeSeq

    '<' element ::= xmlTag1 '>'  { xmlExpr | '{' simpleExpr '}' } ETag
    | xmlTag1 '/' '>'
  23. def elementDecl(): Unit

    <! element := ELEMENT

  24. def entityDecl(): Unit

    <! entity := ENTITY
  25. def eof: Boolean

    Definition Classes
    MarkupParser → MarkupParserCommon
  26. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  27. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  28. def errorAndResult[T](msg: String, x: T): T

    Attributes
    protected
    Definition Classes
    MarkupParserCommon
  29. def errorNoEnd(tag: String): Nothing

    Definition Classes
    MarkupParser → MarkupParserCommon
  30. var extIndex: Int

  31. def extSubset(): Unit

  32. def externalID(): ExternalID

    externalID ::= SYSTEM S syslit
    | PUBLIC S pubid S syslit
  33. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  34. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  35. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  36. def initialize: MarkupParser.this

    As the current code requires you to call nextch once manually after construction, this method formalizes that suboptimal reality.

  37. var inpStack: List[Source]

    stack of inputs

  38. def intSubset(): Unit

    "rec-xml/#ExtSubset" pe references may not occur within markup declarations

  39. def isAlpha(c: Char): Boolean

    These are 99% sure to be redundant but refactoring on the safe side.

    Definition Classes
    TokenTests
  40. def isAlphaDigit(c: Char): Boolean

    Definition Classes
    TokenTests
  41. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  42. def isName(s: String): Boolean

    Name ::= ( Letter | '_' ) (NameChar)*

    See [5] of XML 1.0 specification.

    Definition Classes
    TokenTests
  43. def isNameChar(ch: Char): Boolean

    NameChar ::= Letter | Digit | '.' | '-' | '_' | ':'
    | CombiningChar | Extender

    See [4] and Appendix B of XML 1.0 specification.

    Definition Classes
    TokenTests
  44. def isNameStart(ch: Char): Boolean

    NameStart ::= ( Letter | '_' )

    where Letter means in one of the Unicode general categories { Ll, Lu, Lo, Lt, Nl }.

    A name may not start with ':'. See [3] and Appendix B of the XML 1.0 specification.

    Definition Classes
    TokenTests
  45. def isPubIDChar(ch: Char): Boolean

    Definition Classes
    TokenTests
  46. final def isSpace(cs: Seq[Char]): Boolean

    (#x20 | #x9 | #xD | #xA)+
    Definition Classes
    TokenTests
  47. final def isSpace(ch: Char): Boolean

    (#x20 | #x9 | #xD | #xA)
    Definition Classes
    TokenTests
  48. def isValidIANAEncoding(ianaEncoding: Seq[Char]): Boolean

    Returns true if the encoding name is a valid IANA encoding. This method does not verify that there is a decoder available for this encoding, only that the characters are valid for an IANA encoding name.

    ianaEncoding

    The IANA encoding name.

    Definition Classes
    TokenTests
  49. var lastChRead: Char

  50. def lookahead(): BufferedIterator[Char]

    Create a lookahead reader which does not influence the input

    Definition Classes
    MarkupParser → MarkupParserCommon
  51. def markupDecl(): Unit

  52. def markupDecl1(): Any

  53. def mkAttributes(name: String, pscope: NamespaceBinding): (MarkupParser.this)#AttributesType

    Definition Classes
    MarkupParser → MarkupParserCommon
  54. def mkProcInstr(position: Int, name: String, text: String): (MarkupParser.this)#ElementType

    Definition Classes
    MarkupParser → MarkupParserCommon
  55. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  56. var nextChNeeded: Boolean

    holds the next character

  57. def nextch(): Unit

    this method tells ch to get the next character when next called

    Definition Classes
    MarkupParser → MarkupParserCommon
  58. def notationDecl(): Unit

    'N' notationDecl ::= "OTATION"
  59. final def notify(): Unit

    Definition Classes
    AnyRef
  60. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  61. def parseDTD(): Unit

    parses document type declaration and assigns it to instance variable dtd.

    <! parseDTD ::= DOCTYPE name ... >
  62. def pop(): Unit

  63. var pos: Int

    holds the position in the source file

  64. def prolog(): (Option[String], Option[String], Option[Boolean])

    <? prolog ::= xml S?
    // this is a bit more lenient than necessary...
  65. def pubidLiteral(): String

    [12]       PubidLiteral ::=        '"' PubidChar* '"' | "'" (PubidChar - "'")* "'"
  66. def push(entityName: String): Unit

  67. def pushExternal(systemId: String): Unit

  68. def putChar(c: Char): collection.mutable.StringBuilder

    append Unicode character to name buffer

    Attributes
    protected
  69. var reachedEof: Boolean

  70. def reportSyntaxError(str: String): Unit

    Definition Classes
    MarkupParser → MarkupParserCommon
  71. def reportSyntaxError(pos: Int, str: String): Unit

    Definition Classes
    MarkupParser → MarkupParserCommon
  72. def reportValidationError(pos: Int, str: String): Unit

  73. def returning[T](x: T)(f: (T) ⇒ Unit): T

    Apply a function and return the passed value

    Definition Classes
    MarkupParserCommon
  74. def saving[A, B](getter: A, setter: (A) ⇒ Unit)(body: ⇒ B): B

    Execute body with a variable saved and restored after execution

    Definition Classes
    MarkupParserCommon
  75. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  76. def systemLiteral(): String

    attribute value, terminated by either ' or ". value may not contain <.

    AttValue     ::= `'` { _ } `'`
    | `"` { _ } `"`
  77. def textDecl(): (Option[String], Option[String])

    prolog, but without standalone

  78. var tmppos: Int

    holds temporary values of pos

    Definition Classes
    MarkupParser → MarkupParserCommon
  79. def toString(): String

    Definition Classes
    AnyRef → Any
  80. def truncatedError(msg: String): Nothing

    Definition Classes
    MarkupParser → MarkupParserCommon
  81. def unreachable: Nothing

    Attributes
    protected
    Definition Classes
    MarkupParserCommon
  82. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  83. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  84. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  85. def xAttributeValue(): String

    Definition Classes
    MarkupParserCommon
  86. def xAttributeValue(endCh: Char): String

    attribute value, terminated by either ' or ". value may not contain <.

    endCh

    either ' or "

    Definition Classes
    MarkupParserCommon
  87. def xAttributes(pscope: NamespaceBinding): (MetaData, NamespaceBinding)

    parse attribute and create namespace scope, metadata

    [41] Attributes    ::= { S Name Eq AttValue }
  88. def xCharData: NodeSeq

    '<! CharData ::= [CDATA[ ( {char} - {char}"]]>"{char} ) ']]>'
    
    see [15]
  89. def xCharRef: String

    Definition Classes
    MarkupParserCommon
  90. def xCharRef(it: Iterator[Char]): String

    Definition Classes
    MarkupParserCommon
  91. def xCharRef(ch: () ⇒ Char, nextch: () ⇒ Unit): String

    CharRef ::= "&#" '0'..'9' {'0'..'9'} ";" | "&#x" '0'..'9'|'A'..'F'|'a'..'f' { hexdigit } ";"

    see [66]

    Definition Classes
    MarkupParserCommon
  92. def xComment: NodeSeq

    Comment ::= '<!--' ((Char - '-') | ('-' (Char - '-')))* '-->'
    
    see [15]
  93. def xEQ(): Unit

    scan [S] '=' [S]

    Definition Classes
    MarkupParserCommon
  94. def xEndTag(startName: String): Unit

    [42] '<' xmlEndTag ::= '<' '/' Name S? '>'

    Definition Classes
    MarkupParserCommon
  95. def xEntityValue(): String

    entity value, terminated by either ' or ". value may not contain <.

    AttValue     ::= `'` { _  } `'`
    | `"` { _ } `"`
  96. def xHandleError(that: Char, msg: String): Unit

    Definition Classes
    MarkupParser → MarkupParserCommon
  97. def xName: String

    Name ::= (Letter | '_') (NameChar)*

    Strictly, Name ::= (Letter | '_' | ':') (NameChar)*, but a name starting with ':' cannot occur here.

    see [5] of XML 1.0 specification

    pre-condition: ch != ':' (assured by the definition of the XMLSTART token)
    post-condition: the name neither starts nor ends with ':'

    Definition Classes
    MarkupParserCommon
  98. def xProcInstr: (MarkupParser.this)#ElementType

    '<?' ProcInstr ::= Name [S ({Char} - ({Char} '?>' {Char}))] '?>'

    see [16]

    Definition Classes
    MarkupParserCommon
  99. def xSpace(): Unit

    scan [3] S ::= (#x20 | #x9 | #xD | #xA)+

    Definition Classes
    MarkupParserCommon
  100. def xSpaceOpt(): Unit

    skip optional space S?

    Definition Classes
    MarkupParserCommon
  101. def xTag(pscope: (MarkupParser.this)#NamespaceType): (String, (MarkupParser.this)#AttributesType)

    parse a start or empty tag.

    [40] STag         ::= '<' Name { S Attribute } [S]
    [44] EmptyElemTag ::= '<' Name { S Attribute } [S]

    Attributes
    protected
    Definition Classes
    MarkupParserCommon
  102. def xTakeUntil[T](handler: ((MarkupParser.this)#PositionType, String) ⇒ T, positioner: () ⇒ (MarkupParser.this)#PositionType, until: String): T

    Take characters from input stream until given String "until" is seen. Once seen, the accumulated characters are passed along with the current Position to the supplied handler function.

    Attributes
    protected
    Definition Classes
    MarkupParserCommon
  103. def xToken(that: Seq[Char]): Unit

    Definition Classes
    MarkupParserCommon
  104. def xToken(that: Char): Unit

    Definition Classes
    MarkupParserCommon
  105. def xmlProcInstr(): MetaData

    <? prolog ::= xml S ... ?>
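The two generic helpers inherited from MarkupParserCommon, returning and saving, are described above only in one line each. The following standalone sketch shows behaviour consistent with those descriptions and the listed signatures. It is an illustration, not the library source; the object name ParserHelpers and the demo values are assumptions.

```scala
object ParserHelpers {
  // Apply a side-effecting function to x, then hand back x itself.
  def returning[T](x: T)(f: T => Unit): T = { f(x); x }

  // Remember the current value, run body, then write the remembered value back.
  // The finally clause restores the value even if body throws (an assumption of this sketch).
  def saving[A, B](getter: A, setter: A => Unit)(body: => B): B = {
    val saved = getter
    try body
    finally setter(saved)
  }

  def main(args: Array[String]): Unit = {
    // Build and fill a buffer in a single expression.
    val sb = returning(new StringBuilder)(_.append("abc"))
    println(sb.toString)

    // Temporarily change a variable; it is restored once the body has run.
    var depth = 0
    val inside = saving(depth, (d: Int) => depth = d) {
      depth = 5
      depth * 2
    }
    println(s"inside = $inside, depth afterwards = $depth")
  }
}
```

Because getter is passed by value, the saved value is captured at the call site, before body runs. This is what lets a parser switch a field such as its current input for the duration of a nested parse and have it put back afterwards.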
