Package org.jsoup.parser
Class HtmlTreeBuilder
java.lang.Object
org.jsoup.parser.HtmlTreeBuilder
HTML Tree Builder; creates a DOM from Tokens.
-
Field Summary
Fields -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionprotected Element
Get the current element (last on the stack).protected boolean
currentElementIs
(String normalName) Checks if the Current Element's normal name equals the supplied name.protected void
If the parser is tracking errors, add an error at the current position.protected void
If the parser is tracking errors, add an error at the current position.protected void
initialiseParse
(Reader input, String baseUri, Parser parser) protected boolean
isContentForTagData
(String normalName) (An internal method, visible for Element.protected void
onNodeClosed
(Node node, org.jsoup.parser.Token token) Called by implementing TreeBuilders when a node is explicitly closed.protected void
onNodeInserted
(Node node, org.jsoup.parser.Token token) Called by implementing TreeBuilders when a node has been inserted.protected boolean
process
(org.jsoup.parser.Token token) protected boolean
processEndTag
(String name) protected boolean
processStartTag
(String name) boolean
processStartTag
(String name, Attributes attrs) protected void
protected Tag
tagFor
(String tagName, ParseSettings settings) toString()
-
Field Details
-
MaxScopeSearchDepth
public static final int MaxScopeSearchDepth- See Also:
-
parser
-
doc
-
stack
-
baseUri
-
currentToken
protected org.jsoup.parser.Token currentToken -
settings
-
seenTags
-
-
Constructor Details
-
HtmlTreeBuilder
public HtmlTreeBuilder()
-
-
Method Details
-
initialiseParse
@ParametersAreNonnullByDefault protected void initialiseParse(Reader input, String baseUri, Parser parser) -
process
protected boolean process(org.jsoup.parser.Token token) -
toString
-
isContentForTagData
(An internal method, visible for Element. For HTML parse, signals that script and style text should be treated as Data Nodes). -
runParser
protected void runParser() -
processStartTag
-
processStartTag
-
processEndTag
-
currentElement
Get the current element (last on the stack). If all items have been removed, returns the document instead (which might not actually be on the stack; use stack.size() == 0 to test if required.- Returns:
- the last element on the stack, if any; or the root document
-
currentElementIs
Checks if the Current Element's normal name equals the supplied name.- Parameters:
normalName
- name to check- Returns:
- true if there is a current element on the stack, and its name equals the supplied
-
error
If the parser is tracking errors, add an error at the current position.- Parameters:
msg
- error message
-
error
If the parser is tracking errors, add an error at the current position.- Parameters:
msg
- error message templateargs
- template arguments
-
tagFor
-
onNodeInserted
Called by implementing TreeBuilders when a node has been inserted. This implementation includes optionally tracking the source range of the node.- Parameters:
node
- the node that was just insertedtoken
- the (optional) token that created this node
-
onNodeClosed
Called by implementing TreeBuilders when a node is explicitly closed. This implementation includes optionally tracking the closing source range of the node.- Parameters:
node
- the node being closedtoken
- the end-tag token that closed this node
-