Index
All Classes and Interfaces|All Packages|Constant Field Values
A
- apply(String) - Method in class com.metreeca.xml.actions.Crawl
-
Crawls a site.
- apply(Node) - Method in class com.metreeca.xml.actions.Extract
- apply(Node) - Method in class com.metreeca.xml.actions.Untag
- apply(Node) - Method in class com.metreeca.xml.actions.XPath
B
- bool(String) - Method in class com.metreeca.xml.actions.XPath.Processor
-
Retrieves a boolean value from the target node.
C
- com.metreeca.xml.actions - package com.metreeca.xml.actions
-
XML/HTML processing actions.
- com.metreeca.xml.formats - package com.metreeca.xml.formats
-
XML/HTML message formats.
- Crawl - Class in com.metreeca.xml.actions
-
Site crawling.
- Crawl() - Constructor for class com.metreeca.xml.actions.Crawl
D
- decode(Message<?>) - Method in class com.metreeca.xml.formats.HTMLFormat
-
Decodes the HTML message body from the input stream supplied by the message InputFormat body, if one is available and the message Content-Type header is matched by HTMLFormat.MIMEPattern, taking into account the message charset.
- decode(Message<?>) - Method in class com.metreeca.xml.formats.XMLFormat
-
Decodes the XML message body from the input stream supplied by the message InputFormat body, if one is available and the message Content-Type header is matched by XMLFormat.MIMEPattern, taking into account the message charset.
- decode(CharSequence) - Static method in class com.metreeca.xml.actions.XPath
-
Decodes XML numeric entities.
- DefaultPrefix - Static variable in class com.metreeca.xml.actions.XPath
-
The prefix mapped to the default namespace of the target document ("_").
- document() - Method in class com.metreeca.xml.actions.XPath.Processor
-
Retrieves the target document.
E
- element(String) - Method in class com.metreeca.xml.actions.XPath.Processor
-
Retrieves an element value from the target node.
- elements(String) - Method in class com.metreeca.xml.actions.XPath.Processor
-
Retrieves element values from the target node.
- encode(M, Document) - Method in class com.metreeca.xml.formats.HTMLFormat
-
Configures the message Content-Type header to "text/html", unless already defined, and encodes the HTML value into the output stream accepted by the message OutputFormat body, taking into account the message charset.
- encode(M, Document) - Method in class com.metreeca.xml.formats.XMLFormat
-
Configures the message Content-Type header to "application/xml", unless already defined, and encodes the XML value into the output stream accepted by the message OutputFormat body, taking into account the message charset.
- Extract - Class in com.metreeca.xml.actions
-
Main X/HTML content extraction.
- Extract() - Constructor for class com.metreeca.xml.actions.Extract
F
- fetch(Fetch) - Method in class com.metreeca.xml.actions.Crawl
-
Configures the fetch action (defaults to Fetch).
- focus(Function<? super Node, Optional<Node>>) - Method in class com.metreeca.xml.actions.Crawl
-
Configures the content focus action (defaults to the identity function).
H
- html() - Static method in class com.metreeca.xml.formats.HTMLFormat
-
Creates an HTML message format.
- html(InputStream, String, String) - Static method in class com.metreeca.xml.formats.HTMLFormat
-
Parses an HTML document.
- html(O, String, String, Node) - Static method in class com.metreeca.xml.formats.HTMLFormat
-
Writes an HTML node.
- HTMLFormat - Class in com.metreeca.xml.formats
-
HTML message format.
L
- link(String) - Method in class com.metreeca.xml.actions.XPath.Processor
-
Retrieves a URI value from the target node.
- links(String) - Method in class com.metreeca.xml.actions.XPath.Processor
-
Retrieves URI values from the target node.
M
- mime() - Method in class com.metreeca.xml.formats.HTMLFormat
- mime() - Method in class com.metreeca.xml.formats.XMLFormat
- MIME - Static variable in class com.metreeca.xml.formats.HTMLFormat
-
The default MIME type for HTML messages ("text/html").
- MIME - Static variable in class com.metreeca.xml.formats.XMLFormat
-
The default MIME type for XML message bodies ("application/xml").
- MIMEPattern - Static variable in class com.metreeca.xml.formats.HTMLFormat
-
A pattern matching the HTML MIME type.
- MIMEPattern - Static variable in class com.metreeca.xml.formats.XMLFormat
-
A pattern matching XML-based MIME types, for instance application/rss+xml.
N
- node() - Method in class com.metreeca.xml.actions.XPath.Processor
-
Retrieves the target node.
- node(String) - Method in class com.metreeca.xml.actions.XPath.Processor
-
Retrieves a node value from the target node.
- nodes(String) - Method in class com.metreeca.xml.actions.XPath.Processor
-
Retrieves node values from the target node.
- number(String) - Method in class com.metreeca.xml.actions.XPath.Processor
-
Retrieves a numeric value from the target node.
P
- Processor(Node) - Constructor for class com.metreeca.xml.actions.XPath.Processor
-
Creates an XPath processor.
- prune(BiPredicate<String, String>) - Method in class com.metreeca.xml.actions.Crawl
-
Configures the prune action (defaults to always pass).
S
- string(String) - Method in class com.metreeca.xml.actions.XPath.Processor
-
Retrieves a textual value from the target node.
- strings(String) - Method in class com.metreeca.xml.actions.XPath.Processor
-
Retrieves textual values from the target node.
T
- threads(int) - Method in class com.metreeca.xml.actions.Crawl
-
Configures the number of concurrent requests (defaults to the number of processors).
U
- Untag - Class in com.metreeca.xml.actions
-
X/HTML to Markdown conversion.
- Untag() - Constructor for class com.metreeca.xml.actions.Untag
X
- xml() - Static method in class com.metreeca.xml.formats.XMLFormat
-
Creates an XML message format.
- xml(InputStream) - Static method in class com.metreeca.xml.formats.XMLFormat
-
Parses an XML document.
- xml(InputStream, String) - Static method in class com.metreeca.xml.formats.XMLFormat
-
Parses an XML document.
- xml(Source) - Static method in class com.metreeca.xml.formats.XMLFormat
-
Parses an XML document.
- xml(XMLReader) - Static method in class com.metreeca.xml.formats.XMLFormat
-
Creates an XML message format using a custom SAX parser.
- XMLFormat - Class in com.metreeca.xml.formats
-
XML message format.
- XPath<R> - Class in com.metreeca.xml.actions
-
XPath-based XML node processing.
- XPath(Function<XPath.Processor, R>) - Constructor for class com.metreeca.xml.actions.XPath
-
Creates an XPath-based node processing action.
- XPath.Processor - Class in com.metreeca.xml.actions
-
XPath processor.
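
The XPath.Processor entries in this index (string(String), strings(String), and so on) retrieve values from a target node by evaluating an XPath expression. The index lists only the signatures, so the following is a minimal sketch of that idea built solely on the standard JDK javax.xml.xpath API. It does not call com.metreeca.xml.actions.XPath, and the class XPathSketch and its helpers parse, string and strings are hypothetical names introduced for illustration only.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical illustration class; not part of com.metreeca.xml.
public class XPathSketch {

    // Parses an XML string into a DOM document with the JDK parser.
    static Document parse(String xml) throws Exception {
        return DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(xml)));
    }

    // Evaluates an XPath expression against the target node
    // and returns the string value of the result.
    static String string(Node node, String expression) throws Exception {
        XPath xpath = XPathFactory.newInstance().newXPath();
        return xpath.evaluate(expression, node);
    }

    // Evaluates an XPath expression against the target node
    // and returns the text content of every matching node.
    static List<String> strings(Node node, String expression) throws Exception {
        XPath xpath = XPathFactory.newInstance().newXPath();
        NodeList nodes = (NodeList) xpath.evaluate(expression, node, XPathConstants.NODESET);
        List<String> values = new ArrayList<>();
        for (int i = 0; i < nodes.getLength(); i++) {
            values.add(nodes.item(i).getTextContent());
        }
        return values;
    }

    public static void main(String[] args) throws Exception {
        Document document = parse("<feed><title>News</title><item>one</item><item>two</item></feed>");
        System.out.println(string(document, "/feed/title")); // prints "News"
        System.out.println(strings(document, "/feed/item")); // prints "[one, two]"
    }
}
```

The single-value and multi-value helpers mirror the singular/plural method pairs listed above (string/strings, node/nodes, link/links). How the library itself handles namespaces, such as the DefaultPrefix mapping, is not covered by this sketch.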