Index

A B C D E F H L M N P S T U X 
All Classes and Interfaces|All Packages|Constant Field Values

A

apply(String) - Method in class com.metreeca.xml.actions.Crawl
Crawls a site.
apply(Node) - Method in class com.metreeca.xml.actions.Extract
 
apply(Node) - Method in class com.metreeca.xml.actions.Untag
 
apply(Node) - Method in class com.metreeca.xml.actions.XPath
 

B

bool(String) - Method in class com.metreeca.xml.actions.XPath.Processor
Retrieves a boolean value from the target node.

C

com.metreeca.xml.actions - package com.metreeca.xml.actions
XML/HTML processing actions.
com.metreeca.xml.formats - package com.metreeca.xml.formats
XML/HTML message formats.
Crawl - Class in com.metreeca.xml.actions
Site crawling.
Crawl() - Constructor for class com.metreeca.xml.actions.Crawl
 

D

decode(Message<?>) - Method in class com.metreeca.xml.formats.HTMLFormat
Decodes the HTML message body from the input stream supplied by the message InputFormat body, if one is available and the message Content-Type header is matched by HTMLFormat.MIMEPattern, taking into account the message charset
decode(Message<?>) - Method in class com.metreeca.xml.formats.XMLFormat
Decodes the XML message body from the input stream supplied by the message InputFormat body, if one is available and the message Content-Type header is matched by XMLFormat.MIMEPattern, taking into account the message charset
decode(CharSequence) - Static method in class com.metreeca.xml.actions.XPath
Decodes XML numeric entities.
DefaultPrefix - Static variable in class com.metreeca.xml.actions.XPath
The prefix mapped to the default namespace of the target document ("_").
document() - Method in class com.metreeca.xml.actions.XPath.Processor
Retrieves the target document.

E

element(String) - Method in class com.metreeca.xml.actions.XPath.Processor
Retrieves an element value from the target node.
elements(String) - Method in class com.metreeca.xml.actions.XPath.Processor
Retrieves element values from the target node.
encode(M, Document) - Method in class com.metreeca.xml.formats.HTMLFormat
Configures message Content-Type header to "text/html", unless already defined, and encodes the HTML value into the output stream accepted by the message OutputFormat body, taking into account the message charset
encode(M, Document) - Method in class com.metreeca.xml.formats.XMLFormat
Configures message Content-Type header to "application/xml", unless already defined, and encodes the XML value into the output stream accepted by the message OutputFormat body, taking into account the message charset
Extract - Class in com.metreeca.xml.actions
Main X/HTMl content extraction.
Extract() - Constructor for class com.metreeca.xml.actions.Extract
 

F

fetch(Fetch) - Method in class com.metreeca.xml.actions.Crawl
Configures the fetch action (defaults to Fetch.
focus(Function<? super Node, Optional<Node>>) - Method in class com.metreeca.xml.actions.Crawl
Configures the content focus action (defaults to the identity function).

H

html() - Static method in class com.metreeca.xml.formats.HTMLFormat
Creates an HTML message format.
html(InputStream, String, String) - Static method in class com.metreeca.xml.formats.HTMLFormat
Parses an HTML document.
html(O, String, String, Node) - Static method in class com.metreeca.xml.formats.HTMLFormat
Writes an HTML node.
HTMLFormat - Class in com.metreeca.xml.formats
HTML message format.

L

link(String) - Method in class com.metreeca.xml.actions.XPath.Processor
Retrieves a URI value from the target node.
links(String) - Method in class com.metreeca.xml.actions.XPath.Processor
Retrieves URI values from the target node.

M

mime() - Method in class com.metreeca.xml.formats.HTMLFormat
 
mime() - Method in class com.metreeca.xml.formats.XMLFormat
 
MIME - Static variable in class com.metreeca.xml.formats.HTMLFormat
The default MIME type for HTML messages ("text/html").
MIME - Static variable in class com.metreeca.xml.formats.XMLFormat
The default MIME type for XML message bodies ("application/xml").
MIMEPattern - Static variable in class com.metreeca.xml.formats.HTMLFormat
A pattern matching the HTML MIME type.
MIMEPattern - Static variable in class com.metreeca.xml.formats.XMLFormat
A pattern matching XML-based MIME types, for instance application/rss+xml.

N

node() - Method in class com.metreeca.xml.actions.XPath.Processor
Retrieves the target node.
node(String) - Method in class com.metreeca.xml.actions.XPath.Processor
Retrieves a node value from the target node.
nodes(String) - Method in class com.metreeca.xml.actions.XPath.Processor
Retrieves node values from the target node.
number(String) - Method in class com.metreeca.xml.actions.XPath.Processor
Retrieves a numeric value from the target node.

P

Processor(Node) - Constructor for class com.metreeca.xml.actions.XPath.Processor
Creates an XPath processor.
prune(BiPredicate<String, String>) - Method in class com.metreeca.xml.actions.Crawl
Configures the prune action (defaults to always pass).

S

string(String) - Method in class com.metreeca.xml.actions.XPath.Processor
Retrieves a textual value from the target node.
strings(String) - Method in class com.metreeca.xml.actions.XPath.Processor
Retrieves textual values from the target node.

T

threads(int) - Method in class com.metreeca.xml.actions.Crawl
Configures the number of concurrent requests (defaults to the number of processors)

U

Untag - Class in com.metreeca.xml.actions
X/HTML to Markdown conversion.
Untag() - Constructor for class com.metreeca.xml.actions.Untag
 

X

xml() - Static method in class com.metreeca.xml.formats.XMLFormat
Creates an XML message format.
xml(InputStream) - Static method in class com.metreeca.xml.formats.XMLFormat
Parses a XML document.
xml(InputStream, String) - Static method in class com.metreeca.xml.formats.XMLFormat
Parses a XML document.
xml(Source) - Static method in class com.metreeca.xml.formats.XMLFormat
Parses a XML document.
xml(XMLReader) - Static method in class com.metreeca.xml.formats.XMLFormat
Creates an XML message format using a custom SAX parser.
XMLFormat - Class in com.metreeca.xml.formats
XML message format.
XPath<R> - Class in com.metreeca.xml.actions
XPath-based XML node processing.
XPath(Function<XPath.Processor, R>) - Constructor for class com.metreeca.xml.actions.XPath
Creates an XPath-based node processing action.
XPath.Processor - Class in com.metreeca.xml.actions
XPath processor.
A B C D E F H L M N P S T U X 
All Classes and Interfaces|All Packages|Constant Field Values