ContentExtractors
An object containing HtmlExtractor instances for extracting primitive data such as text, elements or attributes, as well as more complex information such as form data. Because they perform little to no navigation through the document, they are typically preceded by a CSS query defining the location in the HTML document of the data to be retrieved.
Attributes
- Supertypes: class Object, trait Matchable, class Any
- Self type: ContentExtractors.type
Members list
Value members
Concrete methods
An extractor for the value of an attribute of the first matched element.
Value parameters
- attr: the attribute name to extract
Attributes
- Returns: an extractor for the value of an attribute of the first matched element.
An extractor for a lazy iterable of the value of an attribute of each matched element.
Value parameters
- attr: the attribute name to extract
Attributes
- Returns: an extractor for a lazy iterable of the value of an attribute of each matched element.
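The difference between the two attribute extractors above can be sketched with a toy model that does not use the library at all. The names `Elem`, `Extractor`, `firstAttr` and `eachAttr` are hypothetical and introduced only for illustration. Failing on an empty match or a missing attribute is an assumption of this sketch, not documented behaviour.

```scala
object ExtractorSketch {
  // Hypothetical stand-in for an HTML element matched by a CSS query.
  final case class Elem(tag: String, attrs: Map[String, String], text: String)

  // In this toy model an extractor is a function over the matched elements.
  type Extractor[A] = Seq[Elem] => A

  // Value of an attribute of the first matched element.
  // Assumption: throws NoSuchElementException if nothing matched or the
  // attribute is absent.
  def firstAttr(name: String): Extractor[String] =
    elems => elems.head.attrs(name)

  // Lazy iterable of the value of an attribute of each matched element.
  // The view defers the lookup until the result is traversed.
  def eachAttr(name: String): Extractor[Iterable[String]] =
    elems => elems.view.map(_.attrs(name))
}
```

Because the second extractor returns a view, no attribute is looked up until the result is iterated, for example by calling `toList` on it.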
Concrete fields
An extractor for the text in all matched elements.
An extractor for the first element matched.
An extractor for a list of the matched elements.
An extractor for an ElementQuery with the matched elements.
An extractor for the form data present in the matched elements.
An extractor for the form data present in the matched elements, together with the submission URL in the form.
An extractor for the first element matched. It retains the concrete type of the elements being extracted.
An extractor for a list of the matched elements. It retains the concrete type of the elements being extracted.
An extractor for an ElementQuery with the matched elements. It retains the concrete type of the elements being extracted.
An extractor for the cells of an HTML table.
Cells spanning multiple rows or columns are repeated in each of the positions they occupy. As such, well-formed rectangular tables always result in a Vector of Vectors with identical sizes.
Rows in thead elements are always presented first, while rows inside tfoot elements are always at the end.
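The span and ordering rules for tables can be illustrated with a small standalone sketch. It is not the library's implementation: `Cell`, `Row`, `ordered` and `expand` are hypothetical names, and the input is assumed to be a well-formed table whose spans never overlap.

```scala
import scala.collection.mutable

object TableSketch {
  // Hypothetical model of a cell: its text plus the HTML rowspan/colspan values.
  final case class Cell(text: String, rowspan: Int = 1, colspan: Int = 1)

  // Hypothetical model of a row, tagged with its section:
  // "thead", "tbody" or "tfoot".
  final case class Row(section: String, cells: Vector[Cell])

  private def rank(section: String): Int = section match {
    case "thead" => 0
    case "tfoot" => 2
    case _       => 1
  }

  // thead rows first, tfoot rows last. sortBy is stable, so rows keep
  // their document order within each section.
  def ordered(rows: Vector[Row]): Vector[Row] =
    rows.sortBy(row => rank(row.section))

  // Repeats every spanning cell in each grid position it occupies.
  def expand(rows: Vector[Row]): Vector[Vector[String]] = {
    val sorted = ordered(rows)
    // grid(r) maps a column index to the text placed at row r.
    val grid = Vector.fill(sorted.size)(mutable.Map.empty[Int, String])
    for ((row, r) <- sorted.zipWithIndex) {
      var c = 0
      for (cell <- row.cells) {
        // Skip columns already taken by a cell spanning down from above.
        while (grid(r).contains(c)) c += 1
        for {
          dr <- 0 until cell.rowspan
          dc <- 0 until cell.colspan
          if r + dr < sorted.size
        } {
          grid(r + dr)(c + dc) = cell.text
        }
        c += cell.colspan
      }
    }
    grid.map(_.toVector.sortBy(_._1).map(_._2))
  }
}
```

Given a tfoot row with one cell spanning two columns, a thead row with cells `h1` and `h2`, and two tbody rows where `a` spans both rows, `expand` yields four rows of two cells each: `h1, h2`, then `a, b`, then `a, c`, then `total, total`.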
An extractor for the text in the first element matched.
An extractor for a lazy iterable of the text in each element matched.