An object able to extract content from net.ruippeixotog.scalascraper.model.ElementQuery instances.
An extractor like HtmlExtractor but whose extracted content type depends on the type of the input net.ruippeixotog.scalascraper.model.Elements. A PolyHtmlExtractor supports application of CSS queries and can be turned into a normal HtmlExtractor by calling its apply[E] method, fixing the type of the input Element as E.
(Since version 2.0.0) Use HtmlExtractor constructor methods followed by map and mapQuery
(Since version 2.0.0) SimpleValidator is deprecated. Use HtmlValidator.apply methods instead
An object containing HtmlExtractor instances for extracting primitive data such as text, elements or attributes, as well as more complex information such as form data. Because they perform little to no navigation through the document, they are typically preceded by a CSS query that defines where in the HTML document the data to be retrieved is located.
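As an illustration of pairing a CSS query with a content extractor, the following is a minimal sketch. It assumes the library's JsoupBrowser and the `text`, `texts` and `attr` extractors exposed through `DSL.Extract`; the sample markup and the object name are hypothetical.

```scala
import net.ruippeixotog.scalascraper.browser.JsoupBrowser
import net.ruippeixotog.scalascraper.dsl.DSL._
import net.ruippeixotog.scalascraper.dsl.DSL.Extract._

object ContentExtractorsSketch {
  // Hypothetical sample markup, used only for illustration.
  val html: String =
    """<html><body>
      |  <h1 id="title">Catalogue</h1>
      |  <a class="more" href="/page/2">Next</a>
      |  <ul><li>apple</li><li>pear</li></ul>
      |</body></html>""".stripMargin

  val doc = JsoupBrowser().parseString(html)

  // Each extractor is applied to a CSS query that locates the data.
  val title = doc >> text("#title")          // text of the first match
  val link  = doc >> attr("href")("a.more")  // attribute of the first match
  val items = doc >> texts("li")             // text of every match
}
```

The extractors themselves do no navigation here; the CSS query alone decides which elements they receive.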
An object containing functions for parsing extracted content. They can be used together with the DSL extractor method or by passing them to map on an HtmlExtractor.
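The two usages described above might look like the sketch below. It assumes the `asInt` and `asDouble` parsers exposed through `DSL.Parse`, the three-argument form of the DSL `extractor` method, and that `asDouble` is a plain `String => Double` function that can be passed to `map`; the markup and names are hypothetical.

```scala
import net.ruippeixotog.scalascraper.browser.JsoupBrowser
import net.ruippeixotog.scalascraper.dsl.DSL._
import net.ruippeixotog.scalascraper.dsl.DSL.Extract._
import net.ruippeixotog.scalascraper.dsl.DSL.Parse._

object ContentParsersSketch {
  // Hypothetical sample markup, used only for illustration.
  val doc = JsoupBrowser().parseString(
    """<html><body>
      |  <span id="count">42</span>
      |  <span id="price">9.5</span>
      |</body></html>""".stripMargin)

  // Usage 1: CSS query, content extractor and parser combined by `extractor`.
  val count = doc >> extractor("#count", text, asInt)

  // Usage 2: the parser is passed to `map` on an existing extractor.
  val priceExtractor = text("#price").map(asDouble)
  val price = doc >> priceExtractor
}
```

The `map` form is convenient when an extractor is built once and reused, while the `extractor` form keeps the query, extractor and parser together at the call site.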
The companion object for HtmlExtractor, containing methods for creating new extractors.
An object able to extract content from net.ruippeixotog.scalascraper.model.ElementQuery instances.
the type of the elements needed by this HtmlExtractor
the type of the extracted content