scraper

Type Members

trait HtmlExtractor[-E <: Element, +A] extends (ElementQuery[E]) ⇒ A

An object able to extract content from net.ruippeixotog.scalascraper.model.ElementQuery instances.
An object able to extract content from net.ruippeixotog.scalascraper.model.ElementQuery instances.
E
the type of the elements needed by this HtmlExtractor
A
the type of the extracted content
trait HtmlExtractorInstances extends AnyRef
trait HtmlValidator[-E <: Element, +R] extends AnyRef
trait PolyHtmlExtractor extends AnyRef

An extractor like HtmlExtractor but whose extracted content type depends on the type of the input net.ruippeixotog.scalascraper.model.Elements.
An extractor like HtmlExtractor but whose extracted content type depends on the type of the input net.ruippeixotog.scalascraper.model.Elements. A PolyHtmlExtractor supports application of CSS queries and can be turned into a normal HtmlExtractor by calling its apply[E] method, fixing the type of the input Element as E.
case class SimpleExtractor[-E <: Element, C, +A](cssQuery: String, contentExtractor: (ElementQuery[E]) ⇒ C, contentParser: (C) ⇒ A) extends HtmlExtractor[E, A] with Product with Serializable

Annotations
@deprecated
Deprecated
(Since version 2.0.0) Use HtmlExtractor constructor methods followed by map and mapQuery
case class SimpleValidator[-E <: Element, A, +R](htmlExtractor: HtmlExtractor[E, A], matcher: (A) ⇒ Boolean, result: Option[R] = None) extends HtmlValidator[E, R] with Product with Serializable

Annotations
@deprecated
Deprecated
(Since version 2.0.0) SimpleValidator is deprecated. Use HtmlValidator.apply methods instead

Value Members

object ContentExtractors

An object containing HtmlExtractor instances for extracting primitive data such as text, elements or attributes, as well as more complex information such as form data.
An object containing HtmlExtractor instances for extracting primitive data such as text, elements or attributes, as well as more complex information such as form data. Because they do perform little to no navigation through the document, they are typically preceded by a CSS query defining the location in the HTML document of the data to be retrieved.
object ContentParsers

An object containing functions for parsing extracted content.
An object containing functions for parsing extracted content. They can be used together with the DSL extractor method or by calling map on a HtmlExtractor with them.
object HtmlExtractor extends HtmlExtractorInstances

The companion object for HtmlExtractor, containing methods for creating new extractors.
object HtmlValidator
object PolyHtmlExtractor
object SimpleExtractor extends Serializable

Deprecated Value Members

object SimpleValidator extends Serializable

Annotations
@deprecated
Deprecated
(Since version 2.0.0) SimpleValidator is deprecated. Use HtmlValidator.apply methods instead

package scraper

Type Members

trait HtmlExtractor[-E <: Element, +A] extends (ElementQuery[E]) ⇒ A

trait HtmlExtractorInstances extends AnyRef

trait HtmlValidator[-E <: Element, +R] extends AnyRef

trait PolyHtmlExtractor extends AnyRef

case class SimpleExtractor[-E <: Element, C, +A](cssQuery: String, contentExtractor: (ElementQuery[E]) ⇒ C, contentParser: (C) ⇒ A) extends HtmlExtractor[E, A] with Product with Serializable

case class SimpleValidator[-E <: Element, A, +R](htmlExtractor: HtmlExtractor[E, A], matcher: (A) ⇒ Boolean, result: Option[R] = None) extends HtmlValidator[E, R] with Product with Serializable

Value Members

object ContentExtractors

object ContentParsers

object HtmlExtractor extends HtmlExtractorInstances

object HtmlValidator

object PolyHtmlExtractor

object SimpleExtractor extends Serializable

Deprecated Value Members

object SimpleValidator extends Serializable

Ungrouped