Package

net.ruippeixotog.scalascraper

scraper

Permalink

package scraper

Visibility
  1. Public
  2. All

Type Members

  1. trait HtmlExtractor[-E <: Element, +A] extends (ElementQuery[E]) ⇒ A

    Permalink

    An object able to extract content from net.ruippeixotog.scalascraper.model.ElementQuery instances.

    An object able to extract content from net.ruippeixotog.scalascraper.model.ElementQuery instances.

    E

    the type of the elements needed by this HtmlExtractor

    A

    the type of the extracted content

  2. trait HtmlExtractorInstances extends AnyRef

    Permalink
  3. trait HtmlValidator[-E <: Element, +R] extends AnyRef

    Permalink
  4. trait PolyHtmlExtractor extends AnyRef

    Permalink

    An extractor like HtmlExtractor but whose extracted content type depends on the type of the input net.ruippeixotog.scalascraper.model.Elements.

    An extractor like HtmlExtractor but whose extracted content type depends on the type of the input net.ruippeixotog.scalascraper.model.Elements. A PolyHtmlExtractor supports application of CSS queries and can be turned into a normal HtmlExtractor by calling its apply[E] method, fixing the type of the input Element as E.

  5. case class SimpleExtractor[-E <: Element, C, +A](cssQuery: String, contentExtractor: (ElementQuery[E]) ⇒ C, contentParser: (C) ⇒ A) extends HtmlExtractor[E, A] with Product with Serializable

    Permalink
    Annotations
    @deprecated
    Deprecated

    (Since version 2.0.0) Use HtmlExtractor constructor methods followed by map and mapQuery

  6. case class SimpleValidator[-E <: Element, A, +R](htmlExtractor: HtmlExtractor[E, A], matcher: (A) ⇒ Boolean, result: Option[R] = None) extends HtmlValidator[E, R] with Product with Serializable

    Permalink
    Annotations
    @deprecated
    Deprecated

    (Since version 2.0.0) SimpleValidator is deprecated. Use HtmlValidator.apply methods instead

Value Members

  1. object ContentExtractors

    Permalink

    An object containing HtmlExtractor instances for extracting primitive data such as text, elements or attributes, as well as more complex information such as form data.

    An object containing HtmlExtractor instances for extracting primitive data such as text, elements or attributes, as well as more complex information such as form data. Because they do perform little to no navigation through the document, they are typically preceded by a CSS query defining the location in the HTML document of the data to be retrieved.

  2. object ContentParsers

    Permalink

    An object containing functions for parsing extracted content.

    An object containing functions for parsing extracted content. They can be used together with the DSL extractor method or by calling map on a HtmlExtractor with them.

  3. object HtmlExtractor extends HtmlExtractorInstances

    Permalink

    The companion object for HtmlExtractor, containing methods for creating new extractors.

  4. object HtmlValidator

    Permalink
  5. object PolyHtmlExtractor

    Permalink
  6. object SimpleExtractor extends Serializable

    Permalink

Deprecated Value Members

  1. object SimpleValidator extends Serializable

    Permalink
    Annotations
    @deprecated
    Deprecated

    (Since version 2.0.0) SimpleValidator is deprecated. Use HtmlValidator.apply methods instead

Ungrouped