net.ruippeixotog.scalascraper.browser
The concrete type of documents created by this browser.
The concrete type of documents created by this browser.
Clears the cookie store of this browser.
Clears the cookie store of this browser.
Returns the current set of cookies stored in this browser for a given URL.
Returns the current set of cookies stored in this browser for a given URL.
the URL whose stored cookies are to be returned
a mapping of cookie names to their respective values.
Retrieves and parses a web page using a GET request.
Retrieves and parses a web page using a GET request.
the URL of the page to retrieve
a Document
containing the retrieved web page.
Parses a local HTML file with a specified charset.
Parses a local HTML file with a specified charset.
the HTML file to parse
the charset of the file
a Document
containing the parsed web page.
Parses a local HTML file encoded in UTF-8.
Parses a local HTML file encoded in UTF-8.
the path in the local filesystem where the HTML file is located
a Document
containing the parsed web page.
Parses a local HTML file with a specified charset.
Parses a local HTML file with a specified charset.
the path in the local filesystem where the HTML file is located
the charset of the file
a Document
containing the parsed web page.
Parses a local HTML file encoded in UTF-8.
Parses a local HTML file encoded in UTF-8.
the HTML file to parse
a Document
containing the parsed web page.
Parses an input stream with its content in a specified charset.
Parses an input stream with its content in a specified charset. The provided input stream is always closed before this method returns or throws an exception.
the input stream to parse
the charset of the input stream content
a Document
containing the parsed web page.
Parses a resource with a specified charset.
Parses a resource with a specified charset.
the name of the resource to parse
the charset of the resource
a Document
containing the parsed web page.
Parses an HTML string.
Parses an HTML string.
the HTML string to parse
a Document
containing the parsed web page.
Submits a form via a POST request and parses the resulting page.
Submits a form via a POST request and parses the resulting page.
the URL of the page to retrieve
a map containing the form fields to submit with their respective values
a Document
containing the resulting web page.
the user agent with which requests should be made
the user agent with which requests should be made
A Browser implementation based on jsoup, a Java HTML parser library.
JsoupBrowser
provides powerful and efficient document querying, but it doesn't run JavaScript in the pages. As such, it is limited to working strictly with the HTML send in the page source.Currently,
JsoupBrowser
does not keep separate cookie stores for different domains and paths. In each request all cookies set previously will be sent, regardless of the domain they were set on. If you do requests to different domains and do not want this behavior, use differentJsoupBrowser
instances.As the documents parsed by
JsoupBrowser
instances are not changed after loading,Document
andElement
instances obtained from them are guaranteed to be immutable.