Package org.jsoup.helper
Class DataUtil
java.lang.Object
org.jsoup.helper.DataUtil
Internal static utilities for handling data.
-
Field Summary
-
Method Summary
Modifier and TypeMethodDescriptionstatic Document
Loads and parses a file to a Document, with the HtmlParser.static Document
Loads and parses a file to a Document.static Document
load
(InputStream in, String charsetName, String baseUri) Parses a Document from an input steam.static Document
load
(InputStream in, String charsetName, String baseUri, Parser parser) Parses a Document from an input steam, using the provided Parser.static ByteBuffer
readToByteBuffer
(InputStream inStream, int maxSize) Read the input stream into a byte buffer.
-
Field Details
-
UTF_8
-
-
Method Details
-
load
public static Document load(File file, @Nullable String charsetName, String baseUri) throws IOException Loads and parses a file to a Document, with the HtmlParser. Files that are compressed with gzip (and end in.gz
or.z
) are supported in addition to uncompressed files.- Parameters:
file
- file to loadcharsetName
- (optional) character set of input; specifynull
to attempt to autodetect. A BOM in the file will always override this setting.baseUri
- base URI of document, to resolve relative links against- Returns:
- Document
- Throws:
IOException
- on IO error
-
load
public static Document load(File file, @Nullable String charsetName, String baseUri, Parser parser) throws IOException Loads and parses a file to a Document. Files that are compressed with gzip (and end in.gz
or.z
) are supported in addition to uncompressed files.- Parameters:
file
- file to loadcharsetName
- (optional) character set of input; specifynull
to attempt to autodetect. A BOM in the file will always override this setting.baseUri
- base URI of document, to resolve relative links againstparser
- alternateparser
to use.- Returns:
- Document
- Throws:
IOException
- on IO error- Since:
- 1.14.2
-
load
public static Document load(@WillClose InputStream in, @Nullable String charsetName, String baseUri) throws IOException Parses a Document from an input steam.- Parameters:
in
- input stream to parse. The stream will be closed after reading.charsetName
- character set of input (optional)baseUri
- base URI of document, to resolve relative links against- Returns:
- Document
- Throws:
IOException
- on IO error
-
load
public static Document load(@WillClose InputStream in, @Nullable String charsetName, String baseUri, Parser parser) throws IOException Parses a Document from an input steam, using the provided Parser.- Parameters:
in
- input stream to parse. The stream will be closed after reading.charsetName
- character set of input (optional)baseUri
- base URI of document, to resolve relative links againstparser
- alternateparser
to use.- Returns:
- Document
- Throws:
IOException
- on IO error
-
readToByteBuffer
Read the input stream into a byte buffer. To deal with slow input streams, you may interrupt the thread this method is executing on. The data read until being interrupted will be available.- Parameters:
inStream
- the input stream to read frommaxSize
- the maximum size in bytes to read from the stream. Set to 0 to be unlimited.- Returns:
- the filled byte buffer
- Throws:
IOException
- if an exception occurs whilst reading from the input stream.
-