The DSI utilities are a mish mash of classes accumulated during the last ten years in projects developed at the former DSI (Dipartimento di Scienze dell'Informazione, i.e., Information Sciences Department), now DI (Dipartimento di Informatica, i.e., Informatics Department) of the Università degli Studi di Milano. They were originally distributed in several projects (mainly in MG4J) but we finally decided to gather all the material in a single place.
The DSI utilities are distributed under the GNU Lesser General Public License.
Highlights
The implementations available are a bit eclectic due to the particular kind of applications we developed. Very broadly, we have:
- Implementations of pseudorandom number generators. See the package documentation for details.
BitVector
and its implementations—a high-performance but flexible set of bit vector classes.- A
it.unimi.dsi.compression
package containing codecs for several types of encodings. ProgressLogger
, a flexible logger with statistics marking the progress of the (many) classes we use that require hours of computation.ObjectParser
, a class making it easy to specify complex objects on the command line.MutableString
, our answer to the JavaString
class.- The
I/O package
, containing fast version of several classes existing injava.io
, many useful classes to read easily text data (e.g.,FileLinesCollection
), bit streams, classes providing large-size memory mapping such asByteBufferInputStream
, andOfflineIterable
—the easy & fast way to store large sequences of objects on disk and iterate on them. - The
it.unimi.dsi.util
package, containing pseudorandom number generators, tries, immutable prefix maps, Bloom filters, a very comfortableProperties
class and more. - The
it.unimi.dsi.stat
package, containing a lightweight class for computing basic statistics and an arbitrary-precision implementation of the Jackknife method. - Lots of utility methods in
Util
(have a look!) - Big versions of I/O and utility classes
in
it.unimi.dsi.big.io
andit.unimi.dsi.big.util
. - The
BulletParser
, that we use to parse HTML and XML.
Package | Description |
---|---|
it.unimi.dsi |
General utilities.
|
it.unimi.dsi.big.io |
I/O big classes.
|
it.unimi.dsi.big.util |
Collections and similar big classes.
|
it.unimi.dsi.bits |
Main classes manipulating bits.
|
it.unimi.dsi.compression |
Word-based compression/decompression classes.
|
it.unimi.dsi.io |
I/O classes.
|
it.unimi.dsi.lang |
Basic classes.
|
it.unimi.dsi.logging |
Logging classes.
|
it.unimi.dsi.parser |
A fast, lightweight, on-demand (X)HTML parser.
|
it.unimi.dsi.parser.callback |
Callbacks for the
BulletParser . |
it.unimi.dsi.stat |
Statistics classes.
|
it.unimi.dsi.util |
Miscellaneaous utility classes.
|
it.unimi.dsi.util.concurrent |