Package ai.djl.training.dataset
Contains classes to download and prepare training and testing data.
The central class to work with in this package is the Dataset
.
In practice, most of the implementations of Dataset
will actually
extend RandomAccessDataset
instead.
-
Interface Summary Interface Description Dataset An interface to represent a set of sample data/label pairs to train a model.Sampler An interface for sampling data items from aRandomAccessDataset
.Sampler.SubSampler An interface that samples a single data item at a time. -
Class Summary Class Description ArrayDataset ArrayDataset.Builder The Builder to construct anArrayDataset
.Batch ABatch
is used to hold multiple items (data and label pairs) from aDataset
.BatchSampler BatchSampler
is aSampler
that returns a single epoch over the data.DataIterable DataIterable is a data loader that combinesDataset
,Batchifier
,Pipeline
, andSampler
to provide an iterable over the givenRandomAccessDataset
.RandomAccessDataset RandomAccessDataset represent the dataset that support random access reads.RandomAccessDataset.BaseBuilder<T extends RandomAccessDataset.BaseBuilder<T>> The Builder to construct aRandomAccessDataset
.RandomSampler RandomSampler
is an implementation of theSampler.SubSampler
interface.Record Record
represents a single element of data and labels fromDataset
.SequenceSampler SequenceSampler
is an implementation of theSampler.SubSampler
interface. -
Enum Summary Enum Description Dataset.Usage An enum that indicates the mode - training, test or validation.