BatchReadable (CDAP API 6.10.0 API)

Skip navigation links

Prev Class
Next Class

All Classes

Summary:
Nested |
Field |
Constr |
Method

Detail:
Field |
Constr |
Method

Type Parameters:

KEY - The key type.

VALUE - The value type.

All Known Subinterfaces:

ObjectMappedTable<T>, ObjectStore<T>, Table

All Known Implementing Classes:

IndexedTable, KeyValueTable, TimeseriesTable
```
@Beta
public interface BatchReadable<KEY,VALUE>
```
Interface for datasets that can be input to a batch job.
In order to feed a dataset into a batch job, the dataset must be splittable into chunks so that it's possible to process every part of the dataset in parallel. Every chunk must be readable as a collection of {key,value} records.

Method Summary

All Methods Instance Methods Abstract Methods
Modifier and Type	Method and Description
`SplitReader<KEY,VALUE>`	`createSplitReader(Split split)` Creates a reader for the split of a dataset.
`List<Split>`	`getSplits()` Returns all splits of the dataset.

- Method Detail
  - getSplits
```
List<Split> getSplits()
```
    Returns all splits of the dataset.
    For feeding the whole dataset into a batch job.
    
    Returns:
    
    A list of Splits.
  - createSplitReader
```
SplitReader<KEY,VALUE> createSplitReader(Split split)
```
    Creates a reader for the split of a dataset.
    
    Parameters:
    
    split - The split to create a reader for.
    
    Returns:
    
    The instance of a SplitReader.

Skip navigation links

Prev Class
Next Class

All Classes

Summary:
Nested |
Field |
Constr |
Method

Detail:
Field |
Constr |
Method

Copyright © 2024 Cask Data, Inc. Licensed under the Apache License, Version 2.0.