public interface FileSet extends Dataset, InputFormatProvider, OutputFormatProvider
FileSetProperties
for details. If it is enabled for explore, a Hive external table will be created when the dataset is
created. The Hive table is not partitioned, and therefore querying will only work if every path in the
fileset is a file (not a subdirectory).Modifier and Type | Method and Description |
---|---|
Location |
getBaseLocation()
Allows to interact directly with the location of this dataset in the underlying file system.
|
Map<String,String> |
getInputFormatConfiguration(Iterable<? extends Location> inputLocs)
A variant of
InputFormatProvider.getInputFormatConfiguration()
that allows passing in the input locations (rather than using the input locations that were
determined from runtime arguments). |
List<Location> |
getInputLocations()
Allows direct access to files of this dataset in the underlying file system.
|
Location |
getLocation(String relativePath)
Allows direct access to files in this dataset, in the underlying file system.
|
Location |
getOutputLocation()
Allows direct access to files in the output location, in the underlying file system.
|
Map<String,String> |
getRuntimeArguments()
Allow direct access to the runtime arguments of this file set.
|
getInputFormatClassName, getInputFormatConfiguration
getOutputFormatClassName, getOutputFormatConfiguration
static final String TYPE
Location getBaseLocation()
List<Location> getInputLocations()
Location getOutputLocation()
Location getLocation(String relativePath)
Map<String,String> getRuntimeArguments()
Map<String,String> getInputFormatConfiguration(Iterable<? extends Location> inputLocs)
InputFormatProvider.getInputFormatConfiguration()
that allows passing in the input locations (rather than using the input locations that were
determined from runtime arguments).inputLocs
- the input locations to be usedCopyright © 2020 Cask Data, Inc. Licensed under the Apache License, Version 2.0.