public class CSVLoader extends AbstractFileLoader implements BatchConverter, OptionHandler
-N <range> The range of attributes to force type to be NOMINAL. 'first' and 'last' are accepted as well. Examples: "first-last", "1,4,5-27,50-last" (default: -none-)
-S <range> The range of attribute to force type to be STRING. 'first' and 'last' are accepted as well. Examples: "first-last", "1,4,5-27,50-last" (default: -none-)
-D <range> The range of attribute to force type to be DATE. 'first' and 'last' are accepted as well. Examples: "first-last", "1,4,5-27,50-last" (default: -none-)
-format <date format> The date formatting string to use to parse date values. (default: "yyyy-MM-dd'T'HH:mm:ss")
-M <str> The string representing a missing value. (default: ?)
-E <enclosures> The enclosure character(s) to use for strings. Specify as a comma separated list (e.g. ",' (default: '"')
Loader
,
Serialized FormModifier and Type | Field and Description |
---|---|
static String |
FILE_EXTENSION
the file extension.
|
FILE_EXTENSION_COMPRESSED
BATCH, INCREMENTAL, NONE
Constructor and Description |
---|
CSVLoader()
default constructor.
|
Modifier and Type | Method and Description |
---|---|
String |
dateAttributesTipText()
Returns the tip text for this property.
|
String |
dateFormatTipText()
Returns the tip text for this property.
|
String |
enclosureCharactersTipText()
Returns the tip text for this property.
|
Instances |
getDataSet()
Return the full data set.
|
String |
getDateAttributes()
Returns the current attribute range to be forced to type date.
|
String |
getDateFormat()
Get the format to use for parsing date values.
|
String |
getEnclosureCharacters()
Get the character(s) to use/recognize as string enclosures
|
String |
getFileDescription()
Returns a description of the file type.
|
String |
getFileExtension()
Get the file extension used for arff files.
|
String[] |
getFileExtensions()
Gets all the file extensions used for this type of file.
|
String |
getMissingValue()
Returns the current placeholder for missing values.
|
Instance |
getNextInstance(Instances structure)
CSVLoader is unable to process a data set incrementally.
|
String |
getNominalAttributes()
Returns the current attribute range to be forced to type nominal.
|
String[] |
getOptions()
Gets the current settings of the Classifier.
|
String |
getRevision()
Returns the revision string.
|
String |
getStringAttributes()
Returns the current attribute range to be forced to type string.
|
Instances |
getStructure()
Determines and returns (if possible) the structure (internally the header)
of the data set as an empty set of instances.
|
String |
globalInfo()
Returns a string describing this attribute evaluator.
|
Enumeration |
listOptions()
Returns an enumeration describing the available options.
|
static void |
main(String[] args)
Main method.
|
String |
missingValueTipText()
Returns the tip text for this property.
|
String |
nominalAttributesTipText()
Returns the tip text for this property.
|
void |
reset()
Resets the Loader ready to read a new data set or the same data set again.
|
void |
setDateAttributes(String value)
Set the attribute range to be forced to type date.
|
void |
setDateFormat(String value)
Set the format to use for parsing date values.
|
void |
setEnclosureCharacters(String enclosure)
Set the character(s) to use/recognize as string enclosures
|
void |
setMissingValue(String value)
Sets the placeholder for missing values.
|
void |
setNominalAttributes(String value)
Sets the attribute range to be forced to type nominal.
|
void |
setOptions(String[] options)
Parses a given list of options.
|
void |
setSource(File file)
Resets the Loader object and sets the source of the data set to be the
supplied File object.
|
void |
setSource(InputStream input)
Resets the Loader object and sets the source of the data set to be the
supplied Stream object.
|
void |
setStringAttributes(String value)
Sets the attribute range to be forced to type string.
|
String |
stringAttributesTipText()
Returns the tip text for this property.
|
getUseRelativePath, retrieveFile, runFileLoader, setEnvironment, setFile, setUseRelativePath, useRelativePathTipText
setRetrieval
public static String FILE_EXTENSION
public String getFileExtension()
getFileExtension
in interface FileSourcedConverter
public String getFileDescription()
getFileDescription
in interface FileSourcedConverter
public String[] getFileExtensions()
getFileExtensions
in interface FileSourcedConverter
public String globalInfo()
public Enumeration listOptions()
listOptions
in interface OptionHandler
public void setOptions(String[] options) throws Exception
-N <range> The range of attributes to force type to be NOMINAL. 'first' and 'last' are accepted as well. Examples: "first-last", "1,4,5-27,50-last" (default: -none-)
-S <range> The range of attribute to force type to be STRING. 'first' and 'last' are accepted as well. Examples: "first-last", "1,4,5-27,50-last" (default: -none-)
-D <range> The range of attribute to force type to be DATE. 'first' and 'last' are accepted as well. Examples: "first-last", "1,4,5-27,50-last" (default: -none-)
-format <date format> The date formatting string to use to parse date values. (default: "yyyy-MM-dd'T'HH:mm:ss")
-M <str> The string representing a missing value. (default: ?)
-E <enclosures> The enclosure character(s) to use for strings. Specify as a comma separated list (e.g. ",' (default: '"')
setOptions
in interface OptionHandler
options
- the list of options as an array of stringsException
- if an option is not supportedpublic String[] getOptions()
getOptions
in interface OptionHandler
public void setNominalAttributes(String value)
value
- the rangepublic String getNominalAttributes()
public String nominalAttributesTipText()
public void setStringAttributes(String value)
value
- the rangepublic String getStringAttributes()
public String stringAttributesTipText()
public void setDateAttributes(String value)
value
- the rangepublic String getDateAttributes()
public String dateAttributesTipText()
public void setDateFormat(String value)
value
- the format to use.public String getDateFormat()
public String dateFormatTipText()
public String enclosureCharactersTipText()
public void setEnclosureCharacters(String enclosure)
enclosure
- the characters to use as string enclosurespublic String getEnclosureCharacters()
public void setMissingValue(String value)
value
- the placeholderpublic String getMissingValue()
public String missingValueTipText()
public void setSource(InputStream input) throws IOException
setSource
in interface Loader
setSource
in class AbstractLoader
input
- the input streamIOException
- if an error occurspublic void setSource(File file) throws IOException
setSource
in interface Loader
setSource
in class AbstractFileLoader
file
- the source file.IOException
- if an error occurspublic Instances getStructure() throws IOException
getStructure
in interface Loader
getStructure
in class AbstractLoader
IOException
- if an error occurspublic Instances getDataSet() throws IOException
getDataSet
in interface Loader
getDataSet
in class AbstractLoader
IOException
- if there is no source or parsing failspublic Instance getNextInstance(Instances structure) throws IOException
getNextInstance
in interface Loader
getNextInstance
in class AbstractLoader
structure
- ignoredIOException
- always. CSVLoader is unable to process a data set
incrementally.public void reset() throws IOException
reset
in interface Loader
reset
in class AbstractFileLoader
IOException
- if something goes wrongpublic String getRevision()
getRevision
in interface RevisionHandler
public static void main(String[] args)
args
- should contain the name of an input file.Copyright © 2016 University of Waikato, Hamilton, NZ. All Rights Reserved.