|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectweka.core.converters.AbstractLoader
weka.core.converters.TextDirectoryLoader
public class TextDirectoryLoader
Loads all text files in a directory and uses the subdirectory names as class labels. The content of the text files will be stored in a String attribute, the filename can be stored as well.
Valid options are:-D Enables debug output. (default: off)
-F Stores the filename in an additional attribute. (default: off)
-dir <directory> The directory to work on. (default: current directory)
-charset <charset name> The character set to use, e.g UTF-8. (default: use the default character set)
-R Retain all string attribute values when reading incrementally.Based on code from the TextDirectoryToArff tool:
Loader
,
Serialized FormField Summary |
---|
Fields inherited from interface weka.core.converters.Loader |
---|
BATCH, INCREMENTAL, NONE |
Constructor Summary | |
---|---|
TextDirectoryLoader()
default constructor |
Method Summary | |
---|---|
String |
charSetTipText()
the tip text for this property |
String |
debugTipText()
the tip text for this property |
String |
getCharSet()
Get the character set to use when reading text files. |
Instances |
getDataSet()
Return the full data set. |
boolean |
getDebug()
Gets whether additional debug information is printed. |
File |
getDirectory()
get the Dir specified as the source |
String |
getFileDescription()
Returns a description of the file type, actually it's directories. |
Instance |
getNextInstance(Instances structure)
TextDirectoryLoader is unable to process a data set incrementally. |
String[] |
getOptions()
Gets the setting |
boolean |
getOutputFilename()
Gets whether the filename will be stored as an extra attribute. |
String |
getRevision()
Returns the revision string. |
Instances |
getStructure()
Determines and returns (if possible) the structure (internally the header) of the data set as an empty set of instances. |
String |
globalInfo()
Returns a string describing this loader |
Enumeration |
listOptions()
Lists the available options |
static void |
main(String[] args)
Main method. |
String |
outputFilenameTipText()
the tip text for this property |
void |
reset()
Resets the loader ready to read a new data set |
void |
setCharSet(String charSet)
Set the character set to use when reading text files (an empty string indicates that the default character set will be used). |
void |
setDebug(boolean value)
Sets whether to print some debug information. |
void |
setDirectory(File dir)
sets the source directory |
void |
setOptions(String[] options)
Parses a given list of options. |
void |
setOutputFilename(boolean value)
Sets whether the filename will be stored as an extra attribute. |
void |
setSource(File dir)
Resets the Loader object and sets the source of the data set to be the supplied File object. |
Methods inherited from class weka.core.converters.AbstractLoader |
---|
setRetrieval, setSource |
Methods inherited from class java.lang.Object |
---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public TextDirectoryLoader()
Method Detail |
---|
public String globalInfo()
public Enumeration listOptions()
listOptions
in interface OptionHandler
public void setOptions(String[] options) throws Exception
-D Enables debug output. (default: off)
-F Stores the filename in an additional attribute. (default: off)
-dir <directory> The directory to work on. (default: current directory)
-charset <charset name> The character set to use, e.g UTF-8. (default: use the default character set)
setOptions
in interface OptionHandler
options
- the options
Exception
- if options cannot be setpublic String[] getOptions()
getOptions
in interface OptionHandler
public String charSetTipText()
public void setCharSet(String charSet)
charSet
- the character set to use.public String getCharSet()
public void setDebug(boolean value)
value
- if true additional debug information will be printed.public boolean getDebug()
public String debugTipText()
public void setOutputFilename(boolean value)
value
- if true the filename will be stored in an extra
attributepublic boolean getOutputFilename()
public String outputFilenameTipText()
public String getFileDescription()
public File getDirectory()
public void setDirectory(File dir) throws IOException
dir
- the source directory
IOException
- if an error occurspublic void reset()
reset
in interface Loader
reset
in class AbstractLoader
public void setSource(File dir) throws IOException
setSource
in interface Loader
setSource
in class AbstractLoader
dir
- the source directory.
IOException
- if an error occurspublic Instances getStructure() throws IOException
getStructure
in interface Loader
getStructure
in class AbstractLoader
IOException
- if an error occurspublic Instances getDataSet() throws IOException
getDataSet
in interface Loader
getDataSet
in class AbstractLoader
IOException
- if there is no source or parsing failspublic Instance getNextInstance(Instances structure) throws IOException
getNextInstance
in interface Loader
getNextInstance
in class AbstractLoader
structure
- ignored
IOException
- always. TextDirectoryLoader is unable to process a data
set incrementally.public String getRevision()
getRevision
in interface RevisionHandler
public static void main(String[] args)
args
- should contain the name of an input file.
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |