Class GIS

  • All Implemented Interfaces:
    EventTrainer

    public class GIS
    extends AbstractEventTrainer
    A Factory class which uses instances of GISTrainer to create and train GISModels.
    • Field Detail

      • PRINT_MESSAGES

        public static boolean PRINT_MESSAGES
        Set this to false if you don't want messages about the progress of model training displayed. Alternately, you can use the overloaded version of trainModel() to conditionally enable progress messages.
      • SMOOTHING_OBSERVATION

        public static double SMOOTHING_OBSERVATION
        If we are using smoothing, this is used as the "number" of times we want the trainer to imagine that it saw a feature that it actually didn't see. Defaulted to 0.1.
    • Constructor Detail

      • GIS

        public GIS()
    • Method Detail

      • trainModel

        public static GISModel trainModel​(ObjectStream<Event> eventStream)
                                   throws java.io.IOException
        Train a model using the GIS algorithm, assuming 100 iterations and no cutoff.
        Parameters:
        eventStream - The EventStream holding the data on which this model will be trained.
        Returns:
        The newly trained model, which can be used immediately or saved to disk using an opennlp.tools.ml.maxent.io.GISModelWriter object.
        Throws:
        java.io.IOException
      • trainModel

        public static GISModel trainModel​(ObjectStream<Event> eventStream,
                                          boolean smoothing)
                                   throws java.io.IOException
        Train a model using the GIS algorithm, assuming 100 iterations and no cutoff.
        Parameters:
        eventStream - The EventStream holding the data on which this model will be trained.
        smoothing - Defines whether the created trainer will use smoothing while training the model.
        Returns:
        The newly trained model, which can be used immediately or saved to disk using an opennlp.tools.ml.maxent.io.GISModelWriter object.
        Throws:
        java.io.IOException
      • trainModel

        public static GISModel trainModel​(ObjectStream<Event> eventStream,
                                          int iterations,
                                          int cutoff)
                                   throws java.io.IOException
        Train a model using the GIS algorithm.
        Parameters:
        eventStream - The EventStream holding the data on which this model will be trained.
        iterations - The number of GIS iterations to perform.
        cutoff - The number of times a feature must be seen in order to be relevant for training.
        Returns:
        The newly trained model, which can be used immediately or saved to disk using an opennlp.tools.ml.maxent.io.GISModelWriter object.
        Throws:
        java.io.IOException
      • trainModel

        public static GISModel trainModel​(ObjectStream<Event> eventStream,
                                          int iterations,
                                          int cutoff,
                                          boolean smoothing,
                                          boolean printMessagesWhileTraining)
                                   throws java.io.IOException
        Train a model using the GIS algorithm.
        Parameters:
        eventStream - The EventStream holding the data on which this model will be trained.
        iterations - The number of GIS iterations to perform.
        cutoff - The number of times a feature must be seen in order to be relevant for training.
        smoothing - Defines whether the created trainer will use smoothing while training the model.
        printMessagesWhileTraining - Determines whether training status messages are written to STDOUT.
        Returns:
        The newly trained model, which can be used immediately or saved to disk using an opennlp.tools.ml.maxent.io.GISModelWriter object.
        Throws:
        java.io.IOException
      • trainModel

        public static GISModel trainModel​(ObjectStream<Event> eventStream,
                                          int iterations,
                                          int cutoff,
                                          double sigma)
                                   throws java.io.IOException
        Train a model using the GIS algorithm.
        Parameters:
        eventStream - The EventStream holding the data on which this model will be trained.
        iterations - The number of GIS iterations to perform.
        cutoff - The number of times a feature must be seen in order to be relevant for training.
        sigma - The standard deviation for the gaussian smoother.
        Returns:
        The newly trained model, which can be used immediately or saved to disk using an opennlp.tools.ml.maxent.io.GISModelWriter object.
        Throws:
        java.io.IOException
      • trainModel

        public static GISModel trainModel​(int iterations,
                                          DataIndexer indexer,
                                          boolean smoothing)
        Train a model using the GIS algorithm.
        Parameters:
        iterations - The number of GIS iterations to perform.
        indexer - The object which will be used for event compilation.
        smoothing - Defines whether the created trainer will use smoothing while training the model.
        Returns:
        The newly trained model, which can be used immediately or saved to disk using an opennlp.tools.ml.maxent.io.GISModelWriter object.
      • trainModel

        public static GISModel trainModel​(int iterations,
                                          DataIndexer indexer)
        Train a model using the GIS algorithm.
        Parameters:
        iterations - The number of GIS iterations to perform.
        indexer - The object which will be used for event compilation.
        Returns:
        The newly trained model, which can be used immediately or saved to disk using an opennlp.tools.ml.maxent.io.GISModelWriter object.
      • trainModel

        public static GISModel trainModel​(int iterations,
                                          DataIndexer indexer,
                                          Prior modelPrior,
                                          int cutoff)
        Train a model using the GIS algorithm with the specified number of iterations, data indexer, and prior.
        Parameters:
        iterations - The number of GIS iterations to perform.
        indexer - The object which will be used for event compilation.
        modelPrior - The prior distribution for the model.
        Returns:
        The newly trained model, which can be used immediately or saved to disk using an opennlp.tools.ml.maxent.io.GISModelWriter object.
      • trainModel

        public static GISModel trainModel​(int iterations,
                                          DataIndexer indexer,
                                          boolean printMessagesWhileTraining,
                                          boolean smoothing,
                                          Prior modelPrior,
                                          int cutoff)
        Train a model using the GIS algorithm.
        Parameters:
        iterations - The number of GIS iterations to perform.
        indexer - The object which will be used for event compilation.
        printMessagesWhileTraining - Determines whether training status messages are written to STDOUT.
        smoothing - Defines whether the created trainer will use smoothing while training the model.
        modelPrior - The prior distribution for the model.
        cutoff - The number of times a predicate must occur to be used in a model.
        Returns:
        The newly trained model, which can be used immediately or saved to disk using an opennlp.tools.ml.maxent.io.GISModelWriter object.
      • trainModel

        public static GISModel trainModel​(int iterations,
                                          DataIndexer indexer,
                                          boolean printMessagesWhileTraining,
                                          boolean smoothing,
                                          Prior modelPrior,
                                          int cutoff,
                                          int threads)
        Train a model using the GIS algorithm.
        Parameters:
        iterations - The number of GIS iterations to perform.
        indexer - The object which will be used for event compilation.
        printMessagesWhileTraining - Determines whether training status messages are written to STDOUT.
        smoothing - Defines whether the created trainer will use smoothing while training the model.
        modelPrior - The prior distribution for the model.
        cutoff - The number of times a predicate must occur to be used in a model.
        Returns:
        The newly trained model, which can be used immediately or saved to disk using an opennlp.tools.ml.maxent.io.GISModelWriter object.