A C E F G I L M N O P R S T U W X
All Classes All Packages
A
- addToTokenList(Token) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.POSContainer
-
Adds token to wordTokenList.
- AnalysePatents - Class in uk.ac.cam.ch.wwmm.chemicaltagger.modelParser
-
First steps at creating a corpus for the ChemicalTagger model parser.
- AnalysePatents() - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.modelParser.AnalysePatents
- ASTtoXML - Class in uk.ac.cam.ch.wwmm.chemicaltagger
-
Converts ANTLR ASTTrees into XML Documents.
- ASTtoXML() - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.ASTtoXML
-
Default constructor method.
- ASTtoXML(String[], String[]) - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.ASTtoXML
C
- ChemicalTaggerTokeniser - Interface in uk.ac.cam.ch.wwmm.chemicaltagger
-
An interface for tokenisers.
- ChemistryModelParser - Class in uk.ac.cam.ch.wwmm.chemicaltagger.modelParser
-
Runs the ChemicalParser model against text.
- ChemistryModelParser() - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.modelParser.ChemistryModelParser
- ChemistryPOSTagger - Class in uk.ac.cam.ch.wwmm.chemicaltagger
-
Converts string input into tokenised and tagged text.
- ChemistryPOSTagger(ChemicalTaggerTokeniser, List<Tagger>) - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.ChemistryPOSTagger
-
Custom constructor for setting up non-standard ChemicalTagger operations.
- ChemistryPOSTagger(ChemicalTaggerTokeniser, OscarTagger, RegexTagger, OpenNLPTagger) - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.ChemistryPOSTagger
-
Deprecated.
- ChemistrySentenceParser - Class in uk.ac.cam.ch.wwmm.chemicaltagger
-
Passes tagged sentences to the ANTLR grammar.
- ChemistrySentenceParser(InputStream) - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.ChemistrySentenceParser
-
Constructor method for inputStream objects.
- ChemistrySentenceParser(String) - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.ChemistrySentenceParser
-
Constructor method for String objects.
- ChemistrySentenceParser(POSContainer) - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.ChemistrySentenceParser
-
Constructor method for POSContainer objects.
- cleanHTMLText(String) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.Utils
-
Strips HTML characters from text.
- combineTaggers() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.POSContainer
-
Combines the output of all the taggers.
- containsNumber(String) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.Utils
-
Checks if a string contains a number.
- convert(Document) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.XMLtoAST
-
Converts an XML Document to an AST.
- convert(Parse) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.modelParser.ParseTreetoXML
-
Converts Parse Trees to XML Document.
- convert(Tree) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.ASTtoXML
-
Overloaded method that converts an astTree to an XML Document with ActionPhrases included.
- convert(Tree, boolean) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.ASTtoXML
-
Converts AST Trees to XML Document.
- convert(Tree, boolean, HashMap<String, String>) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.ASTtoXML
-
Converts astTree to XML Document.
- correctCombinedTagsList() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.PostProcessTags
-
Corrects known tagging problems in the tags in combinedtagsList.
- correctCombinedTagsList(List<String>) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.PostProcessTags
-
Corrects known tagging problems in the tags in combinedtagsList.
- createPosTagListFromStringArray(String[]) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.OpenNLPTagger
-
Creates the posTagList from the openNLP string Array format.
- CreateTreeBank - Class in uk.ac.cam.ch.wwmm.chemicaltagger.modelParser
-
Creates a Treebank from the rule-based ChemicalTagger output.
- CreateTreeBank() - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.modelParser.CreateTreeBank
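The combineTaggers() entry above says only that the output of all taggers is combined, and ChemistryPOSTagger exposes its taggers in descending priority. One plausible reading is sketched below; it is not the library's implementation. It assumes each tagger yields one tag per token, that null means "no tag", and that the highest-priority non-null tag wins. The class name, method name, and tag names (CHEM, VERB, NOUN) are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class CombineTagsSketch {

    /**
     * For each token position, keeps the tag from the first list that has one.
     * Lists are given in descending priority; null marks "no tag" (an assumption).
     */
    public static List<String> combine(List<List<String>> tagListsByPriority, int tokenCount) {
        List<String> combined = new ArrayList<>();
        for (int i = 0; i < tokenCount; i++) {
            String chosen = null;
            for (List<String> tags : tagListsByPriority) {
                String candidate = tags.get(i);
                if (candidate != null) {
                    chosen = candidate;
                    break; // a higher-priority tagger already decided this token
                }
            }
            combined.add(chosen);
        }
        return combined;
    }

    public static void main(String[] args) {
        List<List<String>> lists = Arrays.asList(
                Arrays.asList(null, "CHEM", null),       // highest priority
                Arrays.asList("VERB", null, null),
                Arrays.asList("NOUN", "NOUN", "NOUN"));  // fallback tagger
        System.out.println(combine(lists, 3)); // prints [VERB, CHEM, NOUN]
    }
}
```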
E
- ExtractFromXML - Class in uk.ac.cam.ch.wwmm.chemicaltagger
-
Extracts text from XML Patents.
F
- Formatter - Class in uk.ac.cam.ch.wwmm.chemicaltagger
-
Preprocesses text before it gets passed to tokenisation and tagging classes.
G
- generateOscarTokenSequence(List<Token>, String) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.OscarTagger
-
Generates an OSCAR TokenSequence from a list of tokens.
- getActionMap() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.PostProcessTrees
-
Getter method for actionMap.
- getCombinedTagsList() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.POSContainer
-
Getter method for CombinedTagsList.
- getContent(InputStream, String) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.ExtractFromXML
-
Returns the content of a searchTag from an XML File.
- getContent(String) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.modelParser.CreateTreeBank
-
Extracts the content from the patent files.
- getCTTokeniser() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.ChemistryPOSTagger
-
Getter method for ChemicalTaggerTokeniser.
- getDefaultInstance() - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.ChemistryPOSTagger
-
Gets the default ChemistryPOSTagger instance - recommended for standard ChemicalTagger processing.
- getExperimentalParagraphs(String) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.modelParser.AnalysePatents
- getIgnoredTags() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.OpenNLPTagger
- getIgnoredTags() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.OscarTagger
- getIgnoredTags() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.RegexTagger
- getIgnoredTags() - Method in interface uk.ac.cam.ch.wwmm.chemicaltagger.Tagger
- getInputStream(Class<?>, String) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.Utils
-
Returns the content of the resource as an inputstream.
- getInputText() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.POSContainer
-
Getter method for InputText.
- getInstance() - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.OpenNLPTagger
-
Returns an OpenNLPTagger singleton.
- getName() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.Rule
-
Getter method for name.
- getNextTerminalElement(Element) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.Utils
-
Gets the next terminal element.
- getNodes(Element, ParserRuleContext) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.XMLtoAST
-
A recursive function that goes through the leaves of the tree to create XML nodes.
- getNodes(Parse, Element) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.modelParser.ParseTreetoXML
-
A recursive function that goes through the leaves of the tree to create XML nodes.
- getNodes(Tree, Element) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.ASTtoXML
-
A recursive function that goes through the leaves of the tree to create XML nodes.
- getOpenNLPTagger() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.ChemistryPOSTagger
-
Getter method for OpenNLPTagger.
- getOscarTagger() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.ChemistryPOSTagger
-
Getter method for OscarTagger.
- getParseTree() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.SentenceParser
-
Getter method for parseTree.
- getPathAsInputStream(String) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.Utils
-
Returns the content of the resource as an inputstream.
- getPattern() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.Rule
-
Getter method for pattern.
- getPreviousSiblingOrParentsSibling(Element) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.Utils
-
Gets the previous sibling if one exists, or the previous sibling of the first parent element with previous siblings.
- getPreviousTerminalElement(Element) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.Utils
-
Gets the previous terminal element.
- getPrioritiseOscar() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.POSContainer
-
Getter method for prioritiseOscar.
- getRegexTagger() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.ChemistryPOSTagger
-
Getter method for RegexTagger.
- getRules() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.RegexTagger
-
Getter method for rules.
- getSentenceList() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.XMLtoAST
- getSpectrumElementList() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.POSContainer
-
Getter method for SpectrumList.
- getStringTree() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.XMLtoAST
- getStringValue(Element, String) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.ExtractFromXML
-
Parses an XML file and saves the output into DocumentContainer.
- getTaggedTokenInStream() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.SentenceParser
-
Getter method for taggedTokenInputStream.
- getTagger() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.OpenNLPTagger
-
Getter method for posTagger.
- getTaggersOrderedInDescendingPriority() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.ChemistryPOSTagger
-
Getter method for taggersOrderedInDescendingPriority.
- getTagListContainer() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.POSContainer
- getTokenTagTupleAsString() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.POSContainer
-
Gets the output of the string and tag as: "TAG TOKEN".
- getWordTokenList() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.POSContainer
-
Getter method for WordTokenList.
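The getTokenTagTupleAsString() entry above describes its output as "TAG TOKEN". A minimal sketch of that layout follows, assuming parallel tag and token lists and a single space between pairs; the separator, the class name, and the method name are assumptions, and plain strings stand in for the library's Token type.

```java
import java.util.Arrays;
import java.util.List;

public class TagTokenSketch {

    /** Joins parallel lists into "TAG TOKEN TAG TOKEN ..." (separator is an assumption). */
    public static String format(List<String> tags, List<String> tokens) {
        if (tags.size() != tokens.size()) {
            throw new IllegalArgumentException("tags and tokens must have the same length");
        }
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < tokens.size(); i++) {
            if (i > 0) {
                sb.append(' ');
            }
            sb.append(tags.get(i)).append(' ').append(tokens.get(i));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Illustrative tags only; the real tag set is not listed in this index.
        System.out.println(format(Arrays.asList("VB", "NN"), Arrays.asList("Add", "water")));
    }
}
```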
I
- initializeRules() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.RegexTagger
-
Initialises the rules for the regular expression tagger.
L
- loadDefaultActionMap() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.PostProcessTrees
-
Loads default ActionMap for ChemicalTagger.
- loadsTagsFromFile(Class<?>) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.Utils
-
Loads the tags from the .tokens file generated by ANTLR.
M
- main(String[]) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.modelParser.AnalysePatents
-
Main method.
- main(String[]) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.modelParser.CreateTreeBank
-
Main method.
- makeNCName(String) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.Utils
-
Replaces all non-XML characters with _.
- makeXMLDocument() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.ChemistrySentenceParser
- makeXMLDocument() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.SentenceParser
-
Creates an XML document from the parseTree.
- makeXMLDocument(boolean) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.ChemistrySentenceParser
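The makeNCName(String) entry above replaces characters that are not legal in an XML name with an underscore. The sketch below shows the idea for an ASCII-only subset of the NCName rules; it is not the library's code. Real NCNames also allow many non-ASCII letters, and the leading-underscore fix-up for names that start with a digit, dot, or hyphen is an assumption. The class and method names are hypothetical.

```java
public class NcNameSketch {

    /**
     * Maps a string to an ASCII-only NCName: letters, digits, '_', '.', and '-'
     * are kept, everything else becomes '_'. A leading '_' is added when the
     * first character could not start an NCName.
     */
    public static String toAsciiNcName(String raw) {
        String cleaned = raw.replaceAll("[^A-Za-z0-9_.-]", "_");
        if (cleaned.isEmpty() || !cleaned.substring(0, 1).matches("[A-Za-z_]")) {
            cleaned = "_" + cleaned;
        }
        return cleaned;
    }

    public static void main(String[] args) {
        System.out.println(toAsciiNcName("NN-CHEM ENTITY")); // prints NN-CHEM_ENTITY
        System.out.println(toAsciiNcName("5mL"));            // prints _5mL
    }
}
```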
N
- normaliseText(String) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.Formatter
-
Normalises sentences e.g.
O
- OpenNLPTagger - Class in uk.ac.cam.ch.wwmm.chemicaltagger
-
Runs the OpenNLP tagger.
- oscar - Variable in class uk.ac.cam.ch.wwmm.chemicaltagger.OscarTagger
- OscarTagger - Class in uk.ac.cam.ch.wwmm.chemicaltagger
-
Runs the OSCAR tagger.
- OscarTagger(Oscar) - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.OscarTagger
-
Default constructor method.
- OscarTokeniser - Class in uk.ac.cam.ch.wwmm.chemicaltagger
- OscarTokeniser() - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.OscarTokeniser
-
Public Constructor method.
P
- parseSentence(String) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.modelParser.ChemistryModelParser
- parseTags() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.ChemistrySentenceParser
- parseTags() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.SentenceParser
-
Passes an inputstream to ANTLR and produces a parse tree.
- ParseTreetoXML - Class in uk.ac.cam.ch.wwmm.chemicaltagger.modelParser
-
Converts OpenNLP Parse Trees into XML Documents.
- ParseTreetoXML() - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.modelParser.ParseTreetoXML
-
Default constructor method.
- POSContainer - Class in uk.ac.cam.ch.wwmm.chemicaltagger
-
A container class that stores the grammatical structure of the text.
- POSContainer() - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.POSContainer
-
Default constructor method.
- PostProcessTags - Class in uk.ac.cam.ch.wwmm.chemicaltagger
-
Corrects Mistagged tokens.
- PostProcessTags(POSContainer) - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.PostProcessTags
-
Corrects mistagged tokens within the given POSContainer.
- PostProcessTrees - Class in uk.ac.cam.ch.wwmm.chemicaltagger
-
Passes tagged sentences to the ANTLR grammar.
- PostProcessTrees() - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.PostProcessTrees
-
Default Constructor method.
- printPrettyXML(Document) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.ChemistrySentenceParser
- process(Document) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.PostProcessTrees
-
Adds action phrases and roles to documents.
R
- readSentence(String) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.Utils
-
Loads a "sentence" file consisting of a single line of text.
- recombineTokens(POSContainer) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.RecombineTokens
-
Indexes the tokens that need combining and then calls combineTokens.
- RecombineTokens - Class in uk.ac.cam.ch.wwmm.chemicaltagger
-
Combines tokens that have been erroneously split.
- RegexTagger - Class in uk.ac.cam.ch.wwmm.chemicaltagger
-
Runs the regular expression tagger.
- RegexTagger() - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.RegexTagger
-
Public Constructor.
- RegexTagger(String) - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.RegexTagger
-
Public Constructor.
- RegexTagger(List<String>) - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.RegexTagger
-
Public Constructor.
- registerTagList(List<String>) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.POSContainer
-
Adds tagLists to the tagListContainer.
- Rule - Class in uk.ac.cam.ch.wwmm.chemicaltagger
-
The Rule class.
- Rule(String, String) - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.Rule
-
Public Constructor method.
- Rule(String, String, boolean) - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.Rule
-
Public Constructor method.
- run() - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.SentenceParser
-
Runs the thread.
- runChemicalTagger(String) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.Utils
-
Overloading method for running chemicalTagger. Sets runSpectraTagger to false by default.
- runChemicalTagger(String, boolean) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.Utils
-
Convenience method for running chemicalTagger. Includes a flag for setting spectraTagger.
- runTagger(List<Token>, String) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.OpenNLPTagger
-
Runs the OpenNLP POS tagger against a list of tokens and returns a list of tags.
- runTagger(List<Token>, String) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.OscarTagger
-
Runs OSCAR over a list of tokens and returns a list of tags.
- runTagger(List<Token>, String) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.RegexTagger
-
Runs the regular expression tagger against a list of tokens and returns a list of tags.
- runTagger(List<Token>, String) - Method in interface uk.ac.cam.ch.wwmm.chemicaltagger.Tagger
-
Runs the tagger against the tokens and returns the tags as a list.
- runTagger(POSContainer) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.SpectraTagger
-
Tags the NMR Spectra in a chemistry text.
- runTaggers(String) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.ChemistryPOSTagger
-
Overloading method for runTaggers passing the default flag for useSpectraTagger to ChemistryPOSTagger.runTaggers(String, boolean).
- runTaggers(String, boolean) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.ChemistryPOSTagger
-
Normalises the input sentence, optionally removes spectra, then tokenises the input and runs the taggers over the tokens.
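The Rule(String, String) and RegexTagger entries above pair a rule name with a pattern and produce one tag per token. The sketch below illustrates that idea with java.util.regex only and does not reproduce the library's classes. It assumes a rule must match the whole token, that the first matching rule wins, and that an unmatched token gets null; none of these details are stated in this index. The class, method, and rule names are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

public class RegexTagSketch {

    // Insertion order doubles as rule priority (an assumption).
    private final Map<String, Pattern> rules = new LinkedHashMap<>();

    /** Registers a rule: a tag name plus the regular expression that triggers it. */
    public void addRule(String tagName, String regex) {
        rules.put(tagName, Pattern.compile(regex));
    }

    /** Returns one tag per token; null where no rule matches the whole token. */
    public List<String> tag(List<String> tokens) {
        List<String> tags = new ArrayList<>();
        for (String token : tokens) {
            String tag = null;
            for (Map.Entry<String, Pattern> rule : rules.entrySet()) {
                if (rule.getValue().matcher(token).matches()) {
                    tag = rule.getKey();
                    break;
                }
            }
            tags.add(tag);
        }
        return tags;
    }

    public static void main(String[] args) {
        RegexTagSketch tagger = new RegexTagSketch();
        tagger.addRule("NUMBER", "\\d+(\\.\\d+)?");
        tagger.addRule("UNIT", "(mL|g|mol)");
        System.out.println(tagger.tag(Arrays.asList("5", "mL", "water"))); // prints [NUMBER, UNIT, null]
    }
}
```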
S
- SentenceParser - Class in uk.ac.cam.ch.wwmm.chemicaltagger
-
Passes tagged sentences to the ANTLR grammar.
- SentenceParser(InputStream) - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.SentenceParser
-
Constructor method for inputStream objects.
- SentenceParser(String) - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.SentenceParser
-
Constructor method for String objects.
- SentenceParser(POSContainer) - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.SentenceParser
-
Constructor method for POSContainer objects.
- setActionMap(HashMap<String, String>) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.PostProcessTrees
-
Setter method for actionMap.
- setCombinedTagsList(List<String>) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.POSContainer
-
Setter method for CombinedTagsList.
- setInputText(String) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.POSContainer
-
Setter method for InputText.
- setParseTree(ParseTree) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.SentenceParser
-
Setter method for parseTree.
- setPrioritiseOscar(boolean) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.POSContainer
-
Setter method for prioritiseOscar.
- setRules(List<Rule>) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.RegexTagger
-
Setter method for rules.
- setSpectrumList(List<Element>) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.POSContainer
-
Setter method for SpectrumList.
- setTaggedTokenInStream(InputStream) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.SentenceParser
-
Setter method for taggedTokenInputStream.
- setWordTokenList(List<Token>) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.POSContainer
-
Setter method for WordTokenList.
- SpectraTagger - Class in uk.ac.cam.ch.wwmm.chemicaltagger
-
Runs the OSCAR SpectraTagger to pull out the NMR Spectra.
- subTokeniseTokens(List<Token>) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.Formatter
-
Returns the list of tokens with tokens divided into further tokens in cases where this improves tagging e.g.
T
- tag(String[]) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.OpenNLPTagger
- Tagger - Interface in uk.ac.cam.ch.wwmm.chemicaltagger
-
An interface for a tagger.
- tokenise(String) - Method in interface uk.ac.cam.ch.wwmm.chemicaltagger.ChemicalTaggerTokeniser
-
Tokeniser method.
- tokenise(String) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.OscarTokeniser
-
Tokenises input text using the OSCAR4 tokeniser.
- tokenise(String) - Method in class uk.ac.cam.ch.wwmm.chemicaltagger.WhiteSpaceTokeniser
-
Tokenises a String on white space.
- tokensToSpaceDelimitedStr(List<Token>) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.Utils
-
Concatenates token surfaces, adding a space between them.
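The WhiteSpaceTokeniser.tokenise(String) and Utils.tokensToSpaceDelimitedStr(List<Token>) entries above describe splitting on white space and re-joining token surfaces with single spaces. A rough round-trip sketch follows. It uses plain strings in place of the library's Token objects, and the handling of leading, trailing, and repeated white space is an assumption. The class and method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class WhitespaceSketch {

    /** Splits on runs of white space; blank input yields an empty list. */
    public static List<String> tokenise(String text) {
        String trimmed = text.trim();
        if (trimmed.isEmpty()) {
            return new ArrayList<>();
        }
        return new ArrayList<>(Arrays.asList(trimmed.split("\\s+")));
    }

    /** Joins token surfaces with a single space between them. */
    public static String join(List<String> tokens) {
        return String.join(" ", tokens);
    }

    public static void main(String[] args) {
        List<String> tokens = tokenise("  Add 5 mL\tof water ");
        System.out.println(tokens);       // prints [Add, 5, mL, of, water]
        System.out.println(join(tokens)); // prints Add 5 mL of water
    }
}
```

Note that the round trip normalises white space, so the joined string need not equal the original input.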
U
- uk.ac.cam.ch.wwmm.chemicaltagger - package uk.ac.cam.ch.wwmm.chemicaltagger
- uk.ac.cam.ch.wwmm.chemicaltagger.modelParser - package uk.ac.cam.ch.wwmm.chemicaltagger.modelParser
- Utils - Class in uk.ac.cam.ch.wwmm.chemicaltagger
-
A Utils class.
W
- WhiteSpaceTokeniser - Class in uk.ac.cam.ch.wwmm.chemicaltagger
-
A whitespace tokeniser to be used as a substitute to the OSCAR tokeniser.
- WhiteSpaceTokeniser() - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.WhiteSpaceTokeniser
-
Default constructor method.
- writeListToFile(List<String>, String) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.Utils
-
Writes items of a list to a file.
- writeXMLToFile(Document, String) - Static method in class uk.ac.cam.ch.wwmm.chemicaltagger.Utils
-
Writes out an XML document to a file.
X
- XMLtoAST - Class in uk.ac.cam.ch.wwmm.chemicaltagger
-
Converts XML Documents into ANTLR ASTs.
- XMLtoAST() - Constructor for class uk.ac.cam.ch.wwmm.chemicaltagger.XMLtoAST