The legacy parse format used by the Banker.
It has four major fields:
- tokens
is a vector of Token objects (in the order that they appear in the associated
sentence). The zeroth element is assumed to be the nexus.
- breadcrumb
tells you the unique neighbor of a token that is one step closer to the nexus in the
undirected tree (this can be the nexus itself); for instance, if breadcrumb(5) = 3,
then token 3 is one step closer to the nexus than token 5. The breadcrumb of the nexus
should be -1.
- children
tells you the set of children of a node in the polytree; for instance, if
children(5) = Set(3,6,7), then token 5 has three children: tokens 3, 6, and 7.
- arclabels
tells you the labeled neighbors of a node in the undirected tree; for instance,
if arclabels(5) = Set((4, 'det), (7, 'amod)), then token 5 has two neighbors, tokens 4 and 7,
reached by arcs labeled 'det and 'amod respectively (the labels are Scala Symbol objects).
tokens: the parsed sentence (the zeroth token of which should be the nexus)
breadcrumb: the breadcrumb of each token (see above definition)
children: the set of children of each token in the polytree
arclabels: the set of labeled neighbors of each token in the undirected tree
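
As an illustration of how the four fields fit together, here is a minimal sketch. It is not the library's actual class: the names `Token` (reduced here to a single `word` field), `LegacyParse`, and `pathToNexus` are assumptions made for the example, as is the choice of `Vector` for the three per-token fields. It encodes "the cat sleeps" with the nexus at index 0 and follows breadcrumbs from a token back to the nexus.

```scala
object LegacyParseSketch {
  // Hypothetical stand-in for the real Token class.
  final case class Token(word: String)

  // Hypothetical container mirroring the four documented fields.
  final case class LegacyParse(
      tokens: Vector[Token],
      breadcrumb: Vector[Int],
      children: Vector[Set[Int]],
      arclabels: Vector[Set[(Int, Symbol)]]
  ) {
    // Follow breadcrumbs from a token until the nexus (breadcrumb -1) is reached.
    def pathToNexus(token: Int): List[Int] = {
      val parent = breadcrumb(token)
      if (parent < 0) List(token) else token :: pathToNexus(parent)
    }
  }

  // Indices: 0 = nexus, 1 = "the", 2 = "cat", 3 = "sleeps".
  val example: LegacyParse = LegacyParse(
    tokens = Vector(Token("NEXUS"), Token("the"), Token("cat"), Token("sleeps")),
    // "the" -> "cat" -> "sleeps" -> nexus; the nexus itself has breadcrumb -1.
    breadcrumb = Vector(-1, 2, 3, 0),
    // Directed arcs of the polytree: nexus -> sleeps -> cat -> the.
    children = Vector(Set(3), Set.empty[Int], Set(1), Set(2)),
    // Undirected neighbors, each paired with its arc label.
    arclabels = Vector(
      Set((3, Symbol("root"))),
      Set((2, Symbol("det"))),
      Set((1, Symbol("det")), (3, Symbol("nsubj"))),
      Set((0, Symbol("root")), (2, Symbol("nsubj")))
    )
  )
}
```

With this encoding, `LegacyParseSketch.example.pathToNexus(1)` walks from "the" through "cat" and "sleeps" to the nexus. The labels are built with `Symbol("det")` rather than the `'det` literal because the literal syntax is deprecated in recent Scala versions.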
Wrapper for the polyparser using the DependencyParser interface.
This API lets you optionally specify a set of files (in CoNLL-X format) containing "gold" parses. If any are specified, the parser populates a cache with these parses. Whenever it is asked to parse a sentence, it checks this cache first and parses the sentence from scratch only on a cache miss.
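
The cache-first lookup described above can be sketched as follows. This is an illustrative sketch only, not the wrapper's real implementation: `CachingParser`, the `Sentence` and `Parse` type aliases, and the `misses` counter are all assumptions made for the example, and the gold parses are passed in as an already-built `Map` rather than read from CoNLL-X files.

```scala
object GoldCacheSketch {
  // Hypothetical simplifications: a sentence is its words, a parse is its breadcrumbs.
  type Sentence = Vector[String]
  type Parse = Vector[Int]

  // Serves gold parses from the cache and falls back to the parser on a miss.
  final class CachingParser(gold: Map[Sentence, Parse], fallback: Sentence => Parse) {
    private var missCount = 0

    // Number of sentences that had to be parsed from scratch.
    def misses: Int = missCount

    def parse(sentence: Sentence): Parse =
      // getOrElse takes its default by name, so the fallback runs only on a miss.
      gold.getOrElse(sentence, { missCount += 1; fallback(sentence) })
  }
}
```

A caller would build the `gold` map once from the gold files and then route every request through `parse`, so sentences with known gold parses never reach the underlying parser.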
Interface between the Banker tool and the Polyparser, including a spray-json protocol for objects used by the banker UI.
The spray-json protocol should ease the migration to reming (or allow the banker UI to stay on spray-json). The banker UI uses TransitionConstraint and PolytreeParse in its models.