Type Definitions
CorefMentionJSON
- Source:
- See:
Properties:
Name | Type | Description |
---|---|---|
id |
number | Mention ID |
text |
string | The text (literal word) of the mention |
sentNum |
number | 1-based index of the sentence containinng this mention |
headIndex |
number | 1-based index |
startIndex |
number | 1-based index |
endIndex |
number | 1-based index |
isRepresentativeMention |
boolean | Wehther the mention word is representative or not |
animacy |
"ANIMATE" | "INANIMATE" | "UNKNOWN" | Mention's animacy |
gender |
"FEMALE" | "MALE" | "NEUTRAL" | "UNKNOWN" | Gender of the mention |
number |
"SINGULAR" | "PLURAL" | "UNKNOWN" | Cardinality of the mention |
type |
"PRONOMINAL" | "NOMINAL" | "PROPER" | "LIST" | Mention type |
position |
Array | Position is a binary tuple of (sentence number, mention number in that sentence). This is used for indexing by mention. |
A CorefMention.
DocumentJSON
- Source:
Properties:
Name | Type | Description |
---|---|---|
index |
number | |
sentences |
Array.<Sentence> |
The CoreNLP API JSON structure representing a document
ExpressionJSON
- Source:
Properties:
Name | Type | Description |
---|---|---|
index |
number | |
sentences |
Array.<Array.<ExpressionSentenceMatch>> |
The CoreNLP API JSON structure representing an expression
This expression structure can be found as the output of TokensRegex
,
Semgrex
and Tregex
.
ExpressionSentenceMatchGroup
Properties:
Name | Type | Attributes | Description |
---|---|---|---|
label |
string | group label |
|
begin |
number | 0-based index of the matched group, relative to the given text |
|
end |
number | 0-based index of the matched group, relative to the given text |
|
token |
Token |
<optional> |
onluy given if aggregated with an annotated Sentence or Document |
$label |
ExpressionSentenceMatchGroup |
<optional> |
other groups inside |
ExpressionSentenceMatchJSON
Properties:
Name | Type | Attributes | Description |
---|---|---|---|
begin |
number | word begin position, starting from zero |
|
end |
number | word end position, starting from zero (no match ends at 0) |
|
text |
string | matched text |
|
$label |
string |
<optional> |
any label, as defined in the expression pattern |
A ExpressionSentenceMatch of either TokensRegex
, Semgrex
or Tregex
.
GovernorJSON
- Source:
Properties:
Name | Type | Description |
---|---|---|
dep |
string | |
governor |
number | |
governorGloss |
string | |
dependent |
number | |
dependentGloss |
string |
The CoreNLP API JSON structure representing a governor
PosInfo
- Source:
Properties:
Name | Type | Description |
---|---|---|
group |
string | |
tag |
string | |
examples |
Array.<string> |
PosInfo does not come as part of the CoreNLP. It is an indexed reference of POS tags by language provided by this library. It's only helpful for analysis and study. The data was collected from different documentation resources on the Web. The PosInfo may vary depending on the POS annotation types used, for example, CoreNLP for Spanish uses custom POS tags developed by Stanford, but this can also be changed to Universal Dependencies, which uses different tags.
SentenceJSON
- Source:
Properties:
Name | Type | Description |
---|---|---|
index |
number | 1-based index, as they come indexed by StanfordCoreNLP |
tokens |
Array.<Token> |
The CoreNLP API JSON structure representing a sentence
TokenJSON
- Source:
Properties:
Name | Type | Description |
---|---|---|
index |
number | |
word |
string | |
originalText |
string | |
characterOffsetBegin |
number | |
characterOffsetEnd |
number | |
before |
string | |
after |
string |
The CoreNLP API JSON structure representing a token