Global

Type Definitions

CorefMentionJSON

Properties:
Name Type Description
id number

Mention ID

text string

The text (literal word) of the mention

sentNum number

1-based index of the sentence containing this mention

headIndex number

1-based index

startIndex number

1-based index

endIndex number

1-based index

isRepresentativeMention boolean

Whether the mention is representative or not

animacy "ANIMATE" | "INANIMATE" | "UNKNOWN"

Mention's animacy

gender "FEMALE" | "MALE" | "NEUTRAL" | "UNKNOWN"

Gender of the mention

number "SINGULAR" | "PLURAL" | "UNKNOWN"

Cardinality of the mention

type "PRONOMINAL" | "NOMINAL" | "PROPER" | "LIST"

Mention type

position Array

Position is a pair of (sentence number, mention number in that sentence). It is used for indexing by mention.

A CorefMention.
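The property table above can be written down as a TypeScript interface. This is a minimal sketch transcribed from the table, not the library's own typings; `findRepresentative` is a hypothetical helper and the sample values are hand-written for illustration.

```typescript
// Shape transcribed from the property table above; not the library's own typings.
interface CorefMentionJSON {
  id: number;
  text: string;
  sentNum: number; // 1-based index of the sentence containing the mention
  headIndex: number; // 1-based
  startIndex: number; // 1-based
  endIndex: number; // 1-based
  isRepresentativeMention: boolean;
  animacy: "ANIMATE" | "INANIMATE" | "UNKNOWN";
  gender: "FEMALE" | "MALE" | "NEUTRAL" | "UNKNOWN";
  number: "SINGULAR" | "PLURAL" | "UNKNOWN";
  type: "PRONOMINAL" | "NOMINAL" | "PROPER" | "LIST";
  position: [number, number]; // (sentence number, mention number in that sentence)
}

// Hypothetical helper: the first mention flagged as representative, if any.
function findRepresentative(chain: CorefMentionJSON[]): CorefMentionJSON | undefined {
  return chain.filter((m) => m.isRepresentativeMention)[0];
}

// Hand-written sample values, for illustration only.
const sampleChain: CorefMentionJSON[] = [
  {
    id: 1, text: "Mary", sentNum: 1, headIndex: 1, startIndex: 1, endIndex: 2,
    isRepresentativeMention: true, animacy: "ANIMATE", gender: "FEMALE",
    number: "SINGULAR", type: "PROPER", position: [1, 1],
  },
  {
    id: 2, text: "she", sentNum: 2, headIndex: 1, startIndex: 1, endIndex: 2,
    isRepresentativeMention: false, animacy: "ANIMATE", gender: "FEMALE",
    number: "SINGULAR", type: "PRONOMINAL", position: [2, 1],
  },
];

const representative = findRepresentative(sampleChain);
```

Typing the enumerated fields as string-literal unions lets the compiler reject values outside the documented sets.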

DocumentJSON

Properties:
Name Type Description
index number
sentences Array.<Sentence>

The CoreNLP API JSON structure representing a document

ExpressionJSON

Properties:
Name Type Description
index number
sentences Array.<Array.<ExpressionSentenceMatch>>

The CoreNLP API JSON structure representing an expression. This structure is the output of TokensRegex, Semgrex and Tregex.

ExpressionSentenceMatchGroup

Properties:
Name Type Attributes Description
label string

group label

begin number

0-based begin index of the matched group, relative to the given text

end number

0-based end index of the matched group, relative to the given text

token Token <optional>

only given if aggregated with an annotated Sentence or Document

$label ExpressionSentenceMatchGroup <optional>

other groups inside
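Because a group can contain other groups under `$label` keys, walking one is naturally recursive. The sketch below assumes that nested groups are stored under keys that literally start with `$`; the interface is transcribed from the table above, `collectLabels` is a hypothetical helper, and the sample object is hand-written.

```typescript
// Shape transcribed from the property table above; not the library's own typings.
interface ExpressionSentenceMatchGroup {
  label: string;
  begin: number;
  end: number;
  token?: unknown; // a Token, only present when aggregated with an annotated Sentence or Document
  [nested: string]: unknown; // assumption: "$..." keys hold nested groups
}

// Hypothetical helper: the label of a group followed by the labels of all nested groups.
function collectLabels(group: ExpressionSentenceMatchGroup): string[] {
  let labels: string[] = [group.label];
  for (const key of Object.keys(group)) {
    if (key.charAt(0) === "$") {
      const nested = group[key] as ExpressionSentenceMatchGroup;
      labels = labels.concat(collectLabels(nested));
    }
  }
  return labels;
}

// Hand-written sample, for illustration only.
const sampleGroup: ExpressionSentenceMatchGroup = {
  label: "phrase",
  begin: 0,
  end: 3,
  $noun: { label: "noun", begin: 2, end: 3 },
};

const groupLabels = collectLabels(sampleGroup);
```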

ExpressionSentenceMatchJSON

Properties:
Name Type Attributes Description
begin number

word begin position, starting from zero

end number

word end position, starting from zero (no match ends at 0)

text string

matched text

$label string <optional>

any label, as defined in the expression pattern

An ExpressionSentenceMatch of either TokensRegex, Semgrex or Tregex.
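An ExpressionJSON holds one array of matches per sentence, so reading the matched text takes a nested loop. The following is a sketch under that reading of the tables above; the interfaces are transcriptions rather than the library's typings, `matchedTexts` is a hypothetical helper, and the sample data is made up.

```typescript
// Shapes transcribed from the property tables above; not the library's own typings.
interface ExpressionSentenceMatchJSON {
  begin: number; // word begin position, starting from zero
  end: number; // word end position, starting from zero
  text: string; // matched text
  [label: string]: unknown; // optional "$label" entries defined by the expression pattern
}

interface ExpressionJSON {
  index: number;
  sentences: ExpressionSentenceMatchJSON[][]; // one array of matches per sentence
}

// Hypothetical helper: the matched text of every match, in sentence order.
function matchedTexts(expr: ExpressionJSON): string[] {
  const texts: string[] = [];
  for (const sentence of expr.sentences) {
    for (const match of sentence) {
      texts.push(match.text);
    }
  }
  return texts;
}

// Hand-written sample, for illustration only; the second sentence has no matches.
const sampleExpression: ExpressionJSON = {
  index: 0,
  sentences: [
    [
      { begin: 0, end: 2, text: "New York" },
      { begin: 5, end: 6, text: "Boston", $city: "Boston" },
    ],
    [],
  ],
};

const allMatches = matchedTexts(sampleExpression);
```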

GovernorJSON

Properties:
Name Type Description
dep string
governor number
governorGloss string
dependent number
dependentGloss string

The CoreNLP API JSON structure representing a governor
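Each GovernorJSON describes one dependency edge between a governor token and a dependent token. As a rough illustration, the sketch below filters a list of edges by governor index; `dependentsOf` is a hypothetical helper and the two edges for "She runs" are hand-written, not captured from a real CoreNLP run.

```typescript
// Shape transcribed from the property table above; not the library's own typings.
interface GovernorJSON {
  dep: string;
  governor: number;
  governorGloss: string;
  dependent: number;
  dependentGloss: string;
}

// Hypothetical helper: the words that depend directly on the token at governorIndex.
function dependentsOf(edges: GovernorJSON[], governorIndex: number): string[] {
  return edges
    .filter((edge) => edge.governor === governorIndex)
    .map((edge) => edge.dependentGloss);
}

// Hand-written edges for "She runs", for illustration only.
const sampleEdges: GovernorJSON[] = [
  { dep: "ROOT", governor: 0, governorGloss: "ROOT", dependent: 2, dependentGloss: "runs" },
  { dep: "nsubj", governor: 2, governorGloss: "runs", dependent: 1, dependentGloss: "She" },
];

const dependentsOfRuns = dependentsOf(sampleEdges, 2);
```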

PosInfo

Properties:
Name Type Description
group string
tag string
examples Array.<string>

PosInfo is not part of CoreNLP. It is an indexed reference of POS tags by language, provided by this library, and is only useful for analysis and study. The data was collected from various documentation resources on the Web. PosInfo may vary depending on the POS annotation scheme in use: for example, CoreNLP for Spanish uses custom POS tags developed by Stanford, but this can be changed to Universal Dependencies, which uses different tags.
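Since PosInfo is described as an indexed reference, a lookup keyed by tag is one plausible way to consume it. The sketch below is only an assumption about usage: `indexByTag` is a hypothetical helper, and the `group` values and examples are invented for illustration rather than taken from the library's data.

```typescript
// Shape transcribed from the property table above; not the library's own typings.
interface PosInfo {
  group: string;
  tag: string;
  examples: string[];
}

// Hypothetical helper: builds a lookup table from POS tag to its PosInfo entry.
function indexByTag(entries: PosInfo[]): { [tag: string]: PosInfo } {
  const byTag: { [tag: string]: PosInfo } = {};
  for (const entry of entries) {
    byTag[entry.tag] = entry;
  }
  return byTag;
}

// Invented sample entries, for illustration only.
const sampleEntries: PosInfo[] = [
  { group: "Noun", tag: "NN", examples: ["dog", "table"] },
  { group: "Verb", tag: "VB", examples: ["run", "eat"] },
];

const posByTag = indexByTag(sampleEntries);
```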

SentenceJSON

Properties:
Name Type Description
index number

1-based index, as assigned by StanfordCoreNLP

tokens Array.<Token>

The CoreNLP API JSON structure representing a sentence

TokenJSON

Properties:
Name Type Description
index number
word string
originalText string
characterOffsetBegin number
characterOffsetEnd number
before string
after string

The CoreNLP API JSON structure representing a token
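DocumentJSON, SentenceJSON and TokenJSON nest inside one another, which the following sketch makes explicit. The interfaces are transcribed from the tables above and are not the library's own typings; `wordsBySentence` is a hypothetical helper, and the sample document for "Hi there" is hand-written, including its offsets and surrounding whitespace.

```typescript
// Shapes transcribed from the property tables above; not the library's own typings.
interface TokenJSON {
  index: number;
  word: string;
  originalText: string;
  characterOffsetBegin: number;
  characterOffsetEnd: number;
  before: string;
  after: string;
}

interface SentenceJSON {
  index: number; // 1-based, per the SentenceJSON table
  tokens: TokenJSON[];
}

interface DocumentJSON {
  index: number;
  sentences: SentenceJSON[];
}

// Hypothetical helper: the words of each sentence, in document order.
function wordsBySentence(doc: DocumentJSON): string[][] {
  return doc.sentences.map((sentence) => sentence.tokens.map((token) => token.word));
}

// Hand-written sample for the text "Hi there", for illustration only.
const sampleDocument: DocumentJSON = {
  index: 0,
  sentences: [
    {
      index: 1,
      tokens: [
        { index: 1, word: "Hi", originalText: "Hi", characterOffsetBegin: 0, characterOffsetEnd: 2, before: "", after: " " },
        { index: 2, word: "there", originalText: "there", characterOffsetBegin: 3, characterOffsetEnd: 8, before: " ", after: "" },
      ],
    },
  ],
};

const documentWords = wordsBySentence(sampleDocument);
```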