compound
: compound
compound is used for:
- noun compounds. (These should show the correct modification structure of noun compounds, and do - or should - in the English UD treebank. Note, however, that the current automatic Stanford UD converter still makes all nouns modify the rightmost noun of the noun phrase when run on corpora like the 1999 Penn Treebank 3 which do not show noun compound structure - there is no intelligent noun compound analysis. The correct results are achieved when run on corpora like OntoNotes which do represent the branching structure of noun phrases.)
This includes proper names that use regular syntactic relations—contrast with name:
- numbers
- adjectival compounds
- imitative reduplication
- idiomatic phrasal verbs are analyzed as a language-specific subrelation of compound
Treebank Statistics (UD_English)
This relation is universal.
There are 1 language-specific subtypes of compound
: compound:prt.
10251 nodes (4%) are attached to their parents as compound
.
10104 instances of compound
(99%) are right-to-left (child precedes parent).
Average distance between parent and child is 1.31196956394498.
The following 47 pairs of parts of speech are connected with compound
: NOUN-NOUN (5334; 52% instances), PROPN-PROPN (3039; 30% instances), NOUN-PROPN (1168; 11% instances), PROPN-NOUN (128; 1% instances), NUM-NUM (96; 1% instances), NOUN-NUM (53; 1% instances), NOUN-SYM (50; 0% instances), NOUN-VERB (50; 0% instances), VERB-NOUN (47; 0% instances), SYM-NUM (46; 0% instances), X-X (43; 0% instances), VERB-ADV (24; 0% instances), NOUN-X (21; 0% instances), NOUN-ADJ (18; 0% instances), VERB-ADP (16; 0% instances), VERB-PROPN (16; 0% instances), NUM-NOUN (11; 0% instances), ADJ-NOUN (9; 0% instances), PROPN-X (9; 0% instances), ADJ-NUM (8; 0% instances), NOUN-ADP (8; 0% instances), VERB-ADJ (8; 0% instances), NOUN-DET (6; 0% instances), NUM-SYM (4; 0% instances), VERB-VERB (4; 0% instances), ADJ-PROPN (3; 0% instances), NOUN-ADV (3; 0% instances), NOUN-PRON (3; 0% instances), X-NOUN (3; 0% instances), ADJ-X (2; 0% instances), NOUN-PART (2; 0% instances), NUM-PROPN (2; 0% instances), PRON-PROPN (2; 0% instances), PROPN-NUM (2; 0% instances), ADJ-ADJ (1; 0% instances), ADV-VERB (1; 0% instances), DET-SYM (1; 0% instances), NOUN-CONJ (1; 0% instances), NUM-DET (1; 0% instances), NUM-PRON (1; 0% instances), PROPN-ADV (1; 0% instances), PROPN-SYM (1; 0% instances), PROPN-VERB (1; 0% instances), SYM-PROPN (1; 0% instances), VERB-AUX (1; 0% instances), VERB-PRON (1; 0% instances), X-PROPN (1; 0% instances).
Treebank Statistics (UD_English-ESL)
This relation is universal.
There are 1 language-specific subtypes of compound
: compound:prt.
1340 nodes (1%) are attached to their parents as compound
.
1337 instances of compound
(100%) are right-to-left (child precedes parent).
Average distance between parent and child is 1.22462686567164.
The following 18 pairs of parts of speech are connected with compound
: NOUN-NOUN (973; 73% instances), PROPN-PROPN (247; 18% instances), NOUN-PROPN (64; 5% instances), PROPN-NOUN (15; 1% instances), NUM-NUM (12; 1% instances), NOUN-SYM (6; 0% instances), NOUN-VERB (4; 0% instances), NUM-PROPN (4; 0% instances), VERB-NOUN (4; 0% instances), PROPN-DET (3; 0% instances), ADJ-ADV (1; 0% instances), ADJ-NOUN (1; 0% instances), ADP-NOUN (1; 0% instances), ADP-VERB (1; 0% instances), NOUN-INTJ (1; 0% instances), NOUN-PRON (1; 0% instances), NUM-NOUN (1; 0% instances), PROPN-NUM (1; 0% instances).
Treebank Statistics (UD_English-LinES)
This relation is universal.
There are 1 language-specific subtypes of compound
: compound:prt.
1788 nodes (2%) are attached to their parents as compound
.
1783 instances of compound
(100%) are right-to-left (child precedes parent).
Average distance between parent and child is 1.19407158836689.
The following 16 pairs of parts of speech are connected with compound
: NOUN-NOUN (1411; 79% instances), PROPN-NOUN (210; 12% instances), NOUN-PROPN (83; 5% instances), NUM-NOUN (30; 2% instances), VERB-NOUN (18; 1% instances), ADJ-NOUN (12; 1% instances), PROPN-PROPN (7; 0% instances), ADP-NOUN (3; 0% instances), ADV-NOUN (3; 0% instances), NOUN-ADV (3; 0% instances), PUNCT-NOUN (2; 0% instances), X-NOUN (2; 0% instances), NUM-NUM (1; 0% instances), PRON-NOUN (1; 0% instances), PROPN-ADV (1; 0% instances), VERB-PROPN (1; 0% instances).
compound in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]