
This page pertains to UD version 2.

Treebank Statistics: UD_English-ESL: POS Tags: NOUN

There is 1 NOUN lemma (6%), 1 NOUN type (6%) and 14135 NOUN tokens (16%). Out of 17 observed tags, NOUN ranks 8th in number of lemmas, 8th in number of types and 1st in number of tokens.

The 10 most frequent NOUN lemmas: _

The 10 most frequent NOUN types: _

The 10 most frequent ambiguous lemmas: _ (NOUN 14135, VERB 13583, PRON 9575, DET 9068, PUNCT 8624, ADP 7769, ADJ 5278, ADV 5121, AUX 4111, PART 3169, CONJ 2865, SCONJ 2278, PROPN 1574, NUM 776, INTJ 67, X 60, SYM 37)

The 10 most frequent ambiguous types: _ (NOUN 14135, VERB 13583, PRON 9575, DET 9068, PUNCT 8624, ADP 7769, ADJ 5278, ADV 5121, AUX 4111, PART 3169, CONJ 2865, SCONJ 2278, PROPN 1574, NUM 776, INTJ 67, X 60, SYM 37)
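The token share and token rank reported for NOUN can be rechecked from the per-tag counts in the list above. The following is a minimal Python sketch, assuming those 17 counts cover every token in the treebank; the names `TOKEN_COUNTS`, `token_share` and `token_rank` are illustrative and not part of any UD tool.

```python
# Token counts per tag, copied from the list of ambiguous types above.
TOKEN_COUNTS = {
    "NOUN": 14135, "VERB": 13583, "PRON": 9575, "DET": 9068,
    "PUNCT": 8624, "ADP": 7769, "ADJ": 5278, "ADV": 5121,
    "AUX": 4111, "PART": 3169, "CONJ": 2865, "SCONJ": 2278,
    "PROPN": 1574, "NUM": 776, "INTJ": 67, "X": 60, "SYM": 37,
}

def token_share(tag, counts):
    """Return the tag's share of all tokens as a rounded percentage."""
    return round(100 * counts[tag] / sum(counts.values()))

def token_rank(tag, counts):
    """Return the 1-based rank of the tag by token count (1 = most frequent)."""
    ordered = sorted(counts, key=counts.get, reverse=True)
    return ordered.index(tag) + 1

print(token_share("NOUN", TOKEN_COUNTS))  # 16
print(token_rank("NOUN", TOKEN_COUNTS))   # 1
```

Under that assumption the counts sum to 88090 tokens, which is consistent with the 16% share and the first rank reported for NOUN.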

Morphology

The form / lemma ratio of NOUN is 1.000000 (the average of all parts of speech is 1.000000).
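The definition behind this ratio is not stated on the page. A minimal sketch, assuming it is the number of distinct forms divided by the number of distinct lemmas for one part of speech, could look like this (`form_lemma_ratio` is a hypothetical helper, and the unmasked sample data is invented for contrast):

```python
from collections import defaultdict

def form_lemma_ratio(pairs):
    """Average number of distinct forms per distinct lemma.

    `pairs` is an iterable of (form, lemma) tuples for one part of speech.
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        return 0.0
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)

# Here every NOUN token has the placeholder "_" as both form and lemma.
masked = [("_", "_")] * 5
print(form_lemma_ratio(masked))  # 1.0

# Invented unmasked data: "book" has two forms, "idea" has one.
sample = [("book", "book"), ("books", "book"), ("idea", "idea")]
print(form_lemma_ratio(sample))  # 1.5
```

With a single placeholder form and a single placeholder lemma, any definition of this kind yields exactly 1, so the ratio carries no morphological information for this treebank.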

The highest number of forms (1) was observed with the lemma “_”: _.

NOUN does not occur with any features.

Relations

NOUN nodes are attached to their parents using 34 different relations: nmod (5311; 38% instances), dobj (3397; 24% instances), nsubj (1868; 13% instances), conj (926; 7% instances), compound (898; 6% instances), root (464; 3% instances), nmod:tmod (237; 2% instances), nsubjpass (186; 1% instances), nmod:npmod (156; 1% instances), advcl (129; 1% instances), appos (108; 1% instances), ccomp (107; 1% instances), nmod:poss (101; 1% instances), parataxis (59; 0% instances), xcomp (51; 0% instances), acl:relcl (41; 0% instances), mwe (28; 0% instances), iobj (13; 0% instances), goeswith (10; 0% instances), acl (9; 0% instances), list (8; 0% instances), amod (4; 0% instances), csubj (4; 0% instances), discourse (3; 0% instances), nummod (3; 0% instances), vocative (3; 0% instances), csubjpass (2; 0% instances), mark (2; 0% instances), remnant (2; 0% instances), case (1; 0% instances), cc (1; 0% instances), det (1; 0% instances), dislocated (1; 0% instances), foreign (1; 0% instances)

Parents of NOUN nodes belong to 16 different categories, counting the artificial root alongside 15 parts of speech: VERB (8313; 59% instances), NOUN (3872; 27% instances), ADJ (949; 7% instances), [no parent: root] (464; 3% instances), ADV (139; 1% instances), PROPN (120; 1% instances), PRON (107; 1% instances), NUM (90; 1% instances), ADP (31; 0% instances), DET (27; 0% instances), SYM (8; 0% instances), X (5; 0% instances), SCONJ (4; 0% instances), AUX (3; 0% instances), CONJ (2; 0% instances), INTJ (1; 0% instances)
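Counts like the relation and parent statistics above can be gathered from a CoNLL-U file by reading the UPOS, HEAD and DEPREL columns. The sketch below is only an illustration on an invented three-token sentence, not the script that generated this page; `read_sentences` and `parent_stats` are hypothetical names. The artificial root (HEAD 0) has no part of speech, so the sketch records it under an empty string.

```python
from collections import Counter

# Invented three-token sentence in CoNLL-U format (10 tab-separated columns).
SAMPLE = "\n".join([
    "# sent_id = example-1",
    "1\t_\t_\tDET\tDT\t_\t2\tdet\t_\t_",
    "2\t_\t_\tNOUN\tNN\t_\t3\tnsubj\t_\t_",
    "3\t_\t_\tVERB\tVBZ\t_\t0\troot\t_\t_",
    "",
])

def read_sentences(text):
    """Yield each sentence as a list of (id, upos, head, deprel) tuples."""
    sentence = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if not cols[0].isdigit():
            continue
        sentence.append((int(cols[0]), cols[3], int(cols[6]), cols[7]))
    if sentence:
        yield sentence

def parent_stats(text, tag="NOUN"):
    """Count incoming relations and parent tags for nodes with the given tag."""
    relations, parent_tags = Counter(), Counter()
    for sentence in read_sentences(text):
        upos_by_id = {tok_id: upos for tok_id, upos, _, _ in sentence}
        for _, upos, head, deprel in sentence:
            if upos != tag:
                continue
            relations[deprel] += 1
            # Head 0 is the artificial root, which has no part of speech.
            parent_tags[upos_by_id.get(head, "")] += 1
    return relations, parent_tags

relations, parent_tags = parent_stats(SAMPLE)
print(relations)    # Counter({'nsubj': 1})
print(parent_tags)  # Counter({'VERB': 1})
```

Calling `parent_stats(SAMPLE, tag="VERB")` on the same sample counts one `root` relation whose parent tag is the empty string, which mirrors how root-attached nodes show up in the parent list without a part of speech.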

2041 (14%) NOUN nodes are leaves.

3998 (28%) NOUN nodes have one child.

4444 (31%) NOUN nodes have two children.

3652 (26%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 14.
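The child-degree figures above come from counting, for each NOUN node, how many tokens name it as their head. A minimal sketch of that count on an invented sentence follows, together with a consistency check of the four reported buckets; `child_degree_histogram` is a hypothetical helper, and the input is assumed to be already parsed into (id, upos, head) tuples.

```python
from collections import Counter

def child_degree_histogram(sentence, tag="NOUN"):
    """Map number of children -> number of nodes with the given tag.

    `sentence` is a list of (id, upos, head) tuples; head 0 marks the root.
    """
    children = Counter(head for _, _, head in sentence)
    return Counter(children[tok_id] for tok_id, upos, _ in sentence if upos == tag)

# Invented sentence: DET(1) -> NOUN(2) -> VERB(3, root) <- NOUN(4)
sentence = [(1, "DET", 2), (2, "NOUN", 3), (3, "VERB", 0), (4, "NOUN", 3)]
histogram = child_degree_histogram(sentence)
print(histogram[0], histogram[1])  # 1 1

# The four buckets reported above should add up to all 14135 NOUN tokens.
reported = {"leaf": 2041, "one": 3998, "two": 4444, "three_or_more": 3652}
total = sum(reported.values())
print(total)  # 14135
print({k: round(100 * v / total) for k, v in reported.items()})
# {'leaf': 14, 'one': 28, 'two': 31, 'three_or_more': 26}
```

The buckets sum to the NOUN token count and the rounded shares match the percentages given above, so the four statements partition the NOUN nodes without overlap.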

Children of NOUN nodes are attached using 39 different relations: det (6240; 22% instances), case (5676; 20% instances), amod (3034; 11% instances), nmod (2236; 8% instances), nmod:poss (1825; 7% instances), punct (1432; 5% instances), conj (987; 4% instances), compound (950; 3% instances), cc (825; 3% instances), cop (810; 3% instances), acl:relcl (744; 3% instances), nsubj (705; 3% instances), nummod (430; 2% instances), acl (421; 2% instances), advmod (407; 1% instances), mark (200; 1% instances), det:predet (168; 1% instances), neg (162; 1% instances), advcl (156; 1% instances), aux (132; 0% instances), appos (129; 0% instances), parataxis (85; 0% instances), expl (53; 0% instances), csubj (44; 0% instances), goeswith (34; 0% instances), nmod:npmod (23; 0% instances), dobj (15; 0% instances), nmod:tmod (15; 0% instances), cc:preconj (14; 0% instances), xcomp (14; 0% instances), list (10; 0% instances), discourse (6; 0% instances), ccomp (5; 0% instances), remnant (2; 0% instances), reparandum (2; 0% instances), vocative (2; 0% instances), compound:prt (1; 0% instances), foreign (1; 0% instances), iobj (1; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: DET (8222; 29% instances), ADP (5497; 20% instances), NOUN (3872; 14% instances), ADJ (3178; 11% instances), VERB (2285; 8% instances), PUNCT (1432; 5% instances), CONJ (827; 3% instances), PRON (756; 3% instances), ADV (473; 2% instances), NUM (469; 2% instances), PROPN (415; 1% instances), PART (214; 1% instances), SCONJ (162; 1% instances), AUX (135; 0% instances), X (40; 0% instances), SYM (13; 0% instances), INTJ (6; 0% instances)