
This page pertains to UD version 2.

Treebank Statistics: UD_English-ESL: POS Tags: NOUN

There is 1 NOUN lemma (6%), 1 NOUN type (6%) and 15635 NOUN tokens (16%). Out of 17 observed tags, the rank of NOUN is: 8 in number of lemmas, 8 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: _

The 10 most frequent NOUN types: _

The 10 most frequent ambiguous lemmas: _ (NOUN 15635, VERB 12250, PRON 10618, DET 10057, PUNCT 9580, ADP 8546, AUX 7363, ADJ 5857, ADV 5704, PART 3531, CCONJ 3198, SCONJ 2516, PROPN 1795, NUM 844, INTJ 80, X 68, SYM 39)

The 10 most frequent ambiguous types: _ (NOUN 15635, VERB 12250, PRON 10618, DET 10057, PUNCT 9580, ADP 8546, AUX 7363, ADJ 5857, ADV 5704, PART 3531, CCONJ 3198, SCONJ 2516, PROPN 1795, NUM 844, INTJ 80, X 68, SYM 39)
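The token share and token rank reported for NOUN can be re-derived from the per-tag counts listed above. The following is a minimal sketch, not the script that generated this page; the names `TOKENS_BY_TAG` and `share_and_rank` are hypothetical and introduced only for illustration.

```python
# Per-tag token counts copied from the ambiguous-lemma list on this page.
TOKENS_BY_TAG = {
    "NOUN": 15635, "VERB": 12250, "PRON": 10618, "DET": 10057,
    "PUNCT": 9580, "ADP": 8546, "AUX": 7363, "ADJ": 5857,
    "ADV": 5704, "PART": 3531, "CCONJ": 3198, "SCONJ": 2516,
    "PROPN": 1795, "NUM": 844, "INTJ": 80, "X": 68, "SYM": 39,
}


def share_and_rank(counts, tag):
    """Return (percentage of all tokens, frequency rank) for one tag."""
    total = sum(counts.values())
    share = 100.0 * counts[tag] / total
    # Rank 1 is the most frequent tag; ties are not treated specially here.
    rank = 1 + sum(1 for c in counts.values() if c > counts[tag])
    return share, rank


share, rank = share_and_rank(TOKENS_BY_TAG, "NOUN")
print(f"NOUN: {share:.0f}% of tokens, rank {rank} of {len(TOKENS_BY_TAG)}")
```

With the counts above, the total is 97681 tokens, so NOUN accounts for about 16% of them and ranks first, which matches the summary at the top of the page.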

Morphology

The form / lemma ratio of NOUN is 1.000000 (the average of all parts of speech is 1.000000).
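As an illustration of why the ratio is exactly 1 here, the sketch below assumes that the form / lemma ratio is the number of distinct word forms divided by the number of distinct lemmas for a tag. That definition is an assumption consistent with the counts on this page (1 type, 1 lemma), not a confirmed description of the generating script, and `form_lemma_ratio` is a hypothetical helper.

```python
def form_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas for (form, lemma) pairs."""
    if not pairs:
        raise ValueError("need at least one (form, lemma) pair")
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)


# In this treebank every form and lemma is the placeholder "_",
# so any number of NOUN tokens collapses to one form and one lemma.
masked = [("_", "_")] * 5
print(f"{form_lemma_ratio(masked):.6f}")

# Made-up contrast: two forms of one lemma plus a second lemma.
toy = [("cat", "cat"), ("cats", "cat"), ("dog", "dog")]
print(form_lemma_ratio(toy))
```

Because the word forms and lemmas are replaced by underscores in this treebank, the ratio carries no morphological information for any part of speech.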

The highest number of forms (1) was observed with the lemma “_”: _.

NOUN occurs with 1 feature: Foreign (1; 0% instances)

NOUN occurs with 1 feature-value pair: Foreign=Yes

NOUN occurs with 2 feature combinations. The most frequent feature combination is _ (15634 tokens). Examples: _

Relations

NOUN nodes are attached to their parents using 34 different relations: obl (3833; 25% instances), obj (3785; 24% instances), nsubj (2071; 13% instances), nmod (1998; 13% instances), conj (1035; 7% instances), compound (994; 6% instances), root (510; 3% instances), obl:tmod (265; 2% instances), nsubj:pass (197; 1% instances), advcl (146; 1% instances), obl:npmod (145; 1% instances), ccomp (122; 1% instances), nmod:poss (119; 1% instances), appos (116; 1% instances), parataxis (64; 0% instances), xcomp (58; 0% instances), acl:relcl (50; 0% instances), fixed (31; 0% instances), nmod:npmod (17; 0% instances), goeswith (16; 0% instances), iobj (14; 0% instances), acl (10; 0% instances), list (9; 0% instances), amod (7; 0% instances), csubj (4; 0% instances), csubj:pass (3; 0% instances), discourse (3; 0% instances), nummod (3; 0% instances), vocative (3; 0% instances), dislocated (2; 0% instances), mark (2; 0% instances), case (1; 0% instances), det (1; 0% instances), flat (1; 0% instances)

Parents of NOUN nodes belong to 16 different parts of speech: VERB (9213; 59% instances), NOUN (4274; 27% instances), ADJ (1043; 7% instances), artificial root (510; 3% instances), ADV (152; 1% instances), PROPN (131; 1% instances), PRON (113; 1% instances), NUM (99; 1% instances), ADP (33; 0% instances), DET (29; 0% instances), X (13; 0% instances), AUX (10; 0% instances), SYM (8; 0% instances), SCONJ (5; 0% instances), CCONJ (1; 0% instances), INTJ (1; 0% instances)

1916 (12%) NOUN nodes are leaves.

4700 (30%) NOUN nodes have one child.

5073 (32%) NOUN nodes have two children.

3946 (25%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 18.
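The breakdown above (leaves, one child, two children, three or more, and the highest child degree) can be computed from a CoNLL-U file by counting how often each node ID appears in the HEAD column. The sketch below is one way to do that under stated assumptions: `child_degree_buckets` is a hypothetical helper, the four-token sentence is made up for illustration, and the code is not the script that produced this page.

```python
from collections import Counter


def child_degree_buckets(conllu_text, upos="NOUN"):
    """Bucket nodes tagged `upos` by number of children: 0, 1, 2, 3 (= 3+)."""
    buckets = Counter()
    max_degree = 0
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # Keep plain word lines; skip multiword ranges (1-2) and empty nodes (1.1).
        rows = [r for r in rows if r[0].isdigit()]
        # HEAD is column 7 (index 6): each occurrence is one child of that ID.
        degree = Counter(r[6] for r in rows)
        for r in rows:
            if r[3] == upos:  # UPOS is column 4 (index 3)
                d = degree[r[0]]
                max_degree = max(max_degree, d)
                buckets[min(d, 3)] += 1
    return buckets, max_degree


# Made-up sentence with masked forms and lemmas, as in this treebank.
toy_rows = [
    ("1", "_", "_", "DET", "_", "_", "2", "det", "_", "_"),
    ("2", "_", "_", "NOUN", "_", "_", "3", "nsubj", "_", "_"),
    ("3", "_", "_", "VERB", "_", "_", "0", "root", "_", "_"),
    ("4", "_", "_", "NOUN", "_", "_", "3", "obj", "_", "_"),
]
toy_text = "\n".join("\t".join(row) for row in toy_rows)

buckets, max_degree = child_degree_buckets(toy_text)
print(dict(buckets), max_degree)
```

In the toy sentence, the first NOUN has one child (its determiner) and the second NOUN is a leaf. On this page the four buckets sum to 1916 + 4700 + 5073 + 3946 = 15635, the total number of NOUN tokens.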

Children of NOUN nodes are attached using 39 different relations: det (6962; 23% instances), case (6233; 20% instances), amod (3386; 11% instances), nmod (2329; 8% instances), nmod:poss (2072; 7% instances), punct (1593; 5% instances), conj (1102; 4% instances), compound (1049; 3% instances), cop (914; 3% instances), cc (855; 3% instances), acl:relcl (825; 3% instances), nsubj (787; 3% instances), advmod (537; 2% instances), nummod (463; 1% instances), acl (462; 1% instances), mark (225; 1% instances), det:predet (186; 1% instances), advcl (167; 1% instances), aux (150; 0% instances), appos (145; 0% instances), obl (120; 0% instances), parataxis (93; 0% instances), expl (58; 0% instances), csubj (52; 0% instances), goeswith (34; 0% instances), nmod:npmod (25; 0% instances), obj (16; 0% instances), obl:tmod (16; 0% instances), xcomp (15; 0% instances), cc:preconj (14; 0% instances), list (13; 0% instances), discourse (6; 0% instances), ccomp (5; 0% instances), reparandum (2; 0% instances), vocative (2; 0% instances), compound:prt (1; 0% instances), flat (1; 0% instances), iobj (1; 0% instances), orphan (1; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: DET (9111; 29% instances), ADP (6025; 19% instances), NOUN (4274; 14% instances), ADJ (3545; 11% instances), VERB (1624; 5% instances), PUNCT (1592; 5% instances), AUX (1068; 3% instances), CCONJ (859; 3% instances), PRON (843; 3% instances), ADV (513; 2% instances), NUM (506; 2% instances), PROPN (465; 2% instances), PART (246; 1% instances), SCONJ (181; 1% instances), X (45; 0% instances), SYM (14; 0% instances), INTJ (6; 0% instances)