Treebank Statistics: UD_English-Pronouns: POS Tags: NOUN
There are 4 NOUN lemmas (6%), 6 NOUN types (8%) and 240 NOUN tokens (14%).
Out of 13 observed tags, the rank of NOUN is: 5 in number of lemmas, 4 in number of types and 4 in number of tokens.
The 10 most frequent NOUN lemmas: dealer, car, paint, bump
The 10 most frequent NOUN types: dealer, car, dealers, cars, paint, bumps
The 10 most frequent ambiguous lemmas: paint (NOUN 10, VERB 5)
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of NOUN is 1.500000 (the average of all parts of speech is 1.212121).
The 1st highest number of forms (2) was observed with the lemma “car”: car, cars.
The 2nd highest number of forms (2) was observed with the lemma “dealer”: dealer, dealers.
The 3rd highest number of forms (1) was observed with the lemma “bump”: bumps.
NOUN occurs with 1 features: Number (240; 100% instances)
NOUN occurs with 2 feature-value pairs: Number=Plur, Number=Sing
NOUN occurs with 2 feature combinations.
The most frequent feature combination is Number=Sing (175 tokens).
Examples: dealer, car, paint
Relations
NOUN nodes are attached to their parents using 3 different relations: nsubj (190; 79% instances), obj (45; 19% instances), conj (5; 2% instances)
Parents of NOUN nodes belong to 3 different parts of speech: VERB (195; 81% instances), PRON (40; 17% instances), NOUN (5; 2% instances)
40 (17%) NOUN nodes are leaves.
160 (67%) NOUN nodes have one child.
30 (13%) NOUN nodes have two children.
10 (4%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 4.
Children of NOUN nodes are attached using 9 different relations: det (185; 73% instances), nmod (20; 8% instances), amod (10; 4% instances), case (10; 4% instances), punct (10; 4% instances), appos (5; 2% instances), cc (5; 2% instances), conj (5; 2% instances), nsubj (5; 2% instances)
Children of NOUN nodes belong to 7 different parts of speech: DET (185; 73% instances), PRON (30; 12% instances), ADJ (10; 4% instances), PART (10; 4% instances), PUNCT (10; 4% instances), CCONJ (5; 2% instances), NOUN (5; 2% instances)