This page pertains to UD version 2.

Treebank Statistics: UD_English-Pronouns: POS Tags: NOUN

There are 4 NOUN lemmas (6%), 6 NOUN types (8%) and 240 NOUN tokens (14%). Out of 13 observed tags, the rank of NOUN is: 5 in number of lemmas, 4 in number of types and 4 in number of tokens.

The 10 most frequent NOUN lemmas: dealer, car, paint, bump

The 10 most frequent NOUN types: dealer, car, dealers, cars, paint, bumps

The 10 most frequent ambiguous lemmas: paint (NOUN 10, VERB 5)

The 10 most frequent ambiguous types: none observed.

Morphology

The form / lemma ratio of NOUN is 1.500000 (the average of all parts of speech is 1.212121).
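The ratio above is consistent with dividing the number of distinct NOUN forms (6 types) by the number of distinct NOUN lemmas (4). The following is a minimal sketch of that arithmetic, not the script that generated this page. The function name `form_lemma_ratio` is hypothetical, and the form-to-lemma pairs are taken from the lists on this page.

```python
def form_lemma_ratio(pairs):
    """Return distinct forms divided by distinct lemmas.

    `pairs` is an iterable of (form, lemma) tuples. Counting by distinct
    form and distinct lemma is an assumption about how the page's
    statistic is defined.
    """
    pairs = list(pairs)
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)


# NOUN forms and lemmas as listed on this page.
noun_pairs = [
    ("dealer", "dealer"),
    ("dealers", "dealer"),
    ("car", "car"),
    ("cars", "car"),
    ("paint", "paint"),
    ("bumps", "bump"),
]

ratio = form_lemma_ratio(noun_pairs)
print(f"{ratio:.6f}")
```

With the six forms and four lemmas listed here, the sketch gives 6 / 4 = 1.5, matching the reported value.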

The 1st highest number of forms (2) was observed with the lemma “car”: car, cars.

The 2nd highest number of forms (2) was observed with the lemma “dealer”: dealer, dealers.

The 3rd highest number of forms (1) was observed with the lemma “bump”: bumps.

NOUN occurs with 1 feature: Number (240; 100% instances)

NOUN occurs with 2 feature-value pairs: Number=Plur, Number=Sing

NOUN occurs with 2 feature combinations. The most frequent feature combination is Number=Sing (175 tokens). Examples: dealer, car, paint

Relations

NOUN nodes are attached to their parents using 3 different relations: nsubj (190; 79% instances), obj (45; 19% instances), conj (5; 2% instances)

Parents of NOUN nodes belong to 3 different parts of speech: VERB (195; 81% instances), PRON (40; 17% instances), NOUN (5; 2% instances)
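The percentages in the two statements above can be reproduced from the raw counts. This is a small sketch assuming each share is the count divided by the total and rounded to the nearest whole percent. The helper name `percent_shares` is hypothetical; the counts are copied from this page.

```python
def percent_shares(counts):
    """Map each label to its share of the total, as a rounded percentage."""
    total = sum(counts.values())
    return {label: round(100 * n / total) for label, n in counts.items()}


# Relations that attach NOUN nodes to their parents (counts from this page).
relation_counts = {"nsubj": 190, "obj": 45, "conj": 5}

# Parts of speech of the parents of NOUN nodes (counts from this page).
parent_pos_counts = {"VERB": 195, "PRON": 40, "NOUN": 5}

relation_shares = percent_shares(relation_counts)
parent_pos_shares = percent_shares(parent_pos_counts)

print(relation_shares)
print(parent_pos_shares)
```

Both sets of counts sum to 240, the number of NOUN tokens, so every NOUN node is accounted for exactly once in each breakdown.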

40 (17%) NOUN nodes are leaves.

160 (67%) NOUN nodes have one child.

30 (13%) NOUN nodes have two children.

10 (4%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 4.
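The child degree of a node is the number of tokens whose head points at it. The sketch below shows one way to compute it for NOUN nodes, assuming tokens are given as (id, form, UPOS, head) tuples in the style of CoNLL-U columns. The sentence is a made-up toy example, not drawn from the treebank, and the function name `noun_child_degrees` is hypothetical.

```python
from collections import Counter

# Toy sentence (invented for illustration): "The dealers painted the car ."
# Each tuple is (id, form, upos, head); head 0 marks the root.
toy_sentence = [
    (1, "The", "DET", 2),
    (2, "dealers", "NOUN", 3),
    (3, "painted", "VERB", 0),
    (4, "the", "DET", 5),
    (5, "car", "NOUN", 3),
    (6, ".", "PUNCT", 3),
]


def noun_child_degrees(tokens):
    """Return the number of children of each NOUN node, in sentence order."""
    # Count how many tokens name each id as their head.
    children_per_head = Counter(head for _, _, _, head in tokens)
    # A Counter yields 0 for ids that never appear as a head (leaves).
    return [children_per_head[tid] for tid, _, upos, _ in tokens if upos == "NOUN"]


degrees = noun_child_degrees(toy_sentence)
histogram = Counter(degrees)  # maps child degree to number of NOUN nodes
highest = max(degrees)

print(degrees, dict(histogram), highest)
```

Aggregating such a histogram over all sentences would give the leaf, one-child, two-children and three-or-more buckets reported above.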

Children of NOUN nodes are attached using 9 different relations: det (185; 73% instances), nmod (20; 8% instances), amod (10; 4% instances), case (10; 4% instances), punct (10; 4% instances), appos (5; 2% instances), cc (5; 2% instances), conj (5; 2% instances), nsubj (5; 2% instances)

Children of NOUN nodes belong to 7 different parts of speech: DET (185; 73% instances), PRON (30; 12% instances), ADJ (10; 4% instances), PART (10; 4% instances), PUNCT (10; 4% instances), CCONJ (5; 2% instances), NOUN (5; 2% instances)