home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-EWT: POS Tags: PRON

There are 81 PRON lemmas (0%), 93 PRON types (0%) and 22955 PRON tokens (9%). Out of 17 observed tags, the rank of PRON is: 12 in number of lemmas, 11 in number of types and 4 in number of tokens.

The 10 most frequent PRON lemmas: i, you, it, they, we, he, my, that, she, this

The 10 most frequent PRON types: i, you, it, they, my, we, that, he, your, me

The 10 most frequent ambiguous lemmas: i (PRON 438, NUM 5, X 2), you (PRON 3584, NOUN 1), it (PRON 2278, NOUN 10, VERB 1), we (PRON 1743, NOUN 2, VERB 1), he (PRON 1722, INTJ 2), my (PRON 1112, INTJ 2, X 2), that (SCONJ 1161, PRON 1012, DET 222, ADV 44), this (DET 905, PRON 511, ADV 5, NOUN 1), what (PRON 503, DET 79, X 1), there (PRON 469, ADV 268, X 1)

The 10 most frequent ambiguous types: i (PRON 437, X 2, NUM 1), it (PRON 1812, ADP 1, ADV 1, SCONJ 1, VERB 1), my (PRON 940, INTJ 2, X 2, AUX 1), we (PRON 734, NOUN 1, VERB 1), that (SCONJ 1153, PRON 937, DET 201, ADV 44, ADP 1), he (PRON 702, DET 1, INTJ 1), your (PRON 758, X 1), this (DET 764, PRON 369, ADV 5, NOUN 1), what (PRON 380, DET 49, VERB 1, X 1), their (PRON 458, ADV 2)

Morphology

The form / lemma ratio of PRON is 1.148148 (the average of all parts of speech is 1.185132).

The 1st highest number of forms (3) was observed with the lemma “he”: he, him, his.

The 2nd highest number of forms (3) was observed with the lemma “they”: their, them, they.

The 3rd highest number of forms (3) was observed with the lemma “we”: our, us, we.

PRON occurs with 10 features: PronType (21647; 94% instances), Person (18650; 81% instances), Number (16733; 73% instances), Case (14972; 65% instances), Gender (4815; 21% instances), Poss (3656; 16% instances), Reflex (121; 1% instances), Typo (5; 0% instances), Abbr (4; 0% instances), Definite (1; 0% instances)

PRON occurs with 20 feature-value pairs: Abbr=Yes, Case=Acc, Case=Nom, Definite=Def, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Art, PronType=Dem, PronType=Int, PronType=Prs, PronType=Rel, Reflex=Yes, Typo=Yes

PRON occurs with 51 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing|Person=1|PronType=Prs (4296 tokens). Examples: i

Relations

PRON nodes are attached to their parents using 28 different relations: nsubj (12842; 56% instances), nmod:poss (3704; 16% instances), obj (2898; 13% instances), obl (982; 4% instances), expl (733; 3% instances), nsubj:pass (555; 2% instances), iobj (390; 2% instances), nmod (373; 2% instances), conj (174; 1% instances), root (122; 1% instances), ccomp (31; 0% instances), nmod:npmod (29; 0% instances), advcl (22; 0% instances), parataxis (22; 0% instances), obl:npmod (16; 0% instances), appos (14; 0% instances), det (13; 0% instances), xcomp (8; 0% instances), acl:relcl (5; 0% instances), compound (5; 0% instances), reparandum (4; 0% instances), vocative (4; 0% instances), det:predet (3; 0% instances), csubj (2; 0% instances), acl (1; 0% instances), advmod (1; 0% instances), case (1; 0% instances), list (1; 0% instances)

Parents of PRON nodes belong to 14 different parts of speech: VERB (15384; 67% instances), NOUN (5056; 22% instances), ADJ (1617; 7% instances), ADV (181; 1% instances), PROPN (176; 1% instances), PRON (150; 1% instances), (122; 1% instances), AUX (100; 0% instances), NUM (82; 0% instances), DET (60; 0% instances), SYM (14; 0% instances), ADP (11; 0% instances), INTJ (1; 0% instances), X (1; 0% instances)

20694 (90%) PRON nodes are leaves.

1677 (7%) PRON nodes have one child.

325 (1%) PRON nodes have two children.

259 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 11.

Children of PRON nodes are attached using 34 different relations: case (1417; 42% instances), acl:relcl (314; 9% instances), punct (242; 7% instances), cop (173; 5% instances), cc (171; 5% instances), nmod (168; 5% instances), nsubj (168; 5% instances), amod (139; 4% instances), conj (137; 4% instances), advmod (104; 3% instances), acl (62; 2% instances), det (54; 2% instances), appos (42; 1% instances), advcl (35; 1% instances), mark (34; 1% instances), aux (20; 1% instances), parataxis (17; 1% instances), obl (15; 0% instances), discourse (6; 0% instances), nmod:npmod (5; 0% instances), cc:preconj (4; 0% instances), goeswith (4; 0% instances), csubj (3; 0% instances), xcomp (3; 0% instances), compound (2; 0% instances), expl (2; 0% instances), flat (2; 0% instances), aux:pass (1; 0% instances), ccomp (1; 0% instances), dislocated (1; 0% instances), nmod:poss (1; 0% instances), nummod (1; 0% instances), reparandum (1; 0% instances), vocative (1; 0% instances)

Children of PRON nodes belong to 17 different parts of speech: ADP (1336; 40% instances), VERB (413; 12% instances), NOUN (287; 9% instances), PUNCT (247; 7% instances), AUX (200; 6% instances), CCONJ (166; 5% instances), ADJ (155; 5% instances), PRON (150; 4% instances), ADV (104; 3% instances), SCONJ (95; 3% instances), PROPN (83; 2% instances), DET (62; 2% instances), PART (33; 1% instances), INTJ (7; 0% instances), NUM (6; 0% instances), X (5; 0% instances), SYM (1; 0% instances)