home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Taiga: POS Tags: PRON

There are 67 PRON lemmas (0%), 222 PRON types (0%) and 88826 PRON tokens (5%). Out of 17 observed tags, the rank of PRON is: 14 in number of lemmas, 11 in number of types and 6 in number of tokens.

The 10 most frequent PRON lemmas: я, он, она, они, мы, это, что, ты, то, вы

The 10 most frequent PRON types: я, он, это, она, что, мы, они, ты, мне, меня

The 10 most frequent ambiguous lemmas: я (PRON 14750, NOUN 5, X 4), он (PRON 13155, X 4), она (PRON 9390, DET 1), они (PRON 7030, X 2), это (PRON 6178, PART 1170, DET 2), что (SCONJ 8265, PRON 5112, ADV 79, PART 8), то (PRON 4713, SCONJ 990, PART 728, CCONJ 602, X 2, DET 1), вы (PRON 3259, X 2), всё (PRON 2778, ADV 754, DET 1), все (PRON 1004, PART 10, DET 2)

The 10 most frequent ambiguous types: я (PRON 5970, NOUN 6, X 4, SCONJ 1), он (PRON 5061, X 4), это (PRON 3287, PART 1038, DET 866), что (SCONJ 8203, PRON 3150, ADV 36, PART 6, X 1), они (PRON 2381, X 2), его (DET 3371, PRON 2174, X 2), все (DET 1775, PRON 1387, ADV 521, PART 12), вы (PRON 1287, X 2), то (PRON 1530, SCONJ 973, PART 727, CCONJ 547, DET 421, X 130, ADV 1), их (DET 1526, PRON 1376, X 2, CCONJ 1)

Morphology

The form / lemma ratio of PRON is 3.313433 (the average of all parts of speech is 2.706171).

The 1st highest number of forms (15) was observed with the lemma “она”: еë, ее, ей, ею, её, нее, ней, нею, нея, неё, нё, оеа, она, ёй, ёё.

The 2nd highest number of forms (13) was observed with the lemma “он”: Эго, его, ему, им, нëм, него, нем, нему, ним, нём, он, она, от.

The 3rd highest number of forms (13) was observed with the lemma “что”: Чам, Что-о, сто, сём, че, чего, чем, чему, что, что́, чьто, чём, што.

PRON occurs with 11 features: PronType (88826; 100% instances), Case (88081; 99% instances), Number (84719; 95% instances), Person (60179; 68% instances), Gender (47131; 53% instances), Animacy (24540; 28% instances), Reflex (2542; 3% instances), ExtPos (2104; 2% instances), Abbr (745; 1% instances), Typo (100; 0% instances), Clitic (14; 0% instances)

PRON occurs with 40 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Par, Clitic=Yes, ExtPos=ADP, ExtPos=ADV, ExtPos=CCONJ, ExtPos=DET, ExtPos=NOUN, ExtPos=PART, ExtPos=PRON, ExtPos=SCONJ, ExtPos=VERB, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Dem, PronType=Emp, PronType=Exc, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rcp, PronType=Rel, PronType=Tot, Reflex=Yes, Typo=Yes

PRON occurs with 223 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing|Person=1|PronType=Prs (9159 tokens). Examples: я

Relations

PRON nodes are attached to their parents using 41 different relations: nsubj (40749; 46% instances), obl (13600; 15% instances), obj (13109; 15% instances), iobj (9226; 10% instances), root (2401; 3% instances), nmod (1832; 2% instances), conj (1638; 2% instances), fixed (1439; 2% instances), nsubj:pass (1014; 1% instances), mark (726; 1% instances), obl:agent (420; 0% instances), parataxis:discourse (416; 0% instances), parataxis (340; 0% instances), obl:tmod (310; 0% instances), ccomp (278; 0% instances), advmod (246; 0% instances), det (223; 0% instances), orphan (184; 0% instances), appos (169; 0% instances), expl (93; 0% instances), advcl (68; 0% instances), xcomp (60; 0% instances), flat (49; 0% instances), cc (44; 0% instances), reparandum (29; 0% instances), csubj (26; 0% instances), acl (25; 0% instances), dislocated (25; 0% instances), acl:relcl (24; 0% instances), list (20; 0% instances), cop (8; 0% instances), flat:name (7; 0% instances), obl:float (6; 0% instances), obl:pronmod (6; 0% instances), vocative (5; 0% instances), case (3; 0% instances), nsubj:outer (3; 0% instances), dep (2; 0% instances), csubj:outer (1; 0% instances), discourse (1; 0% instances), obl:depict (1; 0% instances)

Parents of PRON nodes belong to 17 different parts of speech: VERB (67062; 75% instances), NOUN (7279; 8% instances), ADJ (6096; 7% instances), (2401; 3% instances), PRON (1938; 2% instances), ADV (1615; 2% instances), DET (807; 1% instances), PROPN (398; 0% instances), ADP (307; 0% instances), PART (278; 0% instances), CCONJ (237; 0% instances), NUM (186; 0% instances), AUX (66; 0% instances), X (59; 0% instances), INTJ (56; 0% instances), SCONJ (36; 0% instances), SYM (5; 0% instances)

61263 (69%) PRON nodes are leaves.

19171 (22%) PRON nodes have one child.

4279 (5%) PRON nodes have two children.

4113 (5%) PRON nodes have three or more children.

The highest child degree of a PRON node is 13.

Children of PRON nodes are attached using 43 different relations: case (16584; 37% instances), punct (7531; 17% instances), advmod (3543; 8% instances), fixed (2247; 5% instances), acl (2211; 5% instances), nsubj (1967; 4% instances), cc (1688; 4% instances), det (1463; 3% instances), conj (1200; 3% instances), nmod (846; 2% instances), parataxis (799; 2% instances), amod (677; 2% instances), appos (541; 1% instances), acl:relcl (414; 1% instances), vocative (414; 1% instances), orphan (340; 1% instances), cop (318; 1% instances), advcl (260; 1% instances), mark (227; 1% instances), discourse (209; 0% instances), parataxis:discourse (172; 0% instances), dislocated (135; 0% instances), obl:pronmod (126; 0% instances), obl (92; 0% instances), expl (78; 0% instances), iobj (77; 0% instances), goeswith (66; 0% instances), nummod (62; 0% instances), nummod:gov (52; 0% instances), csubj (37; 0% instances), reparandum (28; 0% instances), obl:tmod (26; 0% instances), list (18; 0% instances), aux (17; 0% instances), compound (6; 0% instances), obj (5; 0% instances), ccomp (4; 0% instances), obl:float (4; 0% instances), dep (3; 0% instances), flat:name (2; 0% instances), nsubj:outer (2; 0% instances), csubj:outer (1; 0% instances), flat (1; 0% instances)

Children of PRON nodes belong to 17 different parts of speech: ADP (16626; 37% instances), PUNCT (7531; 17% instances), VERB (4106; 9% instances), PART (3208; 7% instances), NOUN (3116; 7% instances), PRON (1938; 4% instances), ADJ (1806; 4% instances), CCONJ (1670; 4% instances), DET (1605; 4% instances), ADV (1150; 3% instances), PROPN (711; 2% instances), AUX (353; 1% instances), SCONJ (349; 1% instances), NUM (171; 0% instances), X (80; 0% instances), INTJ (59; 0% instances), SYM (14; 0% instances)