home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-SynTagRus: POS Tags: PRON

There are 38 PRON lemmas (0%), 148 PRON types (0%) and 66174 PRON tokens (4%). Out of 17 observed tags, the rank of PRON is: 13 in number of lemmas, 9 in number of types and 7 in number of tokens.

The 10 most frequent PRON lemmas: он, я, это, они, то, мы, она, что, себя, всё

The 10 most frequent PRON types: он, это, я, мы, что, они, его, она, все, то

The 10 most frequent ambiguous lemmas: это (PRON 7418, PART 1016), то (PRON 5655, SCONJ 1543, PART 319, CCONJ 4), что (SCONJ 10960, PRON 4034, ADV 15, PART 2), вы (PRON 1632, X 1), все (PRON 826, PART 746, ADV 1), что-то (PRON 548, ADV 12), друг (PRON 498, NOUN 318, ADJ 1), многие (PRON 216, ADJ 5), многое (PRON 192, NOUN 23), нечто (PRON 102, VERB 1)

The 10 most frequent ambiguous types: это (PRON 3737, PART 961, DET 568), что (SCONJ 10907, PRON 2419, ADV 11, PART 1), его (DET 2183, PRON 2063), все (DET 1667, PRON 1487, PART 677), то (SCONJ 1515, PRON 1398, DET 381, PART 319, CCONJ 4), их (PRON 1531, DET 1190), того (PRON 1312, DET 245, PART 1), том (PRON 1206, DET 546, NOUN 13), ее (PRON 1108, DET 898), этом (PRON 1061, DET 656, PART 1)

Morphology

The form / lemma ratio of PRON is 3.894737 (the average of all parts of speech is 2.668075).

The 1st highest number of forms (9) was observed with the lemma “он”: его, ему, им, него, нем, нему, ним, нём, он.

The 2nd highest number of forms (9) was observed with the lemma “она”: ее, ей, ею, её, нее, ней, нею, неё, она.

The 3rd highest number of forms (9) was observed with the lemma “оно”: его, ему, им, него, нем, нему, ним, нём, оно.

PRON occurs with 10 features: PronType (66174; 100% instances), Case (66142; 100% instances), Number (62915; 95% instances), Gender (39232; 59% instances), Person (38523; 58% instances), Animacy (24783; 37% instances), Reflex (2342; 4% instances), ExtPos (1407; 2% instances), Abbr (22; 0% instances), Typo (1; 0% instances)

PRON occurs with 37 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, ExtPos=ADP, ExtPos=ADV, ExtPos=CCONJ, ExtPos=DET, ExtPos=NOUN, ExtPos=PART, ExtPos=PRON, ExtPos=SCONJ, ExtPos=VERB, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Dem, PronType=Ind, PronType=Int, PronType=Int,Rel, PronType=Neg, PronType=Prs, PronType=Rcp, PronType=Rel, PronType=Tot, Reflex=Yes, Typo=Yes

PRON occurs with 207 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing|Person=3|PronType=Prs (5675 tokens). Examples: он

Relations

PRON nodes are attached to their parents using 33 different relations: nsubj (29537; 45% instances), obl (12802; 19% instances), obj (8382; 13% instances), iobj (5291; 8% instances), nmod (3617; 5% instances), root (1274; 2% instances), nsubj:pass (1105; 2% instances), fixed (929; 1% instances), conj (878; 1% instances), parataxis (614; 1% instances), cc (529; 1% instances), advmod (209; 0% instances), parataxis:discourse (159; 0% instances), mark (138; 0% instances), ccomp (122; 0% instances), appos (115; 0% instances), obl:agent (91; 0% instances), advcl (88; 0% instances), orphan (87; 0% instances), expl (38; 0% instances), acl (29; 0% instances), amod (25; 0% instances), acl:relcl (24; 0% instances), xcomp (21; 0% instances), csubj (17; 0% instances), det (17; 0% instances), discourse (15; 0% instances), flat:name (12; 0% instances), dislocated (3; 0% instances), flat (2; 0% instances), nsubj:outer (2; 0% instances), case (1; 0% instances), obl:tmod (1; 0% instances)

Parents of PRON nodes belong to 17 different parts of speech: VERB (48431; 73% instances), NOUN (7406; 11% instances), ADJ (4692; 7% instances), ADV (1770; 3% instances), (1274; 2% instances), PRON (1058; 2% instances), DET (530; 1% instances), ADP (314; 0% instances), NUM (254; 0% instances), PROPN (196; 0% instances), PART (146; 0% instances), CCONJ (54; 0% instances), X (18; 0% instances), SCONJ (13; 0% instances), SYM (12; 0% instances), INTJ (4; 0% instances), AUX (2; 0% instances)

43539 (66%) PRON nodes are leaves.

15027 (23%) PRON nodes have one child.

4591 (7%) PRON nodes have two children.

3017 (5%) PRON nodes have three or more children.

The highest child degree of a PRON node is 13.

Children of PRON nodes are attached using 38 different relations: case (13864; 38% instances), punct (5339; 15% instances), advmod (3274; 9% instances), acl (3040; 8% instances), fixed (1751; 5% instances), nsubj (1410; 4% instances), det (1312; 4% instances), cc (977; 3% instances), conj (904; 2% instances), parataxis (756; 2% instances), nmod (675; 2% instances), amod (535; 1% instances), acl:relcl (435; 1% instances), advcl (391; 1% instances), cop (307; 1% instances), mark (239; 1% instances), orphan (235; 1% instances), appos (209; 1% instances), obl:pronmod (134; 0% instances), obl (117; 0% instances), expl (63; 0% instances), vocative (39; 0% instances), iobj (33; 0% instances), discourse (31; 0% instances), nummod:gov (30; 0% instances), csubj (23; 0% instances), aux (13; 0% instances), nummod (10; 0% instances), ccomp (8; 0% instances), dislocated (7; 0% instances), flat:name (6; 0% instances), parataxis:discourse (6; 0% instances), obj (5; 0% instances), flat (4; 0% instances), obl:tmod (2; 0% instances), compound (1; 0% instances), dep (1; 0% instances), obl:float (1; 0% instances)

Children of PRON nodes belong to 17 different parts of speech: ADP (13867; 38% instances), PUNCT (5339; 15% instances), VERB (4327; 12% instances), PART (2912; 8% instances), NOUN (2548; 7% instances), ADV (1463; 4% instances), DET (1427; 4% instances), ADJ (1247; 3% instances), PRON (1058; 3% instances), CCONJ (958; 3% instances), AUX (332; 1% instances), SCONJ (329; 1% instances), PROPN (234; 1% instances), NUM (129; 0% instances), INTJ (7; 0% instances), X (6; 0% instances), SYM (4; 0% instances)