
This page pertains to UD version 2.

Treebank Statistics: UD_Russian-SynTagRus: POS Tags: PRON

There are 38 PRON lemmas (0%), 149 PRON types (0%) and 66151 PRON tokens (4%). Out of 17 observed tags, PRON ranks 13th in number of lemmas, 9th in number of types and 7th in number of tokens.
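Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: it assumes that a "type" is a distinct word form exactly as written (no case folding), and it uses a tiny hand-made fragment (`SAMPLE`, hypothetical) in place of the treebank. The helper name `count_upos` is also an assumption.

```python
# Tiny hand-made CoNLL-U fragment (hypothetical, not taken from the treebank).
SAMPLE = """\
1\tОн\tон\tPRON\t_\tCase=Nom|Gender=Masc|Number=Sing|Person=3|PronType=Prs\t2\tnsubj\t_\t_
2\tвидел\tвидеть\tVERB\t_\t_\t0\troot\t_\t_
3\tего\tон\tPRON\t_\tCase=Acc|Gender=Masc|Number=Sing|Person=3|PronType=Prs\t2\tobj\t_\t_
4\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_
"""


def count_upos(conllu_text, upos):
    """Return (number of lemmas, number of types, number of tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)  # assumption: forms are compared as written
            tokens += 1
    return len(lemmas), len(types), tokens


result = count_upos(SAMPLE, "PRON")
print(result)  # (1, 2, 2)
```

On the real treebank the same function would be applied to the concatenated text of the data files; whether the official statistics fold case when counting types is not stated here.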

The 10 most frequent PRON lemmas: он, я, это, они, то, мы, она, что, себя, всё

The 10 most frequent PRON types: он, это, я, мы, что, они, его, она, все, то

The 10 most frequent ambiguous lemmas: это (PRON 7416, PART 1018), то (PRON 5654, SCONJ 1544, PART 319, CCONJ 4), что (SCONJ 10961, PRON 4009, ADV 38, PART 2), всё (PRON 2227, ADV 6), вы (PRON 1632, X 1), все (PRON 826, PART 745), что-то (PRON 548, ADV 12), друг (PRON 519, NOUN 297, ADJ 1), многие (PRON 216, ADJ 5), многое (PRON 192, NOUN 23)

The 10 most frequent ambiguous types: это (PRON 3737, PART 961, DET 568), что (SCONJ 10908, PRON 2407, ADV 22, PART 1), его (DET 2184, PRON 2062), все (DET 1669, PRON 1481, PART 676, ADV 5), то (SCONJ 1516, PRON 1397, DET 381, PART 319, CCONJ 4), их (PRON 1531, DET 1190), того (PRON 1312, DET 245, PART 1), том (PRON 1206, DET 546, NOUN 13), ее (PRON 1108, DET 898), этом (PRON 1061, DET 656, PART 1)

Morphology

The form / lemma ratio of PRON is 3.921053 (the average of all parts of speech is 2.668831).
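The reported ratio is consistent with dividing the number of PRON types by the number of PRON lemmas given above. A quick arithmetic check, assuming that is how the figure is derived:

```python
# Figures taken from this page.
pron_types = 149
pron_lemmas = 38

# Assumption: form / lemma ratio = distinct word forms / distinct lemmas.
ratio = pron_types / pron_lemmas
print(f"{ratio:.6f}")  # 3.921053
```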

The three lemmas with the most forms each have 9 forms:

“он”: его, ему, им, него, нем, нему, ним, нём, он.

“она”: ее, ей, ею, её, нее, ней, нею, неё, она.

“оно”: его, ему, им, него, нем, нему, ним, нём, оно.

PRON occurs with 10 features: PronType (66151; 100% instances), Case (66119; 100% instances), Number (62879; 95% instances), Gender (39190; 59% instances), Person (38522; 58% instances), Animacy (24746; 37% instances), Reflex (2335; 4% instances), ExtPos (1487; 2% instances), Abbr (22; 0% instances), Typo (1; 0% instances)

PRON occurs with 37 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, ExtPos=ADP, ExtPos=ADV, ExtPos=CCONJ, ExtPos=DET, ExtPos=NOUN, ExtPos=PART, ExtPos=PRON, ExtPos=SCONJ, ExtPos=VERB, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Dem, PronType=Ind, PronType=Int, PronType=Int,Rel, PronType=Neg, PronType=Prs, PronType=Rcp, PronType=Rel, PronType=Tot, Reflex=Yes, Typo=Yes

PRON occurs with 212 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing|Person=3|PronType=Prs (5675 tokens). Examples: он
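The three feature statistics above (features, feature-value pairs, feature combinations) can all be derived from the FEATS column of a CoNLL-U file. The sketch below is an illustration under assumptions: the `feats_column` list is a hypothetical sample, `parse_feats` is a helper introduced here, and a multi-valued feature such as `PronType=Int,Rel` is kept as a single pair, as it is listed on this page.

```python
from collections import Counter


def parse_feats(feats):
    """Split a FEATS string such as 'Case=Nom|Number=Sing' into a dict."""
    if feats == "_":
        return {}  # '_' marks an empty feature set in CoNLL-U
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# Hypothetical FEATS values of three PRON tokens.
feats_column = [
    "Case=Nom|Gender=Masc|Number=Sing|Person=3|PronType=Prs",
    "Case=Nom|Gender=Masc|Number=Sing|Person=3|PronType=Prs",
    "Case=Nom|PronType=Int,Rel",
]

feature_counts = Counter()             # how many tokens carry each feature
pair_counts = Counter()                # how many tokens carry each feature=value pair
combo_counts = Counter(feats_column)   # whole FEATS strings, i.e. feature combinations

for feats in feats_column:
    for name, value in parse_feats(feats).items():
        feature_counts[name] += 1
        pair_counts[f"{name}={value}"] += 1

most_frequent_combo, combo_tokens = combo_counts.most_common(1)[0]
```

With this sample, `most_frequent_combo` is the `Case=Nom|Gender=Masc|Number=Sing|Person=3|PronType=Prs` combination, which is also the most frequent one reported above.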

Relations

PRON nodes are attached to their parents using 35 different relations: nsubj (29531; 45% instances), obl (10838; 16% instances), obj (9161; 14% instances), iobj (5674; 9% instances), nmod (3492; 5% instances), root (1278; 2% instances), nsubj:pass (1105; 2% instances), fixed (950; 1% instances), conj (882; 1% instances), parataxis:discourse (553; 1% instances), obl:tmod (546; 1% instances), cc (529; 1% instances), obl:agent (312; 0% instances), parataxis (222; 0% instances), advmod (220; 0% instances), mark (178; 0% instances), ccomp (158; 0% instances), appos (120; 0% instances), orphan (87; 0% instances), advcl (80; 0% instances), xcomp (51; 0% instances), expl (38; 0% instances), acl (31; 0% instances), acl:relcl (25; 0% instances), amod (24; 0% instances), csubj (22; 0% instances), det (17; 0% instances), flat:name (11; 0% instances), dislocated (3; 0% instances), flat (3; 0% instances), obl:float (3; 0% instances), obl:pronmod (3; 0% instances), nsubj:outer (2; 0% instances), case (1; 0% instances), vocative (1; 0% instances)

Parents of PRON nodes fall into 17 different categories (16 parts of speech, plus the artificial root for nodes that have no parent): VERB (48655; 74% instances), NOUN (7395; 11% instances), ADJ (4701; 7% instances), ADV (1507; 2% instances), no parent, i.e. attached to the artificial root (1278; 2% instances), PRON (1071; 2% instances), DET (531; 1% instances), ADP (314; 0% instances), NUM (255; 0% instances), PROPN (190; 0% instances), PART (154; 0% instances), CCONJ (54; 0% instances), X (15; 0% instances), SCONJ (13; 0% instances), SYM (12; 0% instances), INTJ (4; 0% instances), AUX (2; 0% instances)
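Both the relation counts and the parent part-of-speech counts come from the HEAD and DEPREL columns. The sketch below shows one way to tally them, assuming each sentence is already available as a list of `(id, upos, head, deprel)` tuples; the sample sentences, the function name `attachment_stats` and the `ROOT` placeholder for nodes whose head is 0 are all hypothetical.

```python
from collections import Counter

# Hypothetical sentences as (id, upos, head, deprel) tuples.
sentences = [
    [
        (1, "PRON", 2, "nsubj"),
        (2, "VERB", 0, "root"),
        (3, "PRON", 2, "obj"),
    ],
    [
        (1, "PRON", 0, "root"),
    ],
]


def attachment_stats(sents, upos):
    """Count incoming relations and parent UPOS tags for all nodes tagged `upos`."""
    relations, parent_pos = Counter(), Counter()
    for sent in sents:
        upos_by_id = {node_id: node_upos for node_id, node_upos, _, _ in sent}
        for _, node_upos, head, deprel in sent:
            if node_upos != upos:
                continue
            relations[deprel] += 1
            # Head 0 is the artificial root, which has no UPOS tag of its own.
            parent_pos[upos_by_id.get(head, "ROOT")] += 1
    return relations, parent_pos


relations, parent_pos = attachment_stats(sentences, "PRON")
```

The number of nodes counted under `ROOT` always equals the number of nodes attached with the `root` relation, which matches the two figures of 1278 reported above.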

43568 (66%) PRON nodes are leaves.

15026 (23%) PRON nodes have one child.

4520 (7%) PRON nodes have two children.

3037 (5%) PRON nodes have three or more children.

The highest child degree of a PRON node is 13.
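The child-degree figures above can be cross-checked and reproduced as follows. This is a sketch under assumptions: the sentence is a hypothetical list of `(id, head, upos)` tuples and `child_degree_histogram` is a helper introduced here. The first part only re-derives the percentages from the counts reported on this page.

```python
from collections import Counter

# Counts reported on this page, keyed by number of children.
reported = {"0": 43568, "1": 15026, "2": 4520, "3+": 3037}
total = sum(reported.values())  # equals the 66151 PRON tokens
percentages = {k: round(100 * v / total) for k, v in reported.items()}

# Hypothetical sentence as (id, head, upos) tuples.
sentence = [
    (1, 3, "ADP"),
    (2, 3, "DET"),
    (3, 4, "PRON"),
    (4, 0, "VERB"),
    (5, 4, "PRON"),
    (6, 4, "PUNCT"),
]


def child_degree_histogram(sents, upos):
    """Map number of children -> number of nodes tagged `upos` with that many children."""
    hist = Counter()
    for sent in sents:
        n_children = Counter(head for _, head, _ in sent)
        for node_id, _, node_upos in sent:
            if node_upos == upos:
                hist[n_children[node_id]] += 1  # a missing key counts as 0 (a leaf)
    return hist


hist = child_degree_histogram([sentence], "PRON")
highest_degree = max(hist)
```

In the sample, node 3 has two children and node 5 is a leaf, so `hist` holds one node for each of those degrees and `highest_degree` is 2.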

Children of PRON nodes are attached using 38 different relations: case (13957; 39% instances), punct (5260; 15% instances), advmod (3235; 9% instances), acl (3040; 8% instances), fixed (1752; 5% instances), nsubj (1410; 4% instances), det (1313; 4% instances), cc (986; 3% instances), conj (906; 3% instances), nmod (706; 2% instances), amod (540; 1% instances), parataxis (451; 1% instances), acl:relcl (438; 1% instances), advcl (390; 1% instances), cop (308; 1% instances), orphan (242; 1% instances), mark (240; 1% instances), appos (215; 1% instances), parataxis:discourse (191; 1% instances), obl:pronmod (162; 0% instances), expl (64; 0% instances), vocative (62; 0% instances), obl (50; 0% instances), iobj (33; 0% instances), nummod:gov (32; 0% instances), discourse (31; 0% instances), csubj (23; 0% instances), aux (13; 0% instances), nummod (13; 0% instances), obl:tmod (13; 0% instances), dislocated (10; 0% instances), ccomp (8; 0% instances), obj (5; 0% instances), flat (4; 0% instances), flat:name (4; 0% instances), obl:float (4; 0% instances), compound (1; 0% instances), dep (1; 0% instances)

Children of PRON nodes belong to 17 different parts of speech: ADP (13880; 38% instances), PUNCT (5260; 15% instances), VERB (4431; 12% instances), PART (2899; 8% instances), NOUN (2561; 7% instances), DET (1428; 4% instances), ADV (1297; 4% instances), ADJ (1249; 3% instances), PRON (1071; 3% instances), CCONJ (966; 3% instances), SCONJ (361; 1% instances), AUX (333; 1% instances), PROPN (233; 1% instances), NUM (128; 0% instances), INTJ (6; 0% instances), X (6; 0% instances), SYM (4; 0% instances)