home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-SynTagRus: POS Tags: PRON

There are 35 PRON lemmas (0%), 144 PRON types (0%) and 72945 PRON tokens (5%). Out of 17 observed tags, the rank of PRON is: 13 in number of lemmas, 9 in number of types and 7 in number of tokens.

The 10 most frequent PRON lemmas: он, это, я, который, они, то, мы, она, что, все

The 10 most frequent PRON types: это, он, я, мы, что, они, его, она, все, то

The 10 most frequent ambiguous lemmas: это (PRON 8070, PART 364), то (PRON 5654, SCONJ 1547, PART 319), что (SCONJ 10960, PRON 4061, ADV 15, NOUN 1, PART 1), все (PRON 3056, PART 746, ADV 1, DET 1), вы (PRON 1632, X 1), что-то (PRON 548, ADV 12), многие (PRON 75, ADJ 5), прочее (PRON 34, NOUN 1), нечего (PRON 23, NOUN 12, ADV 8, VERB 5), немногие (ADJ 31, PRON 7)

The 10 most frequent ambiguous types: это (PRON 4381, DET 568, PART 317), что (SCONJ 10907, PRON 2419, ADV 11, NOUN 1), его (DET 2183, PRON 2063), все (DET 1667, PRON 1487, PART 677), то (SCONJ 1519, PRON 1398, DET 381, PART 319), их (PRON 1530, DET 1191), того (PRON 1312, DET 245, PART 1), том (PRON 1206, DET 546, NOUN 13), ее (PRON 1109, DET 897), этом (PRON 1062, DET 656)

Morphology

The form / lemma ratio of PRON is 4.114286 (the average of all parts of speech is 2.654430).

The 1st highest number of forms (12) was observed with the lemma “который”: которая, которого, которое, которой, котором, которому, которую, которые, который, которым, которыми, которых.

The 2nd highest number of forms (9) was observed with the lemma “он”: его, ему, им, него, нем, нему, ним, нём, он.

The 3rd highest number of forms (9) was observed with the lemma “она”: ее, ей, ею, её, нее, ней, нею, неё, она.

PRON occurs with 9 features: Case (72259; 99% instances), Number (59240; 81% instances), PronType (49059; 67% instances), Person (38523; 53% instances), Gender (35076; 48% instances), Animacy (19066; 26% instances), Reflex (1587; 2% instances), Abbr (22; 0% instances), Typo (1; 0% instances)

PRON occurs with 25 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Dem, PronType=Ind, PronType=Int,Rel, PronType=Neg, PronType=Prs, PronType=Tot, Reflex=Yes, Typo=Yes

PRON occurs with 202 feature combinations. The most frequent feature combination is Case=Nom|PronType=Int,Rel (4476 tokens). Examples: что, которые, кто, который, которая, которое, че

Relations

PRON nodes are attached to their parents using 32 different relations: nsubj (32282; 44% instances), obl (14292; 20% instances), obj (9226; 13% instances), iobj (5412; 7% instances), nmod (4260; 6% instances), nsubj:pass (1494; 2% instances), root (1285; 2% instances), conj (843; 1% instances), fixed (789; 1% instances), expl (689; 1% instances), parataxis (647; 1% instances), cc (512; 1% instances), advmod (206; 0% instances), mark (154; 0% instances), discourse (141; 0% instances), ccomp (120; 0% instances), appos (113; 0% instances), advcl (95; 0% instances), obl:agent (89; 0% instances), orphan (87; 0% instances), acl:relcl (74; 0% instances), amod (32; 0% instances), acl (29; 0% instances), xcomp (21; 0% instances), det (17; 0% instances), csubj (15; 0% instances), flat:name (12; 0% instances), dislocated (3; 0% instances), flat (2; 0% instances), nsubj:outer (2; 0% instances), case (1; 0% instances), obl:tmod (1; 0% instances)

Parents of PRON nodes belong to 17 different parts of speech: VERB (53669; 74% instances), NOUN (8607; 12% instances), ADJ (5198; 7% instances), ADV (1882; 3% instances), (1285; 2% instances), PRON (826; 1% instances), DET (370; 1% instances), ADP (317; 0% instances), NUM (310; 0% instances), PROPN (237; 0% instances), PART (150; 0% instances), CCONJ (54; 0% instances), SCONJ (13; 0% instances), SYM (13; 0% instances), X (8; 0% instances), INTJ (4; 0% instances), AUX (2; 0% instances)

48805 (67%) PRON nodes are leaves.

16602 (23%) PRON nodes have one child.

4476 (6%) PRON nodes have two children.

3062 (4%) PRON nodes have three or more children.

The highest child degree of a PRON node is 13.

Children of PRON nodes are attached using 35 different relations: case (15457; 41% instances), punct (5584; 15% instances), acl (3021; 8% instances), advmod (2855; 8% instances), fixed (1472; 4% instances), nsubj (1456; 4% instances), det (996; 3% instances), cc (951; 3% instances), conj (900; 2% instances), amod (881; 2% instances), parataxis (762; 2% instances), nmod (657; 2% instances), obl (475; 1% instances), acl:relcl (428; 1% instances), advcl (392; 1% instances), cop (328; 1% instances), mark (249; 1% instances), orphan (231; 1% instances), appos (210; 1% instances), discourse (98; 0% instances), expl (64; 0% instances), nummod:gov (61; 0% instances), csubj (52; 0% instances), iobj (43; 0% instances), vocative (37; 0% instances), aux (13; 0% instances), nummod (11; 0% instances), ccomp (8; 0% instances), dislocated (7; 0% instances), flat:name (6; 0% instances), obj (5; 0% instances), flat (4; 0% instances), obl:tmod (2; 0% instances), compound (1; 0% instances), dep (1; 0% instances)

Children of PRON nodes belong to 17 different parts of speech: ADP (15378; 41% instances), PUNCT (5584; 15% instances), VERB (4354; 12% instances), PART (2872; 8% instances), NOUN (2569; 7% instances), ADJ (1626; 4% instances), ADV (1480; 4% instances), DET (1047; 3% instances), CCONJ (934; 2% instances), PRON (826; 2% instances), AUX (351; 1% instances), SCONJ (333; 1% instances), PROPN (253; 1% instances), NUM (97; 0% instances), INTJ (7; 0% instances), SYM (4; 0% instances), X (3; 0% instances)