home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Poetry: POS Tags: PRON

There are 26 PRON lemmas (0%), 97 PRON types (1%) and 3533 PRON tokens (6%). Out of 17 observed tags, the rank of PRON is: 13 in number of lemmas, 7 in number of types and 6 in number of tokens.

The 10 most frequent PRON lemmas: я, ты, мы, он, что, она, они, всё, кто, это

The 10 most frequent PRON types: я, ты, мне, он, что, мы, меня, все, тебя, нам

The 10 most frequent ambiguous lemmas: что (PRON 188, SCONJ 173, ADV 4), всё (PRON 156, ADV 58), это (PRON 84, PART 14), то (PRON 55, CCONJ 30, PART 25, SCONJ 7), что-то (PRON 22, ADV 1), друг (NOUN 51, PRON 20)

The 10 most frequent ambiguous types: что (SCONJ 118, PRON 104, ADV 1), все (PRON 62, DET 31, ADV 16), это (PRON 50, DET 7, PART 6), всё (PRON 31, ADV 10, DET 5), их (DET 48, PRON 31), его (DET 38, PRON 24), то (PART 24, PRON 22, CCONJ 9, DET 5, SCONJ 1), ее (DET 26, PRON 17), чем (PRON 19, SCONJ 12), всем (PRON 12, DET 7)

Morphology

The form / lemma ratio of PRON is 3.730769 (the average of all parts of speech is 1.831021).

The 1st highest number of forms (9) was observed with the lemma “он”: его, ему, им, него, нем, нему, ним, нём, он.

The 2nd highest number of forms (7) was observed with the lemma “она”: ее, ей, ею, нее, ней, нею, она.

The 3rd highest number of forms (7) was observed with the lemma “они”: им, ими, их, ним, ними, них, они.

PRON occurs with 8 features: PronType (3533; 100% instances), Case (3532; 100% instances), Number (3448; 98% instances), Person (2746; 78% instances), Gender (1122; 32% instances), Animacy (702; 20% instances), Reflex (64; 2% instances), Abbr (1; 0% instances)

PRON occurs with 27 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Dem, PronType=Exc, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rcp, PronType=Rel, PronType=Tot, Reflex=Yes

PRON occurs with 111 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing|Person=1|PronType=Prs (690 tokens). Examples: я

Relations

PRON nodes are attached to their parents using 26 different relations: nsubj (1759; 50% instances), iobj (475; 13% instances), obl (448; 13% instances), obj (417; 12% instances), conj (104; 3% instances), root (96; 3% instances), nmod (65; 2% instances), nsubj:pass (50; 1% instances), fixed (19; 1% instances), obl:agent (16; 0% instances), parataxis (14; 0% instances), det (12; 0% instances), expl (9; 0% instances), advmod (6; 0% instances), appos (6; 0% instances), ccomp (6; 0% instances), orphan (6; 0% instances), parataxis:discourse (6; 0% instances), flat (5; 0% instances), advcl (4; 0% instances), xcomp (3; 0% instances), acl:relcl (2; 0% instances), csubj (2; 0% instances), acl (1; 0% instances), nsubj:outer (1; 0% instances), obl:float (1; 0% instances)

Parents of PRON nodes belong to 13 different parts of speech: VERB (2579; 73% instances), NOUN (301; 9% instances), ADJ (300; 8% instances), PRON (122; 3% instances), (96; 3% instances), ADV (61; 2% instances), DET (38; 1% instances), NUM (13; 0% instances), PART (7; 0% instances), INTJ (5; 0% instances), PROPN (5; 0% instances), AUX (4; 0% instances), CCONJ (2; 0% instances)

2474 (70%) PRON nodes are leaves.

722 (20%) PRON nodes have one child.

152 (4%) PRON nodes have two children.

185 (5%) PRON nodes have three or more children.

The highest child degree of a PRON node is 8.

Children of PRON nodes are attached using 29 different relations: case (600; 34% instances), punct (310; 18% instances), advmod (155; 9% instances), nsubj (102; 6% instances), conj (87; 5% instances), acl (74; 4% instances), cc (56; 3% instances), vocative (53; 3% instances), appos (51; 3% instances), det (44; 3% instances), fixed (31; 2% instances), acl:relcl (26; 1% instances), amod (25; 1% instances), nmod (25; 1% instances), orphan (17; 1% instances), dislocated (12; 1% instances), parataxis (11; 1% instances), advcl (10; 1% instances), obl (10; 1% instances), cop (9; 1% instances), iobj (9; 1% instances), mark (8; 0% instances), discourse (5; 0% instances), parataxis:discourse (4; 0% instances), expl (3; 0% instances), aux (2; 0% instances), csubj (2; 0% instances), nummod (1; 0% instances), nummod:gov (1; 0% instances)

Children of PRON nodes belong to 16 different parts of speech: ADP (585; 34% instances), PUNCT (310; 18% instances), NOUN (217; 12% instances), PART (131; 8% instances), PRON (122; 7% instances), VERB (96; 6% instances), ADJ (76; 4% instances), CCONJ (56; 3% instances), ADV (48; 3% instances), DET (48; 3% instances), SCONJ (26; 1% instances), AUX (12; 1% instances), PROPN (10; 1% instances), INTJ (3; 0% instances), NUM (2; 0% instances), SYM (1; 0% instances)