
This page pertains to UD version 2.

Treebank Statistics: UD_Russian-SynTagRus: POS Tags: PRON

There are 30 PRON lemmas (0%), 132 PRON types (0%) and 49058 PRON tokens (4%). Out of 17 observed tags, the rank of PRON is: 13 in number of lemmas, 9 in number of types and 7 in number of tokens.
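Lemma, type and token counts of this kind can be recomputed from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the sample sentences are made up, the helper names `read_tokens` and `pos_counts` are hypothetical, and it assumes that a "type" is a distinct lowercased word form.

```python
# Tiny made-up CoNLL-U fragment; the ten columns are
# ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC.
SAMPLE = "\n".join([
    "# text = Он видел их .",
    "1\tОн\tон\tPRON\t_\tCase=Nom|Gender=Masc|Number=Sing|Person=3\t2\tnsubj\t_\t_",
    "2\tвидел\tвидеть\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tих\tони\tPRON\t_\tCase=Acc|Number=Plur|Person=3\t2\tobj\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
    "",
    "# text = Он знает это .",
    "1\tОн\tон\tPRON\t_\tCase=Nom|Gender=Masc|Number=Sing|Person=3\t2\tnsubj\t_\t_",
    "2\tзнает\tзнать\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tэто\tэто\tPRON\t_\tCase=Acc\t2\tobj\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
    "",
])


def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for every ordinary token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3]


def pos_counts(conllu_text, upos):
    """Count lemmas, types and tokens for one UPOS tag, plus its token share."""
    lemmas, types = set(), set()
    tokens = total = 0
    for form, lemma, tag in read_tokens(conllu_text):
        total += 1
        if tag == upos:
            tokens += 1
            lemmas.add(lemma)
            types.add(form.lower())  # assumption: types are case-insensitive
    share = tokens / total if total else 0.0
    return {"lemmas": len(lemmas), "types": len(types), "tokens": tokens, "share": share}


pron_stats = pos_counts(SAMPLE, "PRON")
```

On the sample, `pron_stats` reports 3 lemmas, 3 types and 4 tokens, with PRON making up half of the 8 tokens.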

The 10 most frequent PRON lemmas: он, это, который, они, то, я, мы, она, что, все

The 10 most frequent PRON types: это, он, я, мы, они, что, его, которые, она, их

The 10 most frequent ambiguous lemmas: это (PRON 5931, PART 58), то (PRON 4232, SCONJ 1171, PART 228), что (SCONJ 7898, PRON 2773, PART 1), все (PRON 2084, PART 536), вы (PRON 1028, X 1), что-то (PRON 332, ADV 10), прочее (PRON 24, NOUN 1), нечего (PRON 23, NOUN 12, ADV 8), некого (PRON 6, NOUN 2), тем (SCONJ 92, PRON 1)

The 10 most frequent ambiguous types: это (PRON 3282, DET 400, PART 42), что (SCONJ 7863, PRON 1611, NOUN 1), его (PRON 1525, DET 1446), их (PRON 1201, DET 862), то (SCONJ 1156, PRON 1014, DET 291, PART 228), все (DET 1174, PRON 1001, PART 496), того (PRON 989, DET 171), том (PRON 907, DET 406, NOUN 5), ее (PRON 799, DET 578), этом (PRON 795, DET 505)
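The ambiguity lists above pair each lemma or word form with every tag it was observed with. A minimal sketch of that grouping follows, assuming the input is a sequence of (item, tag) pairs; the function name `ambiguous_items` and the sample pairs are made up for illustration.

```python
from collections import Counter, defaultdict


def ambiguous_items(pairs, upos):
    """Return items seen with `upos` and at least one other tag.

    `pairs` is an iterable of (item, tag), where item is a lemma or a word form.
    The result maps each ambiguous item to its (tag, count) list, most frequent first.
    """
    by_item = defaultdict(Counter)
    for item, tag in pairs:
        by_item[item][tag] += 1
    return {
        item: tags.most_common()
        for item, tags in by_item.items()
        if upos in tags and len(tags) > 1
    }


# Made-up observations, not counts from the treebank.
observations = [
    ("что", "SCONJ"), ("что", "SCONJ"), ("что", "PRON"),
    ("он", "PRON"),
    ("это", "PRON"), ("это", "PART"),
]
ambiguous = ambiguous_items(observations, "PRON")
```

Here `ambiguous` contains "что" and "это" but not "он", which was only ever seen as PRON.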

Morphology

The form / lemma ratio of PRON is 4.400000 (the average of all parts of speech is 2.589298).
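The reported ratio agrees with dividing the 132 PRON types by the 30 PRON lemmas given at the top of the page. The short check below illustrates that arithmetic; reading the ratio as types divided by lemmas is an inference from the numbers, not a statement about how the page was generated. The form lists are copied from the per-lemma lines that follow.

```python
# Counts taken from the summary line of this page.
pron_types = 132
pron_lemmas = 30

# Assumption: the form / lemma ratio is simply types divided by lemmas.
ratio = pron_types / pron_lemmas
formatted_ratio = f"{ratio:.6f}"

# Forms listed on this page for the three lemmas with the most forms.
forms = {
    "который": "которая которого которое которой котором которому "
               "которую которые который которым которыми которых".split(),
    "он": "его ему им него нем нему ним нём он".split(),
    "она": "ее ей ею её нее ней нею неё она".split(),
}
forms_per_lemma = {lemma: len(set(words)) for lemma, words in forms.items()}
```

`formatted_ratio` is "4.400000", and `forms_per_lemma` gives 12, 9 and 9 forms for the three lemmas.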

The highest number of forms (12) was observed with the lemma “который”: которая, которого, которое, которой, котором, которому, которую, которые, который, которым, которыми, которых.

The second-highest number of forms (9) was observed with the lemma “он”: его, ему, им, него, нем, нему, ним, нём, он.

The same number of forms (9) was observed with the lemma “она”: ее, ей, ею, её, нее, ней, нею, неё, она.

PRON occurs with 5 features: Case (48402; 99% instances), Number (36150; 74% instances), Person (24513; 50% instances), Gender (21747; 44% instances), Animacy (11637; 24% instances)

PRON occurs with 16 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3

PRON occurs with 77 feature combinations. The most frequent feature combination is Case=Nom (5106 tokens). Examples: он, это, что, я, которые, мы, они, кто, она, который
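The three feature statistics above (features, feature-value pairs, and whole combinations) can all be derived from the FEATS column. The sketch below shows one way to do it; the function name `feature_stats` and the sample values are hypothetical, and it assumes FEATS strings in the usual `Name=Value|Name=Value` form with `_` for "no features".

```python
from collections import Counter


def feature_stats(feats_column):
    """Count feature names, feature=value pairs, and full combinations."""
    feature_counts = Counter()
    pair_counts = Counter()
    combo_counts = Counter()
    for feats in feats_column:
        combo_counts[feats] += 1  # the whole string is one combination
        if feats == "_":
            continue  # token carries no features
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            feature_counts[name] += 1
            pair_counts[pair] += 1
    return feature_counts, pair_counts, combo_counts


# Made-up FEATS values for five PRON tokens.
sample_feats = [
    "Case=Nom|Gender=Masc|Number=Sing|Person=3",
    "Case=Acc|Number=Plur|Person=3",
    "Case=Nom",
    "Case=Nom",
    "_",
]
features, pairs, combos = feature_stats(sample_feats)
top_combo, top_count = combos.most_common(1)[0]
```

On the sample, Case occurs on 4 of the 5 tokens and the most frequent combination is `Case=Nom` with 2 tokens.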

Relations

PRON nodes are attached to their parents using 28 different relations: nsubj (20904; 43% instances), obl (10462; 21% instances), obj (5756; 12% instances), nmod (3458; 7% instances), iobj (3231; 7% instances), nsubj:pass (1124; 2% instances), root (789; 2% instances), cop (644; 1% instances), fixed (560; 1% instances), conj (546; 1% instances), mark (510; 1% instances), parataxis (430; 1% instances), advmod (146; 0% instances), discourse (141; 0% instances), orphan (77; 0% instances), advcl (68; 0% instances), acl:relcl (44; 0% instances), ccomp (43; 0% instances), expl (38; 0% instances), amod (27; 0% instances), appos (17; 0% instances), acl (15; 0% instances), flat:name (12; 0% instances), csubj (11; 0% instances), xcomp (2; 0% instances), cc (1; 0% instances), flat (1; 0% instances), nummod (1; 0% instances)

Parents of PRON nodes belong to 16 different categories (15 parts of speech, plus the artificial root, which has no part of speech): VERB (35520; 72% instances), NOUN (6444; 13% instances), ADJ (3581; 7% instances), ADV (1252; 3% instances), no parent, i.e. attached as root (789; 2% instances), PRON (497; 1% instances), ADP (256; 1% instances), DET (209; 0% instances), NUM (200; 0% instances), PROPN (144; 0% instances), PART (91; 0% instances), CCONJ (46; 0% instances), SCONJ (12; 0% instances), SYM (12; 0% instances), INTJ (3; 0% instances), X (2; 0% instances)

32703 (67%) PRON nodes are leaves.

11260 (23%) PRON nodes have one child.

3101 (6%) PRON nodes have two children.

1994 (4%) PRON nodes have three or more children.

The highest child degree of a PRON node is 13.
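The leaf and child-degree figures above partition the 49058 PRON tokens (32703 + 11260 + 3101 + 1994 = 49058). A minimal sketch of how such a distribution can be computed from the HEAD column of one sentence follows; the function name `child_degree_histogram` and the sample tree are made up for illustration.

```python
from collections import Counter


def child_degree_histogram(sentence, upos):
    """Map number-of-children -> number of `upos` nodes with that many children.

    `sentence` is a list of (token_id, upos_tag, head_id) tuples for one tree;
    head_id 0 marks the root.
    """
    # Count how many tokens name each token id as their head.
    n_children = Counter(head for _, _, head in sentence if head != 0)
    histogram = Counter()
    for token_id, tag, _ in sentence:
        if tag == upos:
            histogram[n_children[token_id]] += 1  # missing ids count as 0 children
    return histogram


# Made-up tree: token 2 is the root verb; token 4 is a PRON with a preposition child.
tree = [
    (1, "PRON", 2),
    (2, "VERB", 0),
    (3, "ADP", 4),
    (4, "PRON", 2),
    (5, "PUNCT", 2),
]
degrees = child_degree_histogram(tree, "PRON")
max_degree = max(degrees)
```

In the sample, one PRON node is a leaf and one has a single child, so the highest child degree is 1. Summing such histograms over all sentences would give treebank-wide figures.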

Children of PRON nodes are attached using 29 different relations: case (10719; 43% instances), punct (3260; 13% instances), acl (2205; 9% instances), advmod (1809; 7% instances), fixed (1087; 4% instances), nsubj (866; 3% instances), amod (694; 3% instances), cc (609; 2% instances), det (584; 2% instances), conj (557; 2% instances), parataxis (439; 2% instances), obl (403; 2% instances), nmod (354; 1% instances), acl:relcl (287; 1% instances), advcl (268; 1% instances), nummod:gov (193; 1% instances), mark (187; 1% instances), orphan (179; 1% instances), appos (158; 1% instances), cop (149; 1% instances), discourse (81; 0% instances), csubj (45; 0% instances), iobj (32; 0% instances), aux (8; 0% instances), nummod (6; 0% instances), flat:name (5; 0% instances), flat (3; 0% instances), obj (3; 0% instances), dep (1; 0% instances)

Children of PRON nodes belong to 16 different parts of speech: ADP (10688; 42% instances), PUNCT (3260; 13% instances), VERB (3096; 12% instances), PART (1955; 8% instances), NOUN (1627; 6% instances), ADJ (1242; 5% instances), ADV (1080; 4% instances), DET (606; 2% instances), CCONJ (603; 2% instances), PRON (497; 2% instances), SCONJ (180; 1% instances), PROPN (148; 1% instances), AUX (124; 0% instances), NUM (80; 0% instances), INTJ (4; 0% instances), SYM (1; 0% instances)