This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

PRON: pronoun

Definition

Pronouns are words that substitute for nouns or noun phrases, whose meaning is recoverable from the linguistic or extralinguistic context.

Pronouns under this definition function like nouns. Note that Russian grammar traditionally extends the term pronoun to words that substitute for adjectives. Such words are not tagged PRON under our universal scheme; they are tagged as determiners (DET) so that the same phenomenon is annotated the same way across languages.

For instance, это “this” is traditionally called a pronoun in Russian grammar regardless of context (the notion of determiner does not exist in Russian grammar). To keep the annotation parallel across languages, it should now be tagged PRON in Я видел это вчера. “I saw this yesterday.” and DET in Я видел эту машину вчера. “I saw this car yesterday.”
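The PRON/DET distinction can be illustrated with a minimal sketch. The rule below is a simplification assumed for illustration, not the procedure actually used to build the treebank: a demonstrative attached to a noun via `det` is tagged DET, and any other attachment is treated as noun-like and tagged PRON. The names `Token` and `tag_demonstrative` and the hand-written dependency analyses are hypothetical.

```python
from typing import NamedTuple


class Token(NamedTuple):
    form: str
    head: int    # 1-based index of the head token, 0 for the root
    deprel: str  # UD v1 relation label


def tag_demonstrative(token: Token) -> str:
    """Return DET when the word modifies a noun, PRON when it stands for one.

    Simplifying assumption: the adjective-like use is signalled by the
    `det` relation; every other attachment counts as noun-like.
    """
    return "DET" if token.deprel == "det" else "PRON"


# Hand-annotated analyses of the two example sentences (illustrative only).
# "Я видел это вчера." / "I saw this yesterday."
sentence_pron = [
    Token("Я", 2, "nsubj"),
    Token("видел", 0, "root"),
    Token("это", 2, "dobj"),
    Token("вчера", 2, "advmod"),
    Token(".", 2, "punct"),
]
# "Я видел эту машину вчера." / "I saw this car yesterday."
sentence_det = [
    Token("Я", 2, "nsubj"),
    Token("видел", 0, "root"),
    Token("эту", 4, "det"),
    Token("машину", 2, "dobj"),
    Token("вчера", 2, "advmod"),
    Token(".", 2, "punct"),
]

print(tag_demonstrative(sentence_pron[2]))  # это as object of the verb
print(tag_demonstrative(sentence_det[2]))   # эту modifying машину
```

Under these assumptions the first call yields PRON and the second DET, matching the tagging described above.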

Examples


Treebank Statistics (UD_Russian)

There are 28 PRON lemmas (0%), 92 PRON types (0%) and 1915 PRON tokens (2%). Out of 16 observed tags, the rank of PRON is: 9 in number of lemmas, 9 in number of types and 10 in number of tokens.

The 10 most frequent PRON lemmas: ОН, КОТОРЫЙ, ТО, ОНИ, ОНА, ЭТО, СЕБЯ, ЧТО, Я, МЫ

The 10 most frequent PRON types: он, который, это, она, которые, они, его, того, что, которой

The 10 most frequent ambiguous lemmas: ТО (PRON 184, ADV 25, CONJ 7, ADP 2, SCONJ 2), ЭТО (PRON 147, PART 28), ЧТО (SCONJ 250, PRON 81, DET 12, ADP 12, NOUN 1), Я (PRON 32, NOUN 1), ВСЁ (PRON 24, ADV 13, PART 3), Т. (PRON 13, ADV 7, SCONJ 2), I (ADJ 22, NOUN 3, PRON 2), ME (PRON 2, NOUN 1)

The 10 most frequent ambiguous types: это (PRON 55, PART 25, DET 23), его (DET 189, PRON 67), того (PRON 62, DET 16), что (SCONJ 250, PRON 55, ADP 12, DET 11, NOUN 1), тем (PRON 35, DET 7, ADV 1), им (PRON 34, ADJ 1), том (DET 41, PRON 36, NOUN 2), их (DET 66, PRON 33), этом (PRON 33, DET 27), то (DET 34, PRON 29, ADV 25, CONJ 7, ADP 2, SCONJ 2)

Morphology

The form / lemma ratio of PRON is 3.285714 (the average of all parts of speech is 1.591757).
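The reported ratio appears consistent with dividing the number of distinct PRON types (92) by the number of distinct PRON lemmas (28). The sketch below assumes that reading; the helper `form_lemma_ratio` and the toy sample are hypothetical and are not part of the tool that generated these statistics.

```python
from typing import Iterable, Tuple


def form_lemma_ratio(pairs: Iterable[Tuple[str, str]]) -> float:
    """Distinct word forms divided by distinct lemmas."""
    pairs = list(pairs)
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)


# Toy sample of (form, lemma) pairs: 5 distinct forms over 2 lemmas.
sample = [
    ("он", "ОН"), ("его", "ОН"), ("ему", "ОН"),
    ("она", "ОНА"), ("её", "ОНА"),
    ("он", "ОН"),  # a repeated token does not change the ratio
]
sample_ratio = form_lemma_ratio(sample)
print(sample_ratio)  # 2.5

# Counts reported for UD_Russian: 92 types, 28 lemmas.
reported_ratio = 92 / 28
print(f"{reported_ratio:.6f}")  # 3.285714
```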

The highest number of forms (12) was observed with the lemma “КОТОРЫЙ”: которая, которого, которое, которой, котором, которому, которую, которые, который, которым, которыми, которых.

The second-highest number of forms (9) was observed with the lemma “ОН”: его, ему, им, него, нем, нему, ним, нём, он.

The third-highest number of forms (8) was observed with the lemma “ОНА”: ее, ей, ею, её, нее, ней, неё, она.

PRON occurs with 6 features: Case (1915; 100% instances), Number (1831; 96% instances), Gender (1412; 74% instances), Animacy (922; 48% instances), Person (909; 47% instances), Reflex (84; 4% instances)

PRON occurs with 18 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Reflex=Yes

PRON occurs with 84 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing|Person=3 (263 tokens). Examples: он, He
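A feature combination such as the one above is a `|`-separated list of `Name=Value` pairs, as stored in the FEATS column of a CoNLL-U file, where `_` marks an empty feature set. A minimal sketch of reading such a string into a dictionary follows; the function name `parse_feats` is hypothetical.

```python
from typing import Dict


def parse_feats(feats: str) -> Dict[str, str]:
    """Split a FEATS string into a feature-to-value mapping."""
    if feats == "_":  # CoNLL-U placeholder for "no features"
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


most_frequent = parse_feats("Case=Nom|Gender=Masc|Number=Sing|Person=3")
print(most_frequent["Case"], most_frequent["Person"])  # Nom 3
print(parse_feats("_"))  # {}
```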

Relations

PRON nodes are attached to their parents using 15 different relations: nsubj (764; 40% instances), nmod (651; 34% instances), dobj (221; 12% instances), iobj (176; 9% instances), nsubjpass (41; 2% instances), advmod (22; 1% instances), mark (12; 1% instances), det (10; 1% instances), case (8; 0% instances), nmod:agent (4; 0% instances), vocative (2; 0% instances), appos (1; 0% instances), conj (1; 0% instances), discourse (1; 0% instances), root (1; 0% instances)
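The percentages above can be reproduced from the raw counts, assuming each share is the count divided by the total number of PRON tokens and rounded to a whole percent. The sketch below makes that assumption explicit; the helper `relation_shares` is hypothetical, and the counts are copied from the list above.

```python
from typing import Dict


def relation_shares(counts: Dict[str, int]) -> Dict[str, int]:
    """Whole-percent share of each relation among all counted tokens."""
    total = sum(counts.values())
    return {rel: round(100 * n / total) for rel, n in counts.items()}


# Counts as reported for UD_Russian.
reported = {
    "nsubj": 764, "nmod": 651, "dobj": 221, "iobj": 176, "nsubjpass": 41,
    "advmod": 22, "mark": 12, "det": 10, "case": 8, "nmod:agent": 4,
    "vocative": 2, "appos": 1, "conj": 1, "discourse": 1, "root": 1,
}
shares = relation_shares(reported)

print(sum(reported.values()))  # 1915, the number of PRON tokens
print(shares["nsubj"], shares["nmod"], shares["case"])  # 40 34 0
```

This also shows why several relations are listed with 0%: their counts are nonzero but round down to zero.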

Parents of PRON nodes belong to 11 different parts of speech: VERB (1502; 78% instances), NOUN (234; 12% instances), ADJ (99; 5% instances), ADV (29; 2% instances), ADP (20; 1% instances), NUM (12; 1% instances), PROPN (8; 0% instances), DET (5; 0% instances), SYM (4; 0% instances), PUNCT (1; 0% instances), ROOT (1; 0% instances)

1267 (66%) PRON nodes are leaves.

519 (27%) PRON nodes have one child.

94 (5%) PRON nodes have two children.

35 (2%) PRON nodes have three or more children.

The highest child degree of a PRON node is 11.
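The child degree of a node is the number of tokens that name it as their head. A minimal sketch of how the leaf, one-child, two-child, and three-or-more buckets could be counted for a single sentence is given below; the function `pron_child_degrees` and the hand-annotated example are hypothetical, and the actual statistics tool may work differently.

```python
from collections import Counter
from typing import List


def pron_child_degrees(heads: List[int], upos: List[str]) -> Counter:
    """Histogram of child counts for PRON tokens in one sentence.

    heads[i] is the 1-based head index of token i+1 (0 for the root).
    Degrees of three or more are pooled into bucket 3.
    """
    degree = Counter(h for h in heads if h > 0)
    histogram = Counter()
    for index, tag in enumerate(upos, start=1):
        if tag == "PRON":
            histogram[min(degree[index], 3)] += 1
    return histogram


# "Я думал о нём." / "I thought about him." (illustrative annotation)
# The preposition о attaches to the pronoun нём as its child.
heads = [2, 0, 4, 2, 2]
upos = ["PRON", "VERB", "ADP", "PRON", "PUNCT"]
histogram = pron_child_degrees(heads, upos)
print(histogram[0], histogram[1])  # 1 1
```

Here Я is a leaf and нём has one child, so the sentence contributes one token to each of the first two buckets.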

Children of PRON nodes are attached using 23 different relations: case (551; 67% instances), punct (59; 7% instances), mwe (55; 7% instances), acl:relcl (34; 4% instances), goeswith (22; 3% instances), discourse (18; 2% instances), ccomp (17; 2% instances), amod (15; 2% instances), det (13; 2% instances), advcl (6; 1% instances), appos (6; 1% instances), cc (5; 1% instances), conj (5; 1% instances), advmod (4; 0% instances), neg (4; 0% instances), nmod (4; 0% instances), acl (3; 0% instances), cc:preconj (1; 0% instances), dobj (1; 0% instances), nsubj (1; 0% instances), nummod (1; 0% instances), nummod:gov (1; 0% instances), parataxis (1; 0% instances)

Children of PRON nodes belong to 12 different parts of speech: ADP (560; 68% instances), PUNCT (70; 8% instances), VERB (59; 7% instances), PART (39; 5% instances), ADV (29; 4% instances), NOUN (26; 3% instances), ADJ (18; 2% instances), DET (14; 2% instances), CONJ (6; 1% instances), NUM (3; 0% instances), PROPN (2; 0% instances), SCONJ (1; 0% instances)


Treebank Statistics (UD_Russian-SynTagRus)

There are 15 PRON lemmas (0%), 71 PRON types (0%) and 30240 PRON tokens (3%). Out of 17 observed tags, the rank of PRON is: 15 in number of lemmas, 11 in number of types and 10 in number of tokens.

The 10 most frequent PRON lemmas: он, они, я, мы, она, что, себя, вы, кто, ты

The 10 most frequent PRON types: он, я, мы, они, что, его, она, их, них, нас

The 10 most frequent ambiguous lemmas: что (SCONJ 7138, PRON 2705, PART 1), вы (PRON 1031, X 1)

The 10 most frequent ambiguous types: что (SCONJ 7104, PRON 1559, NOUN 1), его (PRON 1494, DET 1338, ADJ 81), их (PRON 1189, DET 816, ADJ 35), ее (PRON 786, DET 531, ADJ 36), вы (PRON 461, X 1), себе (PRON 562, PART 7), ничего (PRON 422, ADV 16, PART 3), чем (SCONJ 637, PRON 319), её (PRON 22, DET 18), ком (PRON 4, NOUN 3)

Morphology

The form / lemma ratio of PRON is 4.733333 (the average of all parts of speech is 2.665758).

The highest number of forms (10) was observed with the lemma “оно”: его, ему, им, им., него, нем, нему, ним, нём, оно.

The second-highest number of forms (9) was observed with the lemma “он”: его, ему, им, него, нем, нему, ним, нём, он.

The third-highest number of forms (9) was observed with the lemma “она”: ее, ей, ею, её, нее, ней, нею, неё, она.

PRON does not occur with any features.

Relations

PRON nodes are attached to their parents using 18 different relations: nsubj (14374; 48% instances), nmod (9732; 32% instances), dobj (3893; 13% instances), nsubjpass (550; 2% instances), iobj (367; 1% instances), root (335; 1% instances), conj (233; 1% instances), dep (221; 1% instances), nmod:agent (197; 1% instances), parataxis (91; 0% instances), mwe (78; 0% instances), advmod (62; 0% instances), advcl (54; 0% instances), amod (22; 0% instances), name (9; 0% instances), acl:relcl (8; 0% instances), appos (8; 0% instances), acl (6; 0% instances)

Parents of PRON nodes belong to 14 different parts of speech: VERB (24071; 80% instances), NOUN (2612; 9% instances), ADJ (2313; 8% instances), ADV (520; 2% instances), ROOT (335; 1% instances), PRON (165; 1% instances), NUM (90; 0% instances), PROPN (57; 0% instances), PART (48; 0% instances), SCONJ (13; 0% instances), CONJ (9; 0% instances), INTJ (3; 0% instances), SYM (3; 0% instances), X (1; 0% instances)

21141 (70%) PRON nodes are leaves.

6856 (23%) PRON nodes have one child.

1412 (5%) PRON nodes have two children.

831 (3%) PRON nodes have three or more children.

The highest child degree of a PRON node is 13.

Children of PRON nodes are attached using 25 different relations: case (5412; 42% instances), punct (3215; 25% instances), advmod (1313; 10% instances), nsubj (501; 4% instances), amod (485; 4% instances), cc (468; 4% instances), conj (353; 3% instances), nmod (251; 2% instances), parataxis (225; 2% instances), appos (143; 1% instances), neg (96; 1% instances), acl (89; 1% instances), cop (88; 1% instances), det (78; 1% instances), mark (60; 0% instances), mwe (32; 0% instances), advcl (24; 0% instances), nummod:gov (21; 0% instances), acl:relcl (15; 0% instances), aux (5; 0% instances), name (4; 0% instances), nummod (4; 0% instances), dep (3; 0% instances), discourse (3; 0% instances), iobj (3; 0% instances)

Children of PRON nodes belong to 15 different parts of speech: ADP (5412; 42% instances), PUNCT (3215; 25% instances), PART (1084; 8% instances), NOUN (982; 8% instances), ADJ (601; 5% instances), ADV (394; 3% instances), CONJ (337; 3% instances), VERB (223; 2% instances), SCONJ (199; 2% instances), PRON (165; 1% instances), PROPN (107; 1% instances), DET (78; 1% instances), AUX (77; 1% instances), NUM (14; 0% instances), INTJ (3; 0% instances)


PRON in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]