home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Ruthenian: POS Tags: PRON

There are 45 PRON lemmas (1%), 255 PRON types (1%) and 7670 PRON tokens (7%). Out of 17 observed tags, the rank of PRON is: 11 in number of lemmas, 9 in number of types and 8 in number of tokens.

The 10 most frequent PRON lemmas: онъ, мы, то, они, што, вы, хто, я, ся, себе

The 10 most frequent PRON types: его, то, мы, их, што, намъ, ему, нам, того, ихъ

The 10 most frequent ambiguous lemmas: онъ (PRON 1736, DET 1), то (PRON 1340, PART 39, SCONJ 7), што (PRON 414, SCONJ 408, DET 1), се (PRON 64, PART 29), ништо (PRON 36, NOUN 1), що (PRON 19, SCONJ 6), который (DET 490, PRON 17), и (CCONJ 6241, PART 191, PRON 7), иже (SCONJ 68, PRON 5, ADV 1), тотъ (DET 306, PRON 4)

The 10 most frequent ambiguous types: то (PRON 576, PART 37, DET 7, SCONJ 7), што (SCONJ 386, PRON 301, DET 1), того (PRON 238, DET 218), емꙋ (PRON 143, DET 1), том (PRON 131, DET 54), сѧ (PRON 124, DET 2), томъ (PRON 95, DET 52), вы (PRON 72, ADP 9), тому (PRON 66, DET 60), тым (DET 55, PRON 48)

Morphology

The form / lemma ratio of PRON is 5.666667 (the average of all parts of speech is 2.589846).

The 1st highest number of forms (41) was observed with the lemma “онъ”: eго, eмоу, го, е]г(о), ег(о), ег[о], его, емоу, ему, емꙋ, енъ, му, нег(о), него, нем, немоу, нему, немъ, немꙋ, ниго, ним, нимъ, нѣг(о), нѣгo, нѣго, нѣм, нѣмоу, он, онъ, є(г)о, єго, ємоу, єму, ємѹ, ѡ(н), ѡ(н)ъ, ѡн, ѡни, ѡнъ, ѥго, ѥму.

The 2nd highest number of forms (24) was observed with the lemma “то”: (т)о, ного, т(о), т(ом), то, то(м), то(м)ъ, тог(о), того, тое, том, том), томоу, тому, томъ, томь, томꙋ, ты, тым, тымъ, тымь, тѡг(о), тѡм, тѣмь.

The 3rd highest number of forms (23) was observed with the lemma “они”: [их], ени, и(м), и(х), иx, им, има, ими, имъ, их, иххъ, ихъ, н]их, ни(х), ними, нимъ, них, них(ъ), нихъ, ны(х), они, ѡни, ѡны.

PRON occurs with 10 features: PronType (7662; 100% instances), Case (7476; 97% instances), Number (7290; 95% instances), Gender (5272; 69% instances), Person (5016; 65% instances), Reflex (381; 5% instances), Clitic (234; 3% instances), Analyt (192; 3% instances), Animacy (7; 0% instances), Mood (1; 0% instances)

PRON occurs with 28 feature-value pairs: Analyt=Yes, Animacy=Anim, Case=Acc, Case=Acc,Gen, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Clitic=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Cnd, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes

PRON occurs with 130 feature combinations. The most frequent feature combination is Case=Gen|Gender=Masc|Number=Sing|Person=3|PronType=Prs (937 tokens). Examples: его, ег(о), него, єго, нег(о), eго, нѣго, го, е]г(о), ег[о]

Relations

PRON nodes are attached to their parents using 27 different relations: obl (1843; 24% instances), iobj (1519; 20% instances), nsubj (1358; 18% instances), det (1351; 18% instances), obj (889; 12% instances), nmod (181; 2% instances), expl (162; 2% instances), conj (97; 1% instances), root (74; 1% instances), nsubj:pass (49; 1% instances), orphan (40; 1% instances), expl:pv (32; 0% instances), appos (16; 0% instances), reparandum (12; 0% instances), cc (8; 0% instances), fixed (7; 0% instances), ccomp (5; 0% instances), advcl (4; 0% instances), mark (4; 0% instances), nsubj:outer (4; 0% instances), acl:relcl (3; 0% instances), dep (3; 0% instances), obl:agent (3; 0% instances), acl (2; 0% instances), parataxis (2; 0% instances), dislocated (1; 0% instances), obl:tmod (1; 0% instances)

Parents of PRON nodes belong to 14 different parts of speech: VERB (5385; 70% instances), NOUN (1808; 24% instances), ADJ (203; 3% instances), (74; 1% instances), PRON (70; 1% instances), ADV (47; 1% instances), PROPN (29; 0% instances), DET (26; 0% instances), AUX (8; 0% instances), PART (8; 0% instances), NUM (5; 0% instances), X (3; 0% instances), CCONJ (2; 0% instances), SCONJ (2; 0% instances)

5034 (66%) PRON nodes are leaves.

2084 (27%) PRON nodes have one child.

304 (4%) PRON nodes have two children.

248 (3%) PRON nodes have three or more children.

The highest child degree of a PRON node is 14.

Children of PRON nodes are attached using 33 different relations: case (2082; 55% instances), conj (275; 7% instances), appos (251; 7% instances), punct (217; 6% instances), advmod (187; 5% instances), det (168; 4% instances), cc (131; 3% instances), acl (89; 2% instances), acl:relcl (72; 2% instances), nmod (64; 2% instances), nsubj (64; 2% instances), cop (53; 1% instances), orphan (20; 1% instances), dislocated (15; 0% instances), fixed (14; 0% instances), reparandum (12; 0% instances), obl (9; 0% instances), vocative (7; 0% instances), iobj (6; 0% instances), amod (5; 0% instances), mark (5; 0% instances), advcl (3; 0% instances), discourse (3; 0% instances), nummod:gov (3; 0% instances), obj (3; 0% instances), parataxis (3; 0% instances), dep (2; 0% instances), expl (2; 0% instances), aux (1; 0% instances), ccomp (1; 0% instances), csubj (1; 0% instances), nummod (1; 0% instances), obl:float (1; 0% instances)

Children of PRON nodes belong to 17 different parts of speech: ADP (2077; 55% instances), NOUN (533; 14% instances), PUNCT (217; 6% instances), DET (177; 5% instances), PART (176; 5% instances), VERB (152; 4% instances), CCONJ (128; 3% instances), PROPN (113; 3% instances), PRON (70; 2% instances), AUX (63; 2% instances), ADJ (22; 1% instances), ADV (21; 1% instances), SCONJ (12; 0% instances), NUM (6; 0% instances), INTJ (1; 0% instances), SYM (1; 0% instances), X (1; 0% instances)