
This page pertains to UD version 2.

Treebank Statistics: UD_Old_Russian-RNC: POS Tags: DET

There are 26 DET lemmas (1%), 279 DET types (4%) and 1437 DET tokens (5%). Out of 17 observed tags, the rank of DET is: 10 in number of lemmas, 5 in number of types and 9 in number of tokens.
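Counts of this kind can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the function name `upos_stats`, the helper `tok`, and the two-sentence sample are hypothetical, and it assumes that a "type" is a distinct word form compared case-sensitively, which may differ from how the page counts types.

```python
from collections import Counter

def upos_stats(conllu_text, upos="DET"):
    """Count distinct lemmas, distinct forms (types) and tokens for one UPOS tag."""
    lemmas, forms = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Keep only regular words: skip multiword ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            forms[cols[1]] += 1
            lemmas[cols[2]] += 1
    return {"lemmas": len(lemmas), "types": len(forms), "tokens": sum(forms.values())}

def tok(idx, form, lemma, upos, head, deprel):
    """Hypothetical helper: build one 10-column CoNLL-U line."""
    return "\t".join([idx, form, lemma, upos, "_", "_", head, deprel, "_", "_"])

# Toy sample (invented for illustration, not taken from the treebank).
sample = "\n".join([
    tok("1", "тотъ", "тотъ", "DET", "2", "det"),
    tok("2", "день", "день", "NOUN", "0", "root"),
    "",
    tok("1", "того", "тотъ", "DET", "2", "det"),
    tok("2", "дни", "день", "NOUN", "0", "root"),
])

stats = upos_stats(sample)
print(stats)
```

On the toy sample, the two DET tokens share one lemma but have two distinct forms.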

The 10 most frequent DET lemmas: тотъ, твой, весь, свой, мой, нашъ, сей, всякий, какой, который

The 10 most frequent DET types: твой, мои, тѣхъ, то, тѣ, твоему, того, т., вся, сей

The most frequent ambiguous lemmas (8 in total): тотъ (DET 362, PRON 18), весь (DET 173, PRON 2), сей (DET 77, PRON 2), который (PRON 31, DET 20), кой (DET 8, PRON 2), одинъ (NUM 20, DET 3), иже (PRON 16, SCONJ 4, DET 2), съ (ADP 427, DET 1)

The 10 most frequent ambiguous types: то (DET 39, PRON 18, PART 2), того (DET 30, PRON 20), т. (DET 28, PRON 3), вся (DET 25, PRON 2), те (DET 22, PRON 2), тое (DET 15, PRON 1), тому (DET 15, PRON 6), всем (DET 13, PRON 6), всех (DET 11, PRON 1), томъ (PRON 23, DET 12)

Morphology

The form / lemma ratio of DET is 10.730769 (the average of all parts of speech is 1.947446).
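The ratio appears to be the number of DET types divided by the number of DET lemmas reported at the top of the page. A quick check of the arithmetic, with variable names chosen here for illustration:

```python
det_types = 279   # distinct DET word forms reported above
det_lemmas = 26   # distinct DET lemmas reported above

ratio = det_types / det_lemmas
print(f"{ratio:.6f}")  # prints 10.730769
```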

The highest number of forms (39) was observed with the lemma “тотъ”: та, таво, те, тем, теми, темъ, тех, тии, то, тово, того, тое, тои, той, том, тому, томъ, томꙋ, тот, тотъ, тою, тоя, тоі, тоѣ, ту, тые, тыи, тыми, тымъ, тыхъ, тѡм, тѣ, тѣи, тѣм, тѣми, тѣмъ, тѣни, тѣх, тѣхъ.

The 2nd highest number of forms (28) was observed with the lemma “весь”: вес, весь, все, всеа, всег[о], всего, всеи, всей, всем, всеми, всему, всемъ, всех, всея, вси, всимъ, всихъ, всомъ, всь, всю, вся, всѣ, всѣми, всѣмъ, всѣх, всѣхъ, въвесь, вьси.

The 3rd highest number of forms (28) was observed with the lemma “нашъ”: н[а]ше, н[а]шеи, наш, наша, наше, наше[ю, нашег[о], нашего, нашеи, нашей, нашем, нашему, нашемꙋ, нашею, нашея, наши, нашим, нашими, нашимъ, наших, нашихъ, нашия, нашого, нашой, нашу, нашъ, нашы, нашь.

DET occurs with 10 features: PronType (1437; 100% instances), Case (1407; 98% instances), Number (1406; 98% instances), Gender (1405; 98% instances), Poss (548; 38% instances), Reflex (153; 11% instances), Animacy (55; 4% instances), Abbr (32; 2% instances), Variant (8; 1% instances), Person (1; 0% instances)

DET occurs with 25 feature-value pairs: Abbr=Yes, Animacy=Anim, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Dual, Number=Plur, Number=Sing, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Variant=Short

DET occurs with 213 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing|Poss=Yes|PronType=Prs (81 tokens). Examples: мои, твой, нашъ, мой, моі, нашь, твои, мо, наш, наша

Relations

DET nodes are attached to their parents using 15 different relations: det (1306; 91% instances), obl (40; 3% instances), nsubj (29; 2% instances), obj (25; 2% instances), nmod (10; 1% instances), iobj (7; 0% instances), acl (6; 0% instances), amod (3; 0% instances), conj (3; 0% instances), root (3; 0% instances), advcl (1; 0% instances), dep (1; 0% instances), expl:pv (1; 0% instances), mark (1; 0% instances), nsubj:pass (1; 0% instances)

Parents of DET nodes belong to 9 different parts of speech: NOUN (1235; 86% instances), VERB (96; 7% instances), PROPN (62; 4% instances), ADJ (22; 2% instances), DET (8; 1% instances), PRON (8; 1% instances), no parent, i.e. the DET node is the root (3; 0% instances), ADV (2; 0% instances), NUM (1; 0% instances)
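The two distributions above (incoming relation and parent part of speech) can be tallied per sentence from CoNLL-U data. This is a sketch under assumptions, not the generator behind this page: `det_attachment`, `tok`, and the sample sentences are hypothetical. A DET attached to head 0 has no parent word, so its parent part of speech is recorded as an empty string.

```python
from collections import Counter

def det_attachment(conllu_text, upos="DET"):
    """Tally incoming relations and parent UPOS tags for words with the given UPOS."""
    rels, parent_pos = Counter(), Counter()
    for block in conllu_text.strip().split("\n\n"):  # one block per sentence
        rows = [l.split("\t") for l in block.splitlines()
                if l and not l.startswith("#")]
        # Keep only regular words (integer IDs, 10 columns).
        rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
        pos_by_id = {r[0]: r[3] for r in rows}
        for r in rows:
            if r[3] == upos:
                rels[r[7]] += 1
                # Head "0" is the artificial root, which has no UPOS.
                parent_pos[pos_by_id.get(r[6], "")] += 1
    return rels, parent_pos

def tok(idx, form, lemma, upos, head, deprel):
    """Hypothetical helper: build one 10-column CoNLL-U line."""
    return "\t".join([idx, form, lemma, upos, "_", "_", head, deprel, "_", "_"])

# Toy sample (invented for illustration, not taken from the treebank).
sample = "\n".join([
    tok("1", "тотъ", "тотъ", "DET", "2", "det"),
    tok("2", "день", "день", "NOUN", "0", "root"),
    "",
    tok("1", "то", "тотъ", "DET", "0", "root"),
])

rels, parent_pos = det_attachment(sample)
print(dict(rels), dict(parent_pos))
```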

1303 (91%) DET nodes are leaves.

115 (8%) DET nodes have one child.

9 (1%) DET nodes have two children.

10 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 6.
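As a consistency check, the four child-degree buckets above should add up to the 1437 DET tokens, and the percentages follow from rounding each share to the nearest whole percent. The dictionary keys below are labels chosen for this sketch:

```python
# Number of DET nodes by number of children, as reported above.
degree_counts = {"0": 1303, "1": 115, "2": 9, "3+": 10}

total = sum(degree_counts.values())
percent = {k: round(100 * v / total) for k, v in degree_counts.items()}

print(total)    # prints 1437
print(percent)  # prints {'0': 91, '1': 8, '2': 1, '3+': 1}
```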

Children of DET nodes are attached using 22 different relations: case (45; 26% instances), advmod (39; 23% instances), acl:relcl (14; 8% instances), acl (13; 8% instances), parataxis (9; 5% instances), punct (9; 5% instances), conj (7; 4% instances), det (7; 4% instances), nsubj (6; 3% instances), cc (5; 3% instances), appos (4; 2% instances), dep (3; 2% instances), nmod (2; 1% instances), vocative (2; 1% instances), advcl (1; 1% instances), amod (1; 1% instances), aux (1; 1% instances), dislocated (1; 1% instances), expl (1; 1% instances), fixed (1; 1% instances), mark (1; 1% instances), obl (1; 1% instances)

Children of DET nodes belong to 14 different parts of speech: ADP (43; 25% instances), PART (40; 23% instances), NOUN (31; 18% instances), VERB (18; 10% instances), PUNCT (9; 5% instances), DET (8; 5% instances), CCONJ (5; 3% instances), PRON (5; 3% instances), ADJ (4; 2% instances), SCONJ (3; 2% instances), X (3; 2% instances), PROPN (2; 1% instances), ADV (1; 1% instances), AUX (1; 1% instances)