home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-RNC: POS Tags: DET

There are 45 DET lemmas (1%), 423 DET types (4%) and 2059 DET tokens (4%). Out of 17 observed tags, the rank of DET is: 10 in number of lemmas, 5 in number of types and 9 in number of tokens.

The 10 most frequent DET lemmas: тотъ, твой, весь, свой, сей, мой, нашъ, всякий, оный, который

The 10 most frequent DET types: твой, то, сей, мои, тѣхъ, тово, тѣ, всѣхъ, те, того

The 10 most frequent ambiguous lemmas: тотъ (DET 446, PRON 18), весь (DET 275, PRON 2), сей (DET 183, PRON 2), который (PRON 40, DET 34), другой (DET 22, ADJ 15), самъ (ADJ 25, DET 12), самый (DET 11, ADJ 5), иной (ADJ 53, DET 9), кой (DET 8, PRON 2), многий (ADJ 32, DET 8)

The 10 most frequent ambiguous types: то (DET 45, PRON 18, PART 3, SCONJ 3), тово (DET 26, PRON 12), те (DET 34, PRON 2), того (DET 34, PRON 20), т. (DET 28, PRON 3), вся (DET 26, PRON 2), все (DET 22, PRON 3, ADV 1), сего (DET 19, PRON 3), всех (DET 16, PRON 1), тое (DET 18, PRON 1)

Morphology

The form / lemma ratio of DET is 9.400000 (the average of all parts of speech is 1.988362).

The 1st highest number of forms (40) was observed with the lemma “тотъ”: т[а], та, таво, те, тем, теми, темъ, тех, тии, то, тово, того, тое, тои, той, том, тому, томъ, томꙋ, тот, тотъ, тою, тоя, тоі, тоѣ, ту, тые, тыи, тыми, тымъ, тыхъ, тѡм, тѣ, тѣи, тѣм, тѣми, тѣмъ, тѣни, тѣх, тѣхъ.

The 2nd highest number of forms (34) was observed with the lemma “весь”: ве(с), вес, весь, все, все(м), всеа, всег[о], всего, всее, всеи, всей, всем, всеми, всему, всемъ, всех, всехъ, всею, всея, вси, всимъ, всихъ, всомъ, всь, всю, вся, всѣ, всѣми, всѣмъ, всѣх, всѣхъ, всꙗ, въвесь, вьси.

The 3rd highest number of forms (34) was observed with the lemma “нашъ”: н[а]ше, н[а]шеи, наш, наша, наше, наше[ю, нашег[о], нашего, нашеи, нашей, нашем, нашему, нашемꙋ, нашею, нашея, наши, наши(х), нашим, нашими, нашимъ, наших, нашихъ, нашия, нашого, нашой, нашу, нашъ, нашы, нашь, нш҃а, нш҃е, нш҃его, нш҃ем, нш҃ꙋ.

DET occurs with 10 features: PronType (2052; 100% instances), Case (2024; 98% instances), Number (2023; 98% instances), Gender (2022; 98% instances), Poss (636; 31% instances), Reflex (199; 10% instances), Animacy (80; 4% instances), Abbr (37; 2% instances), Variant (19; 1% instances), Typo (1; 0% instances)

DET occurs with 27 feature-value pairs: Abbr=Yes, Animacy=Anim, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Dual, Number=Plur, Number=Sing, Poss=Yes, PronType=Dem, PronType=Emp, PronType=Exc, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Typo=Yes, Variant=Short

DET occurs with 260 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing|Poss=Yes|PronType=Prs (82 tokens). Examples: мои, твой, нашъ, мой, моі, нашь, твои, мо, наш, наша

Relations

DET nodes are attached to their parents using 16 different relations: det (1809; 88% instances), obl (78; 4% instances), obj (58; 3% instances), nsubj (49; 2% instances), nmod (18; 1% instances), iobj (17; 1% instances), acl (8; 0% instances), conj (7; 0% instances), amod (3; 0% instances), root (3; 0% instances), dep (2; 0% instances), nsubj:pass (2; 0% instances), parataxis (2; 0% instances), advcl (1; 0% instances), expl:pv (1; 0% instances), mark (1; 0% instances)

Parents of DET nodes belong to 9 different parts of speech: NOUN (1717; 83% instances), VERB (184; 9% instances), PROPN (74; 4% instances), ADJ (48; 2% instances), DET (14; 1% instances), PRON (10; 0% instances), ADV (6; 0% instances), NUM (3; 0% instances), (3; 0% instances)

1836 (89%) DET nodes are leaves.

191 (9%) DET nodes have one child.

16 (1%) DET nodes have two children.

16 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 6.

Children of DET nodes are attached using 25 different relations: advmod (74; 26% instances), case (72; 25% instances), acl (24; 8% instances), acl:relcl (20; 7% instances), punct (15; 5% instances), cc (12; 4% instances), parataxis (12; 4% instances), det (11; 4% instances), conj (10; 3% instances), nsubj (9; 3% instances), appos (5; 2% instances), vocative (5; 2% instances), dep (4; 1% instances), amod (2; 1% instances), mark (2; 1% instances), nmod (2; 1% instances), advcl (1; 0% instances), aux (1; 0% instances), cop (1; 0% instances), dislocated (1; 0% instances), expl (1; 0% instances), fixed (1; 0% instances), iobj (1; 0% instances), nummod (1; 0% instances), obl (1; 0% instances)

Children of DET nodes belong to 15 different parts of speech: PART (74; 26% instances), ADP (70; 24% instances), NOUN (39; 14% instances), VERB (35; 12% instances), PUNCT (15; 5% instances), DET (14; 5% instances), CCONJ (12; 4% instances), ADJ (9; 3% instances), PRON (5; 2% instances), SCONJ (4; 1% instances), X (4; 1% instances), ADV (2; 1% instances), AUX (2; 1% instances), PROPN (2; 1% instances), NUM (1; 0% instances)