
This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-RNC: POS Tags: DET

There are 55 DET lemmas (1%), 650 DET types (3%) and 4530 DET tokens (5%). Out of 17 observed tags, the rank of DET is: 10 in number of lemmas, 5 in number of types and 8 in number of tokens.
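As a rough illustration of how counts like these could be derived, the sketch below tallies distinct lemmas, distinct word forms (types) and tokens for one UPOS tag in CoNLL-U text. The function name `upos_counts` and the toy rows are hypothetical, and the script that generated this page may differ in details such as case handling.

```python
def upos_counts(conllu_text, upos="DET"):
    """Return (lemma count, type count, token count) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Keep only regular words: 10 columns and a plain integer ID
        # (this drops multiword ranges such as 1-2 and empty nodes such as 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
    return len(lemmas), len(types), tokens


# Toy sample, not taken from the treebank; the noun rows use placeholder strings.
rows = [
    ["1", "того", "тотъ", "DET", "_", "_", "2", "det", "_", "_"],
    ["2", "noun_a", "noun_a", "NOUN", "_", "_", "0", "root", "_", "_"],
    ["3", "тѣхъ", "тотъ", "DET", "_", "_", "4", "det", "_", "_"],
    ["4", "noun_b", "noun_b", "NOUN", "_", "_", "2", "nmod", "_", "_"],
    ["5", "того", "тотъ", "DET", "_", "_", "2", "det", "_", "_"],
]
sample = "\n".join("\t".join(r) for r in rows)
print(upos_counts(sample))  # (1, 2, 3): one lemma, two types, three tokens
```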

The 10 most frequent DET lemmas: тотъ, весь, свой, твой, нашъ, сей, мой, который, всякий, иной

The 10 most frequent DET types: того, всеа, своего, сей, твой, те, все, которые, тех, тѣхъ

The most frequent ambiguous lemmas (8 listed): сей (DET 289, PRON 1), другой (DET 82, ADJ 2), самъ (DET 72, NOUN 1), одинъ (NUM 118, DET 6, ADJ 2), единъ (NUM 28, DET 4), премногий (ADJ 2, DET 2), злодѣй (NOUN 2, DET 1), смѣкальный (ADJ 8, DET 1)

The 10 most frequent ambiguous types: того (DET 126, PRON 70), все (DET 64, ADV 3), то (PRON 93, DET 54, SCONJ 14, PART 4), тое (DET 46, PRON 1), вся (DET 40, PRON 2), сего (DET 33, PRON 20), тово (DET 27, PRON 16), тому (PRON 37, DET 33), та (DET 31, PART 1), т. (DET 28, PRON 3)

Morphology

The form / lemma ratio of DET is 11.818182 (the average of all parts of speech is 2.250521).
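The ratio above is consistent with dividing the number of DET types by the number of DET lemmas reported earlier on this page (650 and 55). A minimal check, assuming that is how the figure is derived:

```python
det_types = 650   # DET types reported on this page
det_lemmas = 55   # DET lemmas reported on this page

# Average number of distinct word forms per lemma.
ratio = det_types / det_lemmas
print(f"{ratio:.6f}")  # 11.818182
```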

The 1st highest number of forms (46) was observed with the lemma “всякий”: ВСЯКОЯ, всякiя, всяка, всякаго, всякая, всякие, всякии, всякий, всяким, всякими, всякимъ, всяких, всякихъ, всякия, всякова, всяково, всяког(о), всякого, всякое, всякой, всякому, всякомъ, всякою, всяку, всякую, всякъ, всякіе, всякія, всяцей, всꙗкаго, всꙗки(м), всꙗкими, всꙗкихъ, всꙗкиꙗ, всꙗко, всꙗкого, всꙗкой, всꙗкомꙋ, всꙗкъ, всꙗкіꙗ, въся(ко)мъ, въсякимъ, въсякое, въсякой, въсякомъ, въсякую.

The 2nd highest number of forms (46) was observed with the lemma “тотъ”: т[а], та, таво, те, тем, теми, темъ, тех, тии, тимъ, тихъ, то, то(й), то(м), тово, тог[о], того, тое, тоей, тои, той, том, тому, томъ, томꙋ, тот, тотъ, тою, тоя, тоі, тоѣ, ту, тую, тые, тыи, тыми, тымъ, тыхъ, тѣ, тѣи, тѣм, тѣми, тѣмъ, тѣни, тѣх, тѣхъ.

The 3rd highest number of forms (44) was observed with the lemma “весь”: ве(с), вес, вес[ь], весь, все, все(м), всеа, всег[о], всего, всее, всеи, всей, всем, всеми, всему, всемъ, всемꙋ, всех, всехъ, всею, всея, вси, всимъ, всихъ, всомъ, всь, всю, вся, всяго, всѣ, всѣи, всѣм, всѣми, всѣмъ, всѣх, всѣхъ, всꙗ, въвесь, въсе, въсемъ, въсехъ, въсю, вьсе, вьси.

DET occurs with 10 features: PronType (4523; 100% instances), Case (4494; 99% instances), Number (4492; 99% instances), Gender (4491; 99% instances), Poss (1572; 35% instances), Reflex (522; 12% instances), Animacy (190; 4% instances), Variant (53; 1% instances), Abbr (38; 1% instances), Typo (1; 0% instances)

DET occurs with 27 feature-value pairs: Abbr=Yes, Animacy=Anim, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Dual, Number=Plur, Number=Sing, Poss=Yes, PronType=Dem, PronType=Emp, PronType=Exc, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Typo=Yes, Variant=Short

DET occurs with 301 feature combinations. The most frequent feature combination is Case=Gen|Gender=Fem|Number=Sing|PronType=Tot (170 tokens). Examples: всеа, всея, всеи, всей, всее, всякия, всꙗкіꙗ, другой, ВСЯКОЯ, всякiя
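The three feature statistics above (features, feature-value pairs, full combinations) could be computed from the FEATS column roughly as follows. This is a sketch under assumptions: `feature_stats` is a hypothetical helper, and the toy input does not reproduce the treebank counts.

```python
from collections import Counter


def feature_stats(feats_values):
    """Count features, feature-value pairs and full combinations.

    feats_values: iterable of FEATS strings, e.g. "Case=Gen|Number=Sing".
    """
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_values:
        combos[feats] += 1
        if feats == "_":
            continue  # "_" marks an empty feature set in CoNLL-U
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            features[name] += 1
            pairs[pair] += 1
    return features, pairs, combos


# Toy FEATS values (illustrative only, not real treebank counts).
toy = [
    "Case=Gen|Gender=Fem|Number=Sing|PronType=Tot",
    "Case=Gen|Gender=Fem|Number=Sing|PronType=Tot",
    "Case=Nom|Gender=Masc|Number=Sing|PronType=Dem",
]
features, pairs, combos = feature_stats(toy)
print(combos.most_common(1))  # the most frequent combination and its count
print(features["Case"], len(pairs), len(combos))
```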

Relations

DET nodes are attached to their parents using 23 different relations: det (3955; 87% instances), nsubj (139; 3% instances), obl (123; 3% instances), obj (91; 2% instances), obl:float (49; 1% instances), nmod (40; 1% instances), conj (34; 1% instances), iobj (33; 1% instances), nsubj:pass (21; 0% instances), acl (12; 0% instances), fixed (10; 0% instances), orphan (4; 0% instances), ccomp (3; 0% instances), parataxis (3; 0% instances), advcl (2; 0% instances), dep (2; 0% instances), obl:agent (2; 0% instances), root (2; 0% instances), acl:relcl (1; 0% instances), dislocated (1; 0% instances), expl:pv (1; 0% instances), obl:depict (1; 0% instances), xcomp (1; 0% instances)
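The percentages in the relation list appear to be each count divided by the 4530 DET tokens, rounded to the nearest whole percent. A small check of that assumption against a few of the counts above:

```python
total_det = 4530  # DET tokens reported on this page

# A few relation counts copied from the list above.
counts = {"det": 3955, "nsubj": 139, "obl": 123, "obj": 91, "nsubj:pass": 21}

# Share of DET tokens per relation, rounded to a whole percent.
percentages = {rel: round(100 * n / total_det) for rel, n in counts.items()}
for rel, pct in percentages.items():
    print(f"{rel}: {pct}%")
```

Under this assumption, rare relations such as nsubj:pass round down to 0% even though they do occur.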

Parents of DET nodes belong to 11 different parts of speech: NOUN (3704; 82% instances), VERB (427; 9% instances), PROPN (199; 4% instances), ADJ (103; 2% instances), PRON (49; 1% instances), DET (25; 1% instances), ADP (9; 0% instances), ADV (6; 0% instances), NUM (5; 0% instances), [no parent: root] (2; 0% instances), PART (1; 0% instances)

4013 (89%) DET nodes are leaves.

431 (10%) DET nodes have one child.

51 (1%) DET nodes have two children.

35 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 15.
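The child-degree figures above count how many nodes name a given DET node in their HEAD column; a leaf is a node with degree 0. A minimal sketch, assuming one sentence is given as (ID, UPOS, HEAD) tuples (the helper `child_degrees` and the toy sentence are hypothetical):

```python
from collections import Counter


def child_degrees(sentence_rows, upos="DET"):
    """Map each node ID with the given UPOS to its number of children.

    sentence_rows: list of (id, upos, head) tuples for one sentence.
    """
    # Every occurrence of an ID in the HEAD position is one child of that node.
    children = Counter(head for _id, _upos, head in sentence_rows)
    return {i: children[i] for i, u, _h in sentence_rows if u == upos}


# Toy sentence (illustrative): node 2 is a DET heading an ADP and a PART.
rows = [
    (1, "ADP", 2),
    (2, "DET", 4),
    (3, "PART", 2),
    (4, "VERB", 0),
    (5, "DET", 6),
    (6, "NOUN", 4),
]
degrees = child_degrees(rows)
print(degrees)  # {2: 2, 5: 0}
```

Here node 5 is a leaf and node 2 has two children; the maximum over all DET nodes in a treebank would give the highest child degree.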

Children of DET nodes are attached using 26 different relations: advmod (183; 27% instances), case (141; 21% instances), punct (46; 7% instances), appos (42; 6% instances), acl:relcl (41; 6% instances), cc (39; 6% instances), conj (37; 5% instances), acl (27; 4% instances), discourse (20; 3% instances), vocative (20; 3% instances), det (18; 3% instances), nsubj (14; 2% instances), nmod (10; 1% instances), orphan (10; 1% instances), cop (7; 1% instances), mark (5; 1% instances), dislocated (4; 1% instances), dep (3; 0% instances), obl (3; 0% instances), parataxis (3; 0% instances), advcl (2; 0% instances), amod (2; 0% instances), expl (2; 0% instances), aux (1; 0% instances), fixed (1; 0% instances), obl:pronmod (1; 0% instances)

Children of DET nodes belong to 15 different parts of speech: PART (201; 29% instances), ADP (140; 21% instances), NOUN (105; 15% instances), VERB (62; 9% instances), PUNCT (46; 7% instances), CCONJ (41; 6% instances), DET (25; 4% instances), ADJ (17; 2% instances), PRON (15; 2% instances), AUX (9; 1% instances), PROPN (6; 1% instances), SCONJ (6; 1% instances), ADV (3; 0% instances), NUM (3; 0% instances), X (3; 0% instances)