home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Ruthenian: POS Tags: DET

There are 53 DET lemmas (1%), 827 DET types (4%) and 8139 DET tokens (7%). Out of 17 observed tags, the rank of DET is: 9 in number of lemmas, 5 in number of types and 6 in number of tokens.

The 10 most frequent DET lemmas: нашъ, той, весь, свой, вашъ, который, тотъ, сей, самъ, иный

The 10 most frequent DET types: нашим, тыи, того, тых, нашого, ваша, свои, тот, тое, тые

The 10 most frequent ambiguous lemmas: весь (DET 887, ADV 2), который (DET 490, PRON 17), тотъ (DET 306, PRON 4), самъ (DET 227, PRON 1), оный (DET 79, PRON 2), другий (DET 60, ADJ 9), твой (DET 40, PRON 1), кождый (DET 35, ADJ 7), сесь (DET 31, ADV 1), одинъ (NUM 65, DET 20)

The 10 most frequent ambiguous types: того (PRON 238, DET 218), тое (DET 143, PRON 16), сами (DET 109, PRON 1), которые (DET 78, PRON 3), тым (DET 55, PRON 48), тымъ (DET 54, PRON 44), все (DET 59, ADV 2, PRON 1), тому (PRON 66, DET 60), симъ (DET 59, PRON 1), том (PRON 131, DET 54)

Morphology

The form / lemma ratio of DET is 15.603774 (the average of all parts of speech is 2.589846).

The 1st highest number of forms (100) was observed with the lemma “нашъ”: н(а)ш(е)му, н(а)ш(и)мъ, н(а)ш(о)г(о), н(а)ш(о)го, н(а)ш(о)му, н(а)ша, н(а)ше, н(а)шее, н(а)ши, н(а)шим, н(а)шими, н(а)шимъ, н(а)ших, н(а)шихъ, н(а)шо, н(а)шог(о), н(а)шого, н(а)шое, н(а)шои, н(а)шому, н(а)шомъ, н(а)шомꙋ, н(а)шою, н(а)шу, н(а)шъ, н(а)шым, н(а)шымъ, н(а)шь, н(а)шꙋ, на[ш]их, на[шим, наш, наш(а), наш[ими], наша, нашго, наше, нашег(о), нашего, нашее, нашеи, нашей, нашем, нашем(у), нашеми, нашемоу, нашему, нашемъ, нашемь, нашемѹ, нашемꙋ, нашею, нашея, нашеѧ, наши, наши(м), нашим, нашими, нашимъ, нашимь, наших, нашихъ, нашихь, нашо, нашог(о), нашого, нашое, нашои, нашой, нашом, нашому, нашомъ, нашомꙋ, нашою, нашоі, нашу, нашъ, нашы, нашые, нашым, нашыми, нашымъ, нашых, нашыхъ, нашь, нашю, нашіимъ, нашѣ, нашѣг(о), нашѣго, нашѣи, нашѣм, нашѣмоу, нашѣю, нашѹ, нашꙋ, наща, нш̃ꙋ, ши, шыми.

The 2nd highest number of forms (99) was observed with the lemma “весь”: (в)се(м)ъ, (в)сѣ, (в)сѣхъ, вве(с), ввесь, вес(ь), весь, вс[ѣ], вс]ими, все, всег(о), всего, всее, всеи, всей, всем, всем(и), всеми, всемоу, всему, всемъ, всемь, всемꙋ, всех, всехъ, всею, всея, всеѧ, вси, вси(м̑), всим, всими, всимъ, всих, всихъ, всьмъ, всю, вся, всями, всѣ, всѣ(х), всѣг(о), всѣго, всѣе, всѣи, всѣм, всѣми, всѣмоу, всѣму, всѣмъ, всѣмь, всѣмꙋ, всѣх, всѣхъ, всѣю, всѧ, всѧѧ, въсе, въсего, въсеи, въсему, въсемъ, въсею, въсея, въси, въсим, въсими, въсимъ, въсихъ, въсѣми, въсѣмъ, въсѣмь, въсѣхь, въсѣю, вьсимъ, вѣсь, оусее, оусем, оусеми, оусемъ, оусехъ, оусею, оуси, оусимъ, оусихь, оусю, оусѣхъ, оусѧ, уси, усими, усимъ, усих, усихъ, усы, усю, ꙋсе, ꙋсее, ꙋси, ꙋсих.

The 3rd highest number of forms (57) was observed with the lemma “вашъ”: в[а]шем, в[аши], ваш, ваш(а), ваш(е)и, ваш[(а), ваш[е]и, ваш]а, ваша, ваше, вашег(о), вашего, вашее, вашеи, вашем, вашему, вашемꙋ, вашею, вашеѣ, ваши, ваши(х), вашим, вашими, вашимъ, ваших, вашихъ, вашо, вашог(о), вашого, вашое, вашои, вашой, вашом, вашомоу, вашому, вашомъ, вашомꙋ, вашоѣ, вашу, вашъ, вашь, вашьх, вашю, вашя, вашѣ, вашѣг(о), вашѣго, вашѣе, вашѣи, вашѣмоу, вашѣму, вашѣмъ, вашѣмꙋ, вашѣю, вашꙋ, всѧкоe, вяшя.

DET occurs with 10 features: Case (8139; 100% instances), Number (8139; 100% instances), Gender (8136; 100% instances), PronType (8096; 99% instances), Poss (3449; 42% instances), Reflex (812; 10% instances), Animacy (202; 2% instances), Person (47; 1% instances), Variant (7; 0% instances), Typo (1; 0% instances)

DET occurs with 29 feature-value pairs: Animacy=Anim, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Dem, PronType=Emp, PronType=Exc, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Typo=Yes, Variant=Short

DET occurs with 299 feature combinations. The most frequent feature combination is Case=Dat|Gender=Masc|Number=Plur|Poss=Yes|PronType=Prs (287 tokens). Examples: нашим, нашимъ, вашим, нашым, вашимъ, н(а)шимъ, моимъ, моим, нашымъ, н(а)шим

Relations

DET nodes are attached to their parents using 21 different relations: det (7101; 87% instances), obl (281; 3% instances), nsubj (269; 3% instances), obj (152; 2% instances), conj (108; 1% instances), iobj (89; 1% instances), nmod (48; 1% instances), nsubj:pass (25; 0% instances), obl:float (13; 0% instances), orphan (10; 0% instances), amod (9; 0% instances), appos (7; 0% instances), reparandum (6; 0% instances), root (6; 0% instances), advcl (5; 0% instances), parataxis (3; 0% instances), fixed (2; 0% instances), xcomp (2; 0% instances), advmod (1; 0% instances), dislocated (1; 0% instances), expl (1; 0% instances)

Parents of DET nodes belong to 11 different parts of speech: NOUN (6780; 83% instances), VERB (757; 9% instances), PROPN (190; 2% instances), PRON (177; 2% instances), ADJ (147; 2% instances), DET (61; 1% instances), ADV (10; 0% instances), NUM (8; 0% instances), (6; 0% instances), X (2; 0% instances), PART (1; 0% instances)

7473 (92%) DET nodes are leaves.

517 (6%) DET nodes have one child.

90 (1%) DET nodes have two children.

59 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 6.

Children of DET nodes are attached using 25 different relations: advmod (242; 27% instances), case (214; 24% instances), cc (107; 12% instances), punct (84; 9% instances), acl:relcl (63; 7% instances), conj (45; 5% instances), nmod (35; 4% instances), det (24; 3% instances), appos (18; 2% instances), acl (16; 2% instances), orphan (11; 1% instances), cop (9; 1% instances), nsubj (9; 1% instances), reparandum (5; 1% instances), amod (4; 0% instances), dislocated (4; 0% instances), fixed (4; 0% instances), obl (4; 0% instances), discourse (3; 0% instances), csubj (2; 0% instances), mark (2; 0% instances), advcl (1; 0% instances), dep (1; 0% instances), nummod:gov (1; 0% instances), parataxis:discourse (1; 0% instances)

Children of DET nodes belong to 15 different parts of speech: PART (223; 25% instances), ADP (208; 23% instances), CCONJ (107; 12% instances), PUNCT (84; 9% instances), VERB (68; 7% instances), DET (61; 7% instances), NOUN (54; 6% instances), PRON (26; 3% instances), ADJ (25; 3% instances), ADV (25; 3% instances), AUX (10; 1% instances), PROPN (7; 1% instances), SCONJ (7; 1% instances), NUM (3; 0% instances), X (1; 0% instances)