home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-TOROT: POS Tags: DET

There are 35 DET lemmas (0%), 391 DET types (1%) and 2786 DET tokens (2%). Out of 14 observed tags, the rank of DET is: 10 in number of lemmas, 6 in number of types and 11 in number of tokens.

The 10 most frequent DET lemmas: вьсь, сии, тыиже, тыи, иныи, самъ, сииже, онъ, оныи, нѣкыи

The 10 most frequent DET types: всѧ, то, вси, сего, тож҃, того, се, все, всю, томьж҃

The 10 most frequent ambiguous lemmas: вьсь (DET 903, ADJ 302, NOUN 4), сии (ADJ 671, DET 457), тыиже (DET 455, ADJ 29), тыи (ADJ 744, DET 449), иныи (DET 171, ADJ 126), самъ (ADJ 228, DET 70), сииже (DET 64, ADJ 3), онъ (ADJ 428, DET 37), оныи (DET 37, ADJ 27), нѣкыи (DET 21, ADJ 3)

The 10 most frequent ambiguous types: всѧ (DET 128, ADJ 15), то (ADV 395, ADJ 174, DET 110), вси (DET 102, ADJ 59, NOUN 1), сего (DET 100, ADJ 90), того (ADJ 128, DET 69), се (INTJ 359, ADJ 214, DET 66), все (DET 58, ADJ 48, ADV 6), всю (DET 57, ADJ 4), тъ (DET 56, ADJ 29, ADV 6, PRON 1), си (ADJ 107, PRON 74, DET 48, ADV 5, INTJ 4, AUX 2)

Morphology

The form / lemma ratio of DET is 11.171429 (the average of all parts of speech is 3.571475).

The 1st highest number of forms (64) was observed with the lemma “вьсь”: в[сѧ, вес, весь, вохи, вохо, все, всего, всее, всеи, всей, всем, всему, всемъ, всемь, всемѹ, всемꙋ, всехъ, всею, всея, всеі, всеѧ, всеꙗ, вси, всихъ, всхѣ, всь, всю, вся, всѣ, всѣго, всѣи, всѣм, всѣмʼ, всѣми, всѣмъ, всѣмь, всѣмѹ, всѣм҃, всѣх, всѣхʼ, всѣхъ, всѣх҃, всѣю, всѧ, всꙗ, въсь, въсю, вьсе, вьсего, вьсеи, вьсемь, вьсею, вьсеꙗ, вьси, вьсь, вьсю, вьсѣ, вьсѣми, вьсѣмъ, вьсѣмь, вьсѣхъ, вьсѧ, вьсꙗ, вѣ.

The 2nd highest number of forms (48) was observed with the lemma “тыиже”: Тоѥ, Тоѥж, тг҃оже, ти, то, того, тогож, тогоже, тогож҃, тогѡ, тог҃же, тое, тоеже, тож, тоже, тож҃, тои, тоиже, тоиж҃, том, томже, тому, томуже, томъже, томь, томьж, томьже, томьж҃, томѹ, томѹж҃, том҃же, тояже, тоѥже, ту, туже, тъ, тъж, тъж҇, тѣ, тѣже, тѣм, тѣмже, тѹже, тѹж҃, тѹюже, т҃же, т҃ож, ѿ.

The 3rd highest number of forms (44) was observed with the lemma “тыи”: Тіи, та, техъ, тех҃, ти, тихъ, тию, то, тог, того, тогѡ, тог҃, тое, тои, том, томо, тому, томъ, томь, томѹ, томꙋ, тою, тоя, тоѣ, тоѥ, тоѧ, тоꙗ, ту, тъ, тъй, ты, тыи, тѡг, тѣ, тѣм, тѣми, тѣмъ, тѣмь, тѣх, тѣхъ, тѣ҃м, тѹ, тꙑ, тꙑи.

DET occurs with 3 features: Case (2786; 100% instances), Number (2786; 100% instances), Gender (2784; 100% instances)

DET occurs with 12 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Dual, Number=Plur, Number=Sing

DET occurs with 43 feature combinations. The most frequent feature combination is Case=Acc|Gender=Neut|Number=Sing (387 tokens). Examples: тож҃, то, се, тоже, сеже, все, ино, тож, всѣ, сеж

Relations

DET nodes are attached to their parents using 1 different relations: det (2786; 100% instances)

Parents of DET nodes belong to 9 different parts of speech: NOUN (2469; 89% instances), ADJ (160; 6% instances), PROPN (79; 3% instances), PRON (39; 1% instances), VERB (26; 1% instances), AUX (7; 0% instances), NUM (4; 0% instances), ADV (1; 0% instances), DET (1; 0% instances)

2596 (93%) DET nodes are leaves.

185 (7%) DET nodes have one child.

4 (0%) DET nodes have two children.

1 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 3.

Children of DET nodes are attached using 11 different relations: flat (123; 63% instances), discourse (32; 16% instances), advmod (23; 12% instances), case (8; 4% instances), nmod (3; 2% instances), acl (2; 1% instances), amod (1; 1% instances), cc (1; 1% instances), conj (1; 1% instances), det (1; 1% instances), nsubj (1; 1% instances)

Children of DET nodes belong to 8 different parts of speech: ADJ (124; 63% instances), ADV (56; 29% instances), ADP (8; 4% instances), PRON (3; 2% instances), VERB (2; 1% instances), CCONJ (1; 1% instances), DET (1; 1% instances), NOUN (1; 1% instances)