home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Belarusian-HSE: POS Tags: DET

There are 67 DET lemmas (0%), 307 DET types (1%) and 6730 DET tokens (2%). Out of 17 observed tags, the rank of DET is: 11 in number of lemmas, 9 in number of types and 11 in number of tokens.

The 10 most frequent DET lemmas: наш, гэты, свой, увесь, той, такі, іншы, адзін, самы, які

The 10 most frequent DET types: яго, гэты, наш, яе, свой, іх, сваю, тым, той, свае

The 10 most frequent ambiguous lemmas: гэты (DET 801, PRON 3, ADJ 1), свой (DET 784, ADJ 1), увесь (DET 583, PRON 2), той (DET 476, PRON 3), такі (DET 450, PART 10), адзін (DET 352, NUM 206), самы (DET 316, ADJ 4), які (PRON 1631, DET 217), яго (DET 202, PRON 2), кожны (DET 192, ADJ 3)

The 10 most frequent ambiguous types: яго (PRON 278, DET 191), наш (DET 119, PRON 1), яе (PRON 151, DET 125), іх (PRON 268, DET 114), тым (DET 90, PRON 81, SCONJ 5), кожны (DET 71, ADJ 1), усе (DET 46, PRON 21), адзін (NUM 92, DET 72), гэтым (DET 81, PRON 71), ўсе (DET 88, PRON 14, ADV 1)

Morphology

The form / lemma ratio of DET is 4.582090 (the average of all parts of speech is 1.756638).

The 1st highest number of forms (25) was observed with the lemma “увесь”: увесь, усiх, усе, усю, уся, усяго, усяе, усяму, усё, усёй, усі, усім, усімі, усіх, ўвесь, ўсе, ўсю, ўся, ўсяго, ўсяму, ўсё, ўсёй, ўсім, ўсімі, ўсіх.

The 2nd highest number of forms (19) was observed with the lemma “наш”: Наше, Ніша, н., нам, наш, наша, нашага, нашае, нашай, нашаму, нашая, нашу, нашую, нашы, нашым, нашымі, нашых, нашыя, 👏🏼Нашыя.

The 3rd highest number of forms (19) was observed with the lemma “іншы”: iншая, iншую, iншым, iншымi, iншых, iншыя, інш., іншага, іншае, іншай, іншаму, іншая, інше, іншую, іншы, іншым, іншымі, іншых, іншыя.

DET occurs with 11 features: PronType (6682; 99% instances), Case (6218; 92% instances), Number (6215; 92% instances), Gender (4289; 64% instances), Poss (2540; 38% instances), Animacy (1385; 21% instances), Reflex (767; 11% instances), Degree (44; 1% instances), Abbr (30; 0% instances), NumType (6; 0% instances), Typo (3; 0% instances)

DET occurs with 28 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Pos, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Card, Number=Plur, Number=Sing, Poss=Yes, PronType=Dem, PronType=Emp, PronType=Exc, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Typo=Yes

DET occurs with 261 feature combinations. The most frequent feature combination is Poss=Yes|PronType=Prs (482 tokens). Examples: яго, яе, іх, iх, мае

Relations

DET nodes are attached to their parents using 27 different relations: det (5737; 85% instances), nsubj (208; 3% instances), obl (142; 2% instances), conj (136; 2% instances), root (113; 2% instances), obj (86; 1% instances), nmod (78; 1% instances), fixed (43; 1% instances), acl (41; 1% instances), iobj (39; 1% instances), xcomp (33; 0% instances), appos (18; 0% instances), parataxis (16; 0% instances), case (7; 0% instances), ccomp (7; 0% instances), nsubj:pass (6; 0% instances), acl:relcl (3; 0% instances), advcl (3; 0% instances), csubj (3; 0% instances), advmod (2; 0% instances), amod (2; 0% instances), orphan (2; 0% instances), dep (1; 0% instances), flat (1; 0% instances), list (1; 0% instances), mark (1; 0% instances), vocative (1; 0% instances)

Parents of DET nodes belong to 15 different parts of speech: NOUN (5178; 77% instances), VERB (551; 8% instances), ADJ (370; 5% instances), PROPN (178; 3% instances), PRON (132; 2% instances), (113; 2% instances), DET (76; 1% instances), X (42; 1% instances), ADP (29; 0% instances), ADV (21; 0% instances), NUM (21; 0% instances), PART (8; 0% instances), SYM (5; 0% instances), AUX (4; 0% instances), SCONJ (2; 0% instances)

5750 (85%) DET nodes are leaves.

600 (9%) DET nodes have one child.

197 (3%) DET nodes have two children.

183 (3%) DET nodes have three or more children.

The highest child degree of a DET node is 9.

Children of DET nodes are attached using 28 different relations: punct (279; 16% instances), advmod (266; 16% instances), nmod (258; 15% instances), case (162; 10% instances), acl:relcl (161; 9% instances), cc (140; 8% instances), nsubj (104; 6% instances), conj (69; 4% instances), obl (43; 3% instances), cop (41; 2% instances), fixed (38; 2% instances), appos (28; 2% instances), det (28; 2% instances), parataxis (28; 2% instances), acl (8; 0% instances), advcl (7; 0% instances), mark (7; 0% instances), amod (5; 0% instances), discourse (5; 0% instances), dep (4; 0% instances), expl (4; 0% instances), orphan (3; 0% instances), csubj (2; 0% instances), dislocated (2; 0% instances), flat (2; 0% instances), iobj (2; 0% instances), nummod (1; 0% instances), vocative (1; 0% instances)

Children of DET nodes belong to 16 different parts of speech: NOUN (328; 19% instances), PUNCT (279; 16% instances), VERB (183; 11% instances), PART (174; 10% instances), ADP (165; 10% instances), CCONJ (138; 8% instances), ADV (108; 6% instances), DET (76; 4% instances), PRON (72; 4% instances), ADJ (45; 3% instances), AUX (42; 2% instances), PROPN (36; 2% instances), SCONJ (21; 1% instances), X (14; 1% instances), NUM (9; 1% instances), SYM (8; 0% instances)