
This page pertains to UD version 2.

Treebank Statistics: UD_Belarusian-HSE: POS Tags: DET

There are 70 DET lemmas (0%), 310 DET types (1%) and 6731 DET tokens (2%). Out of 17 observed tags, the rank of DET is: 12 in number of lemmas, 9 in number of types and 11 in number of tokens.

The 10 most frequent DET lemmas: наш, гэты, свой, увесь, той, такі, іншы, адзін, самы, які

The 10 most frequent DET types: яго, гэты, наш, яе, свой, сваю, іх, той, тым, свае

The 10 most frequent ambiguous lemmas: гэты (DET 801, PRON 3, ADJ 1), свой (DET 784, ADJ 1), увесь (DET 581, PRON 3), той (DET 475, PRON 5), такі (DET 450, PART 10), адзін (DET 352, NUM 206), самы (DET 316, ADJ 4), які (PRON 1630, DET 218), яго (DET 202, PRON 2), кожны (DET 192, ADJ 3)

The 10 most frequent ambiguous types: яго (PRON 278, DET 191), наш (DET 119, PRON 1), яе (PRON 151, DET 125), іх (PRON 269, DET 113), тым (DET 90, PRON 81, SCONJ 5), кожны (DET 71, ADJ 1), усе (DET 46, PRON 21), адзін (NUM 92, DET 72), гэтым (DET 81, PRON 71), ўсе (DET 88, PRON 14, ADV 1)

Morphology

The form / lemma ratio of DET is 4.428571 (the average of all parts of speech is 1.754875).
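The ratio above can be reproduced from the counts at the top of this page. The following is a minimal sketch, assuming that "forms" here means the 310 DET types and that the ratio is simply types divided by lemmas; the function name is illustrative and not part of any UD tool.

```python
# Counts taken from the summary at the top of this page.
det_types = 310   # distinct DET word forms (types)
det_lemmas = 70   # distinct DET lemmas


def form_lemma_ratio(n_forms: int, n_lemmas: int) -> float:
    """Average number of distinct forms per lemma (hypothetical helper)."""
    return n_forms / n_lemmas


ratio = form_lemma_ratio(det_types, det_lemmas)
print(f"{ratio:.6f}")
```

Under that assumption the result agrees with the reported value to six decimal places.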

The highest number of forms (25) was observed with the lemma “увесь”: увесь, усiх, усе, усю, уся, усяго, усяе, усяму, усё, усёй, усі, усім, усімі, усіх, ўвесь, ўсе, ўсю, ўся, ўсяго, ўсяму, ўсё, ўсёй, ўсім, ўсімі, ўсіх.

The 2nd highest number of forms (19) was observed with the lemma “наш”: Наше, Ніша, н., нам, наш, наша, нашага, нашае, нашай, нашаму, нашая, нашу, нашую, нашы, нашым, нашымі, нашых, нашыя, 👏🏼Нашыя.

The 3rd highest number of forms (19) was observed with the lemma “іншы”: iншая, iншую, iншым, iншымi, iншых, iншыя, інш., іншага, іншае, іншай, іншаму, іншая, інше, іншую, іншы, іншым, іншымі, іншых, іншыя.

DET occurs with 12 features: PronType (6681; 99% instances), Case (6221; 92% instances), Number (6218; 92% instances), Gender (4294; 64% instances), Poss (2537; 38% instances), Animacy (1386; 21% instances), Reflex (767; 11% instances), Degree (44; 1% instances), Abbr (30; 0% instances), NumType (6; 0% instances), Typo (3; 0% instances), Person (2; 0% instances)

DET occurs with 29 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Pos, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Card, Number=Plur, Number=Sing, Person=3, Poss=Yes, PronType=Dem, PronType=Emp, PronType=Exc, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Typo=Yes

DET occurs with 265 feature combinations. The most frequent feature combination is Poss=Yes|PronType=Prs (480 tokens). Examples: яго, яе, іх, iх, мае
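Feature and feature-combination counts like those above can be derived from the FEATS column of a CoNLL-U file. The sketch below is one possible way to do it, not the script that generated this page. The sample lines are hypothetical and were written for illustration; they are not taken from the treebank.

```python
from collections import Counter

# Hypothetical CoNLL-U token lines (10 tab-separated columns), for illustration only.
SAMPLE = "\n".join([
    "# text = hypothetical example",
    "1\tяго\tяго\tDET\t_\tPoss=Yes|PronType=Prs\t2\tdet\t_\t_",
    "2\tдом\tдом\tNOUN\t_\tAnimacy=Inan|Case=Nom|Gender=Masc|Number=Sing\t0\troot\t_\t_",
    "",
    "1\tгэты\tгэты\tDET\t_\tCase=Nom|Gender=Masc|Number=Sing|PronType=Dem\t2\tdet\t_\t_",
])


def det_feature_counts(conllu_text):
    """Count feature names and full FEATS combinations on DET tokens."""
    features = Counter()
    combos = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # not a well-formed token line
        if "-" in cols[0] or "." in cols[0]:
            continue  # skip multiword token ranges and empty nodes
        if cols[3] != "DET":
            continue  # column 4 is UPOS
        feats = cols[5]  # column 6 is FEATS
        combos[feats] += 1
        if feats != "_":
            for pair in feats.split("|"):
                name, _, _value = pair.partition("=")
                features[name] += 1
    return features, combos


features, combos = det_feature_counts(SAMPLE)
print(features.most_common())
print(combos.most_common(1))
```

Counting the whole FEATS string as one key is what makes "Poss=Yes|PronType=Prs" a single combination rather than two separate features.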

Relations

DET nodes are attached to their parents using 27 different relations: det (5736; 85% instances), nsubj (208; 3% instances), obl (142; 2% instances), conj (137; 2% instances), root (113; 2% instances), obj (86; 1% instances), nmod (79; 1% instances), fixed (43; 1% instances), acl (41; 1% instances), iobj (38; 1% instances), xcomp (33; 0% instances), appos (19; 0% instances), parataxis (16; 0% instances), case (7; 0% instances), ccomp (7; 0% instances), nsubj:pass (6; 0% instances), acl:relcl (3; 0% instances), advcl (3; 0% instances), csubj (3; 0% instances), advmod (2; 0% instances), amod (2; 0% instances), orphan (2; 0% instances), dep (1; 0% instances), flat (1; 0% instances), list (1; 0% instances), mark (1; 0% instances), vocative (1; 0% instances)
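The percentages in the list above appear to be shares of the 6731 DET tokens, rounded to whole numbers; the 27 relation counts sum to exactly 6731. A minimal sketch of that arithmetic, using a handful of the counts from the list (the helper name is an assumption made for illustration):

```python
DET_TOKENS = 6731  # total DET tokens reported on this page

# A subset of the relation counts listed above.
RELATION_COUNTS = {
    "det": 5736,
    "nsubj": 208,
    "obl": 142,
    "conj": 137,
    "root": 113,
    "xcomp": 33,
}


def share_percent(count, total):
    """Share of total as a whole-number percentage (hypothetical helper)."""
    return round(100 * count / total)


shares = {rel: share_percent(n, DET_TOKENS) for rel, n in RELATION_COUNTS.items()}
for rel, pct in shares.items():
    print(f"{rel}: {pct}%")
```

This also explains why relations such as xcomp are shown as 0%: 33 of 6731 tokens is below half a percent.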

Parents of DET nodes belong to 15 different parts of speech: NOUN (5176; 77% instances), VERB (551; 8% instances), ADJ (369; 5% instances), PROPN (181; 3% instances), PRON (132; 2% instances), [no parent: root] (113; 2% instances), DET (77; 1% instances), X (42; 1% instances), ADP (29; 0% instances), ADV (21; 0% instances), NUM (21; 0% instances), PART (8; 0% instances), SYM (5; 0% instances), AUX (4; 0% instances), SCONJ (2; 0% instances)

5750 (85%) DET nodes are leaves.

599 (9%) DET nodes have one child.

198 (3%) DET nodes have two children.

184 (3%) DET nodes have three or more children.

The highest child degree of a DET node is 9.
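The child-degree figures above partition the DET tokens: 5750 + 599 + 198 + 184 = 6731. As a rough sketch of how such buckets could be computed from the ID and HEAD columns of one sentence, consider the following. The sentence is hypothetical and the function name is an assumption for illustration.

```python
from collections import Counter

# Hypothetical (id, head, upos) triples for one sentence; head 0 marks the root.
SENTENCE = [
    (1, 2, "DET"),
    (2, 0, "NOUN"),
    (3, 4, "ADV"),
    (4, 2, "DET"),
    (5, 4, "PUNCT"),
]


def det_child_degree_buckets(sentence):
    """Bucket DET nodes by number of children: '0', '1', '2' or '3+'."""
    # Count how many nodes name each id as their head.
    children = Counter(head for _id, head, _upos in sentence if head != 0)
    buckets = Counter()
    for node_id, _head, upos in sentence:
        if upos != "DET":
            continue
        degree = children[node_id]  # Counter returns 0 for leaves
        label = "3+" if degree >= 3 else str(degree)
        buckets[label] += 1
    return buckets


buckets = det_child_degree_buckets(SENTENCE)
print(dict(buckets))
```

In this toy sentence, node 1 is a leaf and node 4 has two children, so one DET falls in the "0" bucket and one in the "2" bucket.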

Children of DET nodes are attached using 28 different relations: punct (281; 17% instances), advmod (266; 16% instances), nmod (258; 15% instances), case (162; 10% instances), acl:relcl (160; 9% instances), cc (140; 8% instances), nsubj (105; 6% instances), conj (69; 4% instances), obl (44; 3% instances), cop (41; 2% instances), fixed (39; 2% instances), appos (28; 2% instances), det (28; 2% instances), parataxis (28; 2% instances), acl (8; 0% instances), advcl (7; 0% instances), mark (7; 0% instances), amod (5; 0% instances), discourse (5; 0% instances), dep (4; 0% instances), expl (4; 0% instances), orphan (3; 0% instances), csubj (2; 0% instances), dislocated (2; 0% instances), flat (2; 0% instances), iobj (2; 0% instances), nummod (1; 0% instances), vocative (1; 0% instances)

Children of DET nodes belong to 16 different parts of speech: NOUN (328; 19% instances), PUNCT (281; 17% instances), VERB (182; 11% instances), PART (174; 10% instances), ADP (166; 10% instances), CCONJ (138; 8% instances), ADV (108; 6% instances), DET (77; 5% instances), PRON (72; 4% instances), ADJ (45; 3% instances), AUX (42; 2% instances), PROPN (37; 2% instances), SCONJ (21; 1% instances), X (14; 1% instances), NUM (9; 1% instances), SYM (8; 0% instances)