home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Ukrainian-IU: POS Tags: DET

There are 63 DET lemmas (0%), 355 DET types (1%) and 4672 DET tokens (4%). Out of 17 observed tags, the rank of DET is: 10 in number of lemmas, 8 in number of types and 9 in number of tokens.

The 10 most frequent DET lemmas: який, цей, той, свій, такий, весь, інший, один, його, наш

The 10 most frequent DET types: його, які, всі, який, її, яка, той, кілька, цей, цього

The 10 most frequent ambiguous lemmas: весь (DET 281, PRON 1), один (DET 222, NUM 84, ADJ 8), її (DET 93, PRON 1), багато (DET 65, ADV 44), самий (DET 54, ADJ 1), другий (ADJ 55, DET 27), т. (ADV 10, DET 5, PRON 4, NOUN 3, ADJ 1), стільки (ADV 7, DET 4), скільки (ADV 11, DET 3), і. (DET 2, NOUN 1)

The 10 most frequent ambiguous types: його (PRON 202, DET 178, NOUN 1), її (PRON 107, DET 87), цього (DET 66, PRON 38), один (DET 43, NUM 29, ADJ 1), багато (DET 45, ADV 41), того (PRON 95, DET 47, ADV 3), одного (DET 31, NUM 9, ADJ 2), їх (PRON 129, DET 31), самі (DET 31, ADJ 1), все (PRON 76, DET 26, ADV 18, PART 13)

Morphology

The form / lemma ratio of DET is 5.634921 (the average of all parts of speech is 1.738999).

The 1st highest number of forms (16) was observed with the lemma “один”: один, одна, одне, одним, одними, одних, одно, одного, одному, одною, одної, одну, одні, одній, однією, однієї.

The 2nd highest number of forms (16) was observed with the lemma “той”: та, те, тим, тими, тих, того, той, тому, тою, тої, ту, ті, тій, тім, тією, тієї.

The 3rd highest number of forms (15) was observed with the lemma “свій”: свого, свойого, свому, свою, своя, своє, своєму, своєю, своєї, свої, своїй, своїм, своїми, своїх, свій.

DET occurs with 12 features: Case (4672; 100% instances), PronType (4672; 100% instances), Number (4480; 96% instances), Gender (2994; 64% instances), Poss (1153; 25% instances), Person (785; 17% instances), Animacy (657; 14% instances), Reflex (534; 11% instances), Uninflect (334; 7% instances), NumType (192; 4% instances), Variant (32; 1% instances), Abbr (13; 0% instances)

DET occurs with 30 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Card, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Uninflect=Yes, Variant=Short

DET occurs with 282 feature combinations. The most frequent feature combination is Case=Nom|Number=Plur|PronType=Rel (131 tokens). Examples: які, котрі

Relations

DET nodes are attached to their parents using 31 different relations: det (3203; 69% instances), nsubj (470; 10% instances), obl (284; 6% instances), obj (183; 4% instances), det:numgov (146; 3% instances), nmod (74; 2% instances), conj (64; 1% instances), root (51; 1% instances), advmod:det (43; 1% instances), det:nummod (35; 1% instances), flat:abs (26; 1% instances), fixed (10; 0% instances), iobj (10; 0% instances), xcomp:pred (10; 0% instances), acl (8; 0% instances), advcl (8; 0% instances), parataxis (8; 0% instances), appos (7; 0% instances), ccomp (7; 0% instances), orphan (5; 0% instances), advcl:pred (4; 0% instances), csubj (2; 0% instances), dislocated (2; 0% instances), expl (2; 0% instances), flat:repeat (2; 0% instances), flat:title (2; 0% instances), mark (2; 0% instances), flat:sibl (1; 0% instances), parataxis:discourse (1; 0% instances), parataxis:rel (1; 0% instances), xcomp (1; 0% instances)

Parents of DET nodes belong to 11 different parts of speech: NOUN (3359; 72% instances), VERB (884; 19% instances), ADJ (134; 3% instances), PRON (92; 2% instances), DET (65; 1% instances), (51; 1% instances), PROPN (44; 1% instances), ADV (17; 0% instances), X (11; 0% instances), NUM (8; 0% instances), ADP (7; 0% instances)

4001 (86%) DET nodes are leaves.

456 (10%) DET nodes have one child.

89 (2%) DET nodes have two children.

126 (3%) DET nodes have three or more children.

The highest child degree of a DET node is 9.

Children of DET nodes are attached using 32 different relations: case (229; 20% instances), punct (182; 16% instances), nmod (102; 9% instances), discourse (98; 9% instances), acl:relcl (87; 8% instances), advmod (68; 6% instances), cc (68; 6% instances), nsubj (65; 6% instances), conj (41; 4% instances), flat:abs (33; 3% instances), cop (31; 3% instances), mark (21; 2% instances), appos (16; 1% instances), advcl (13; 1% instances), orphan (13; 1% instances), acl (12; 1% instances), obl (12; 1% instances), parataxis (10; 1% instances), flat:sibl (8; 1% instances), det (7; 1% instances), fixed (5; 0% instances), expl (4; 0% instances), flat:repeat (3; 0% instances), acl:adv (2; 0% instances), nummod:gov (2; 0% instances), amod (1; 0% instances), aux (1; 0% instances), csubj (1; 0% instances), det:numgov (1; 0% instances), flat:title (1; 0% instances), parataxis:discourse (1; 0% instances), vocative (1; 0% instances)

Children of DET nodes belong to 14 different parts of speech: ADP (230; 20% instances), PUNCT (182; 16% instances), NOUN (164; 14% instances), PART (119; 10% instances), VERB (104; 9% instances), CCONJ (67; 6% instances), DET (65; 6% instances), ADV (60; 5% instances), PRON (49; 4% instances), ADJ (32; 3% instances), AUX (32; 3% instances), SCONJ (23; 2% instances), PROPN (9; 1% instances), NUM (3; 0% instances)