home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Ukrainian-ParlaMint: POS Tags: DET

There are 41 DET lemmas (1%), 216 DET types (2%) and 2286 DET tokens (4%). Out of 16 observed tags, the rank of DET is: 9 in number of lemmas, 7 in number of types and 8 in number of tokens.

The 10 most frequent DET lemmas: цей, який, наш, весь, той, такий, свій, інший, кожний, ваш

The 10 most frequent DET types: які, цього, цей, який, яка, всі, це, цю, всіх, ця

The 10 most frequent ambiguous lemmas: цей (DET 520, PRON 3), який (DET 516, PRON 13), весь (DET 173, PRON 4), той (DET 155, PRON 6), такий (DET 126, PRON 2), багато (DET 27, ADV 9), один (NUM 53, DET 27), увесь (DET 27, NOUN 1), його (DET 25, NOUN 1), їх (DET 14, PRON 1)

The 10 most frequent ambiguous types: які (DET 220, PRON 1), цього (DET 101, PRON 47), який (DET 96, PRON 1), яка (DET 72, PRON 5), всі (DET 64, PRON 5), це (PRON 269, DET 54, PART 1), всіх (DET 50, PRON 2), ті (DET 30, PRON 1), цьому (DET 32, PRON 10), тих (DET 29, PRON 1)

Morphology

The form / lemma ratio of DET is 5.268293 (the average of all parts of speech is 1.786380).

The 1st highest number of forms (13) was observed with the lemma “наш”: наш, наша, наше, нашим, нашими, наших, нашого, нашому, нашою, нашої, нашу, наші, нашій.

The 2nd highest number of forms (13) was observed with the lemma “свій”: Своя, свого, свою, своє, своєму, своєю, своєї, свої, своїй, своїм, своїми, своїх, свій.

The 3rd highest number of forms (13) was observed with the lemma “той”: та, те, тим, тими, тих, того, той, тому, ту, ті, тій, тією, тієї.

DET occurs with 13 features: Case (2281; 100% instances), PronType (2274; 99% instances), Number (2242; 98% instances), Gender (1383; 60% instances), Poss (450; 20% instances), Animacy (360; 16% instances), Person (314; 14% instances), Reflex (134; 6% instances), InflClass (50; 2% instances), NumType (41; 2% instances), BadStyle (12; 1% instances), Variant (11; 0% instances), Typo (7; 0% instances)

DET occurs with 30 feature-value pairs: Animacy=Anim, Animacy=Inan, BadStyle=Yes, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, InflClass=Ind, NumType=Card, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Typo=Yes, Variant=Short

DET occurs with 233 feature combinations. The most frequent feature combination is Case=Nom|Number=Plur|PronType=Rel (183 tokens). Examples: які, котрі

Relations

DET nodes are attached to their parents using 27 different relations: det (1629; 71% instances), nsubj (332; 15% instances), obj (110; 5% instances), obl (66; 3% instances), det:numgov (31; 1% instances), nmod (24; 1% instances), nsubj:pass (17; 1% instances), conj (14; 1% instances), orphan (8; 0% instances), root (8; 0% instances), det:nummod (7; 0% instances), advcl (5; 0% instances), fixed (5; 0% instances), iobj (5; 0% instances), obl:arg (4; 0% instances), ccomp (3; 0% instances), acl:relcl (2; 0% instances), advmod (2; 0% instances), appos (2; 0% instances), dislocated (2; 0% instances), obl:agent (2; 0% instances), parataxis (2; 0% instances), xcomp (2; 0% instances), acl (1; 0% instances), advmod:det (1; 0% instances), amod (1; 0% instances), reparandum (1; 0% instances)

Parents of DET nodes belong to 13 different parts of speech: NOUN (1650; 72% instances), VERB (464; 20% instances), ADJ (75; 3% instances), PRON (44; 2% instances), DET (14; 1% instances), PROPN (13; 1% instances), (8; 0% instances), ADV (7; 0% instances), AUX (5; 0% instances), ADP (3; 0% instances), PART (1; 0% instances), SCONJ (1; 0% instances), X (1; 0% instances)

2081 (91%) DET nodes are leaves.

162 (7%) DET nodes have one child.

21 (1%) DET nodes have two children.

22 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 6.

Children of DET nodes are attached using 23 different relations: case (72; 25% instances), punct (27; 9% instances), discourse (25; 9% instances), nmod (23; 8% instances), acl:relcl (18; 6% instances), cc (18; 6% instances), acl (16; 6% instances), advmod (15; 5% instances), conj (14; 5% instances), nsubj (12; 4% instances), mark (9; 3% instances), appos (8; 3% instances), advmod:neg (6; 2% instances), cop (6; 2% instances), fixed (4; 1% instances), det (3; 1% instances), orphan (3; 1% instances), parataxis (3; 1% instances), advmod:emph (2; 1% instances), obl (2; 1% instances), vocative (2; 1% instances), amod (1; 0% instances), expl (1; 0% instances)

Children of DET nodes belong to 13 different parts of speech: ADP (72; 25% instances), NOUN (40; 14% instances), VERB (34; 12% instances), PART (31; 11% instances), PUNCT (27; 9% instances), ADV (18; 6% instances), CCONJ (18; 6% instances), PRON (15; 5% instances), DET (14; 5% instances), SCONJ (10; 3% instances), AUX (6; 2% instances), ADJ (4; 1% instances), PROPN (1; 0% instances)