home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Ukrainian-ParlaMint: POS Tags: DET

There are 50 DET lemmas (1%), 256 DET types (2%) and 4269 DET tokens (4%). Out of 16 observed tags, the rank of DET is: 9 in number of lemmas, 7 in number of types and 8 in number of tokens.

The 10 most frequent DET lemmas: цей, який, наш, весь, той, такий, свій, інший, кожний, ваш

The 10 most frequent DET types: які, цього, який, всі, цей, яка, всіх, цю, ці, це

The 10 most frequent ambiguous lemmas: цей (DET 885, PRON 8), який (DET 800, PRON 201), весь (DET 356, PRON 21), той (DET 325, PRON 25), такий (DET 297, PRON 25), інший (DET 166, ADJ 5, PRON 2), кожний (DET 84, PRON 2), увесь (DET 48, NOUN 1), його (DET 47, NOUN 1), один (NUM 146, DET 42, PRON 25)

The 10 most frequent ambiguous types: які (DET 321, PRON 82, X 1), цього (DET 162, PRON 75), який (DET 144, PRON 35), всі (DET 130, PRON 12), цей (DET 113, PRON 1), яка (DET 112, PRON 32), всіх (DET 91, PRON 10), це (PRON 533, DET 81, PART 31), тих (DET 69, PRON 10), ті (DET 64, PRON 7)

Morphology

The form / lemma ratio of DET is 5.120000 (the average of all parts of speech is 1.931827).

The 1st highest number of forms (15) was observed with the lemma “той”: та, те, тим, тими, тих, того, той, том, тому, тої, ту, ті, тій, тією, тієї.

The 2nd highest number of forms (13) was observed with the lemma “весь”: весь, все, всього, всьому, всю, вся, всі, всій, всім, всіма, всіх, всією, всієї.

The 3rd highest number of forms (13) was observed with the lemma “наш”: наш, наша, наше, нашим, нашими, наших, нашого, нашому, нашою, нашої, нашу, наші, нашій.

DET occurs with 16 features: PronType (4269; 100% instances), Case (4267; 100% instances), Number (4200; 98% instances), Gender (2589; 61% instances), Poss (908; 21% instances), Person (661; 15% instances), Animacy (633; 15% instances), Reflex (231; 5% instances), InflClass (101; 2% instances), NumType (69; 2% instances), BadStyle (31; 1% instances), Polite (22; 1% instances), ExtPos (21; 0% instances), Variant (19; 0% instances), Typo (12; 0% instances), Style (1; 0% instances)

DET occurs with 39 feature-value pairs: Animacy=Anim, Animacy=Inan, BadStyle=Yes, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, ExtPos=ADV, ExtPos=DET, ExtPos=PRON, Gender=Fem, Gender=Masc, Gender=Neut, InflClass=Ind, NumType=Card, Number=Plur, Number=Ptan, Number=Sing, Person=1, Person=2, Person=3, Polite=Form, Poss=Yes, PronType=Dem, PronType=Emp, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rcp, PronType=Rel, PronType=Tot, Reflex=Yes, Style=Arch, Typo=Yes, Variant=Short

DET occurs with 311 feature combinations. The most frequent feature combination is Case=Nom|Number=Plur|PronType=Rel (269 tokens). Examples: які, котрі

Relations

DET nodes are attached to their parents using 27 different relations: det (3155; 74% instances), nsubj (500; 12% instances), obj (149; 3% instances), obl (115; 3% instances), nmod (60; 1% instances), fixed (56; 1% instances), det:numgov (51; 1% instances), conj (47; 1% instances), nsubj:pass (30; 1% instances), amod (20; 0% instances), root (18; 0% instances), det:nummod (13; 0% instances), orphan (13; 0% instances), iobj (12; 0% instances), appos (5; 0% instances), advcl (4; 0% instances), obl:arg (4; 0% instances), parataxis (4; 0% instances), acl:relcl (2; 0% instances), advmod (2; 0% instances), dislocated (2; 0% instances), obl:agent (2; 0% instances), acl (1; 0% instances), advmod:det (1; 0% instances), ccomp (1; 0% instances), reparandum (1; 0% instances), xcomp (1; 0% instances)

Parents of DET nodes belong to 14 different parts of speech: NOUN (3170; 74% instances), VERB (697; 16% instances), ADJ (122; 3% instances), PRON (89; 2% instances), PROPN (67; 2% instances), ADP (42; 1% instances), DET (42; 1% instances), (18; 0% instances), ADV (12; 0% instances), AUX (5; 0% instances), X (2; 0% instances), NUM (1; 0% instances), PART (1; 0% instances), SCONJ (1; 0% instances)

3914 (92%) DET nodes are leaves.

268 (6%) DET nodes have one child.

49 (1%) DET nodes have two children.

38 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 15.

Children of DET nodes are attached using 27 different relations: case (108; 21% instances), punct (55; 11% instances), cc (54; 11% instances), discourse (46; 9% instances), conj (41; 8% instances), acl:relcl (32; 6% instances), fixed (28; 6% instances), acl (27; 5% instances), advmod (25; 5% instances), nmod (16; 3% instances), nsubj (15; 3% instances), advmod:neg (11; 2% instances), appos (8; 2% instances), parataxis (6; 1% instances), advmod:emph (5; 1% instances), det (5; 1% instances), mark (5; 1% instances), obl (5; 1% instances), orphan (4; 1% instances), cop (3; 1% instances), flat (2; 0% instances), reparandum (2; 0% instances), advcl (1; 0% instances), amod (1; 0% instances), aux (1; 0% instances), goeswith (1; 0% instances), vocative (1; 0% instances)

Children of DET nodes belong to 15 different parts of speech: ADP (109; 21% instances), PART (66; 13% instances), NOUN (56; 11% instances), PUNCT (55; 11% instances), VERB (55; 11% instances), CCONJ (51; 10% instances), DET (42; 8% instances), ADV (32; 6% instances), PRON (14; 3% instances), ADJ (10; 2% instances), PROPN (6; 1% instances), SCONJ (6; 1% instances), AUX (4; 1% instances), NUM (1; 0% instances), X (1; 0% instances)